You might want to consider the start of this tutorial.

Short introductions to other TF datasets:

Sharing data features

This tutorial is a companion to the Text-Fabric documentation on data sharing.

Explore additional data

The ETCBC has a few other repositories with data that work in conjunction with the BHSA data. One of them you have already seen: phono, for phonetic transcriptions. There is also parallels for detecting parallel passages, and valence for studying patterns around verbs that determine their meanings.

Make your own data

If you study the additional data, you can observe how that data is created and also how it is turned into a text-fabric data module. The last step is incredibly easy. You can write out every Python dictionary where the keys are numbers and the values string or numbers as a Text-Fabric feature. When you are creating data, you have already constructed those dictionaries, so writing them out is just one method call. See for example how the flowchart notebook in valence writes out verb sense data.

Share your new data

You can then easily share your new features on GitHub, so that your colleagues everywhere can try it out for themselves.

Here is how you draw in other data, for example

You can add such data on the fly, by passing a mod={org}/{repo}/{path} parameter, or a bunch of them separated by commas, or packed in a list or tuple.

If the data is there, it will be auto-downloaded and stored on your machine.

Let's do it.

In [1]:
%load_ext autoreload
%autoreload 2

Incantation

The ins and outs of installing Text-Fabric, getting the corpus, and initializing a notebook are explained in the start tutorial.

In [2]:
from tf.app import use

First we are going to include the work of Cody Kingham on heads of phrases and some earlier work by Janet Dyk and Dirk Roorda on verbal valence.

In [3]:
A = use('etcbc/bhsa', mod="etcbc/lingo/heads/tf,etcbc/valence/tf", hoist=globals())
TF-app: ~/text-fabric-data/etcbc/bhsa/app
data: ~/text-fabric-data/etcbc/bhsa/tf/2021
data: ~/text-fabric-data/etcbc/lingo/heads/tf/2021
data: ~/text-fabric-data/etcbc/valence/tf/2021
data: ~/text-fabric-data/etcbc/phono/tf/2021
data: ~/text-fabric-data/etcbc/parallels/tf/2021
This is Text-Fabric 9.2.3
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html

135 features found and 0 ignored
Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
Parallel Passages
int
🆗 links between similar passages
author:
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
coreData:
BHSA
dateWritten:
2021-12-09T14:40:46Z
provenance:
Parallels notebook, see https://github.com/ETCBC/parallels
version:
2021
writtenBy:
Text-Fabric
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis
str
✅ book name in Latin (Genesis; Numeri; Reges1; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:55Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ book name in amharic (ኣማርኛ)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:20:27Z
encoders:
Dirk Roorda (TF)
language:
ኣማርኛ
languageCode:
am
languageEnglish:
amharic
provenance:
book names from wikipedia and other sources
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ chapter number (1; 2; 3; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:55Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ identifier of a clause atom relationship (0; 74; 367; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:56Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
det
str
✅ determinedness of phrase(atom) (det; und; NA.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:56Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:57Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ frequency of lexemes
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:24:45Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
computed on the basis of the ETCBC core set of features
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:57Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:57Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:58Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ lexeme pointed-transliterated (B.:- R;>CIJT [email protected]@> >:ELOH ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:58Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:17:59Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word pointed-transliterated (B.:- R;>CI73JT [email protected]@74> >:ELOHI92JM)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:04Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:04Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
🆗 english translation of lexeme (beginning create god(s))
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:13Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
gn
str
✅ grammatical gender (m; f; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:05Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:06Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ of word or lexeme (Hebrew; Aramaic.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:13Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
lex
str
✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:14Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
ls
str
✅ lexical set, subclassification of part-of-speech (card; ques; mult)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
⚠️ named entity type (pers; mens; gens; topo; ppde.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
nme
str
✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:08Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
nu
str
✅ grammatical number (sg; du; pl; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:08Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ sequence number of an object within its context
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:09Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:22:50Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional paragraph file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
pdp
str
✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:10Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
pfm
str
✅ preformative consonantal-transliterated (absent; n/a; J, ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:11Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
prs
str
✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:11Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ pronominal suffix gender (m; f; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:11Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ pronominal suffix number (sg; du; pl; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:12Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ pronominal suffix person (p1; p2; p3; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:12Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
ps
str
✅ grammatical person (p1; p2; p3; NA; unknown.)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:12Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word pointed-transliterated masoretic reading correction
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:23:29Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional ketiv/qere file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ interword material -pointed-transliterated (Masoretic correction)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:23:29Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional ketiv/qere file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ interword material -pointed-transliterated (Masoretic correction)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:23:29Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional ketiv/qere file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ word pointed-Hebrew masoretic reading correction
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:23:29Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional ketiv/qere file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ ranking of lexemes based on freqnuecy
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:24:46Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
computed on the basis of the ETCBC core set of features
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:13Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
sp
str
✅ part-of-speech (art; verb; subs; nmpr, ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:16Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
st
str
✅ state of a noun (a (absolute); c (construct); e (emphatic).)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:14Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
tab
int
✅ clause atom: its level in the linguistic embedding
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:16Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ interword material pointed-transliterated (& 00 05 00_P ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:01Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ interword material pointed-Hebrew (־ ׃)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:01Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
txt
str
✅ text type of clause and surrounding (repetion of ? N D Q as in feature domain)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:16Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
typ
str
✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:16Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
uvf
str
✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:17Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vbe
str
✅ verbal ending consonantal-transliterated (n/a; W; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:17Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vbs
str
✅ root formation consonantal-transliterated (absent; n/a; H; ...)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:17Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
✅ verse number
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:18Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:16Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:17Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
provenance:
from additional lexicon file provided by the ETCBC
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vs
str
✅ verbal stem (qal; piel; hif; apel; pael)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:18Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vt
str
✅ verbal tense (perf; impv; wayq; infc)
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:18Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
none
✅ linguistic dependency between textual objects
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:18:22Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
none
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2021-12-09T14:21:17Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
2021
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
etcbc/lingo/heads/tf
none
coreData:
BHSA
coreVersion:
2021
created_by:
Cody Kingham
dateWritten:
2021-12-13T11:38:35Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
none
coreData:
BHSA
coreVersion:
2021
created_by:
Cody Kingham
dateWritten:
2021-12-13T11:38:36Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
none
coreData:
BHSA
coreVersion:
2021
created_by:
Cody Kingham
dateWritten:
2021-12-13T11:38:37Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
Phonetic Transcriptions
str
🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)
author:
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:25:55Z
provenance:
computed by the phono notebook, see https://github.com/ETCBC/phono
version:
2021
writtenBy:
Text-Fabric
str
🆗 interword material in phonological transcription
author:
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:25:55Z
provenance:
computed by the phono notebook, see https://github.com/ETCBC/phono
version:
2021
writtenBy:
Text-Fabric
etcbc/valence/tf
str
❗️ corrected phrase function, only present for phrases that were in a correction sheet
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:56Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ whether the phrase function has been manually corrected
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:56Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ constituent role main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:56Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ additional lexical characteristics
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:57Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ default value before enrichment logic has been applied
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:57Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ verbal function main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:57Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ whether the generated enrichment features have been manually changed
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:57Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ additional semantic characteristics
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:57Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ sense label of verb occurrences (d-; i.; -p; d-; ...)
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:39:56Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
str
❗️ verbal valence main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
dateWritten:
2021-12-09T14:38:58Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
provenance:
computed by the enrich and flowchart notebooks, see https://github.com/ETCBC/valence
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
version:
2021
writtenBy:
Text-Fabric
Text-Fabric API: names N F E L T S C TF directly usable

You see that the features from the etcbc/valence/tf and etcbc/lingo/heads/tf modules have been added to the mix.

ETCBC Valence

Click the triangle before etcbc/valence/tf to see what features have been contributed.

Note that edge features are in bold italic.

Let's find out more about sense.

You can start with clicking the triangle afte "sense str" above. It tells you where the feature comes from, and it shows you the context where it has been constructed. You might go there to see additional documentation.

But we can also dive directly into its data:

In [4]:
F.sense.freqList()
Out[4]:
(('--', 17941),
 ('d-', 9975),
 ('-p', 6537),
 ('-i', 3604),
 ('-c', 3231),
 ('dp', 1899),
 ('dc', 1002),
 ('di', 918),
 ('l.', 876),
 ('i.', 630),
 ('n.', 532),
 ('-b', 64),
 ('db', 61),
 ('c.', 57),
 ('k.', 54))

Which nodes have a sense feature?

In [5]:
{F.otype.v(n) for n in N.walk() if F.sense.v(n)}
Out[5]:
{'word'}
In [6]:
results = A.search(
    """
word sense
"""
)
  0.24s 47381 results

Let's show some of the rarer sense values:

In [7]:
results = A.search(
    """
word sense=k.
"""
)
  0.29s 54 results
In [8]:
A.table(results, end=5)
npword
1Genesis 4:17יִּקְרָא֙
2Genesis 13:16שַׂמְתִּ֥י
3Genesis 32:13שַׂמְתִּ֤י
4Genesis 34:31יַעֲשֶׂ֖ה
5Genesis 48:20יְשִֽׂמְךָ֣

If we do a pretty display, the sense feature shows up.

In [9]:
A.show(results, start=1, end=1, withNodes=True)

result 1

verse:1414485
sentence:1172591
clause:427946
phrase:652729
1943 וַ
phrase:652730
sense=d-
phrase:652731
phrase:652732
sentence:1172592
clause:427947
phrase:652733
1948 וַ
phrase:652734
sense=--
sentence:1172593
clause:427948
phrase:652735
1950 וַ
phrase:652736
sense=d-
phrase:652737
sentence:1172594
clause:427949
phrase:652738
1954 וַֽ
phrase:652739
sense=--
clause:427950
phrase:652740
sense=d-
phrase:652741
sentence:1172595
clause:427951
phrase:652742
1958 וַ
phrase:652743
sense=k.
phrase:652744

Lingo heads

If you click the triangle before etcbc/lingo/heads/tf you see what features it contributes. Unfortunately, the authors have not provided a description of this feature, but if you click on the triangle after heads none, you see where the feature comes from and who has made it.

Moreover, the fact that heads is in italics makes clear that it is an edge feature.

Let's use it in a query: Now, heads is an edge feature, we cannot directly make it visible in pretty displays, but we can use it in queries.

We also want to make the feature sense visible, so we mention the feature in the query, without restricting the results.

In [10]:
results = A.search(
    """
book book=Genesis
  chapter chapter=1
    clause
      phrase
      -heads> word sense*
"""
)
  0.87s 402 results
In [11]:
A.show(results, start=1, end=2)

result 1

book Genesis
book=Genesis
chapter Genesis 1
book=Genesischapter=1

result 2

book Genesis
book=Genesis
chapter Genesis 1
book=Genesischapter=1

Note how the words that are heads of their phrases are highlighted within their phrases.

Participants

Now we are going to add another promising module, provided by Christian Canu Højgaard, from this repo: participants.

Let's do it in the straightforward way:

In [12]:
A = use(
    'etcbc/bhsa',
    mod=(
        "etcbc/lingo/heads/tf",
        "etcbc/valence/tf",
        "ch-jensen/participants/actor/tf"
    ),
    hoist=globals(),
)
TF-app: ~/text-fabric-data/etcbc/bhsa/app
data: ~/text-fabric-data/etcbc/bhsa/tf/2021
data: ~/text-fabric-data/etcbc/lingo/heads/tf/2021
data: ~/text-fabric-data/etcbc/valence/tf/2021
The requested data is not available offline
	~/text-fabric-data/ch-jensen/participants/actor/tf not found
rate limit is 5000 requests per hour, with 5000 left for this hour
	connecting to online GitHub repo ch-jensen/participants ... connected
No directory actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dcWill try something else
	Failed
No directory actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dc	Failed
data: ~/text-fabric-data/etcbc/phono/tf/2021
data: ~/text-fabric-data/etcbc/parallels/tf/2021
There were problems with loading data.
The Text-Fabric API has not been loaded!
The app "etcbc/bhsa" will not work!

The features are not there!

If we have a look on Github in this repo we see under actor/tf the directory c only. Christian has produced his features against version c of the BHSA.

Ok, then we go back, and run our command for version c.

In [17]:
A = use(
    'etcbc/bhsa',
    version="c",
    mod=(
        "etcbc/lingo/heads/tf",
        "etcbc/valence/tf",
        "ch-jensen/participants/actor/tf"
    ),
    hoist=globals(),
)
TF-app: ~/text-fabric-data/etcbc/bhsa/app
data: ~/text-fabric-data/etcbc/bhsa/tf/c
data: ~/text-fabric-data/etcbc/lingo/heads/tf/c
data: ~/text-fabric-data/etcbc/valence/tf/c
data: ~/text-fabric-data/ch-jensen/participants/actor/tf/c
data: ~/text-fabric-data/etcbc/phono/tf/c
data: ~/text-fabric-data/etcbc/parallels/tf/c
This is Text-Fabric 9.2.3
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html

136 features found and 0 ignored
Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
Parallel Passages
int
author:
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:18:08Z
source:
Parallels Module
writtenBy:
Text-Fabric
ch-jensen/participants/actor/tf
str
Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491
coreData:
BHSA
coreVersion:
c
dateWritten:
2020-05-11T13:34:09Z
writtenBy:
Text-Fabric
str
Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491
coreData:
BHSA
coreVersion:
c
dateWritten:
2020-05-11T13:34:13Z
writtenBy:
Text-Fabric
none
Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491
coreData:
BHSA
coreVersion:
c
dateWritten:
2020-05-11T13:34:16Z
writtenBy:
Text-Fabric
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:15Z
encoders:
Dirk Roorda (TF)
language:
ኣማርኛ
languageCode:
am
languageEnglish:
amharic
provenance:
book names from wikipedia and other sources
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
det
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:15Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:19Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:14:58Z
encoders:
Dirk Roorda (TF)
provenance:
computed addition to core set of features
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:19Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:19Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:20Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:21Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:22Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:34Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:34Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2019-01-31T17:40:54Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
gn
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:35Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:37Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:51Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
lex
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:53Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:54Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
ls
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:55Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2019-01-31T17:40:54Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
nme
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:41Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
nu
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:42Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:43Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:56Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:13:35Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
pdp
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:46Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
pfm
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:46Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
prs
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:47Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:48Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:49Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:50Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
ps
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:50Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:13:50Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:13:50Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:13:50Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:13:50Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:15:00Z
encoders:
Dirk Roorda (TF)
provenance:
computed addition to core set of features
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:53Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
sp
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:57Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
st
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:54Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
tab
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:57Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:27Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:28Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
txt
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:58Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
typ
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:58Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
uvf
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:07:59Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vbe
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:00Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vbs
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:00Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
int
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:01Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2019-01-31T17:40:54Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2019-01-31T17:40:55Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vs
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:01Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
vt
str
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:02Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
none
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:08:09Z
encoders:
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
none
author:
Eep Talstra Centre for Bible and Computer
dataset:
BHSA
datasetName:
Biblia Hebraica Stuttgartensia Amstelodamensis
dateWritten:
2018-10-08T15:11:57Z
encoders:
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
version:
c
website:
https://shebanq.ancient-data.org
writtenBy:
Text-Fabric
etcbc/lingo/heads/tf
none
coreData:
BHSA
coreVersion:
c
created_by:
Cody Kingham
dateWritten:
2018-11-06T14:47:00Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
none
coreData:
BHSA
coreVersion:
c
created_by:
Cody Kingham
dateWritten:
2018-11-06T14:47:01Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
none
coreData:
BHSA
coreVersion:
c
created_by:
Cody Kingham
dateWritten:
2018-11-06T14:47:02Z
source:
see the notebook at https://github.com/etcbc/lingo/heads
writtenBy:
Text-Fabric
Phonetic Transcriptions
str
author:
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:16:04Z
source:
Phono Notebook applied to BHSA Data
writtenBy:
Text-Fabric
str
author:
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:16:04Z
source:
Phono Notebook applied to BHSA Data
writtenBy:
Text-Fabric
etcbc/valence/tf
str
corrected phrase function, only present for phrases that were in a correction sheet
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:06Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
whether the phrase function has been manually corrected
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:06Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
constituent role main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:07Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
additional lexical characteristics
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:07Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
default value before enrichment logic has been applied
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:08Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
verbal function main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:08Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
whether the generated enrichment features have been manually changed
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:09Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
additional semantic characteristics
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:09Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
str
sense label verb occurrences, computed by the flowchart algorithm, see https://github.com/ETCBC/valence/wiki/Legend
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:54Z
writtenBy:
Text-Fabric
str
verbal valence main classification
author:
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
coreData:
BHSA
coreVersion:
_temp
dateWritten:
2018-10-08T15:17:09Z
method:
Generated blank correction and enrichment spreadsheets with selected clauses
purpose:
Support the decision process of assigning valence to verbs
steps:
sheets filled out by researcher; read back in by program; generated new features based on contents
title:
Correction and enrichment features
writtenBy:
Text-Fabric
Text-Fabric API: names N F E L T S C TF directly usable

While this succeeded, there are scenoarios where you have more trouble. For example, you decide that you really, really need the bhsa data as in release 1.7.1.

Then you discover that this does note work:

A = use(
    'etcbc/bhsa',
    version="c",
    checkout="v1.7.1",
    mod=("etcbc/lingo/heads/tf" ,"etcbc/valence/tf", "ch-jensen/participants/actor/tf"), 
    hoist=globals(),
)

because the BHSA invokes two standard modules, etcbc/phono/tf and etcbc/parallels/tf and if you go to their GitHub repos, you see that they do not have a release v1.7.1. You have to walk through their releases and find one with the right data version. Having found them, you can then get it all like this:

A = use(
    'etcbc/bhsa',
    version="c",
    checkout="v1.7.1",
    mod=(
        "etcbc/phono/tf:1.2",
        "etcbc/parallels/tf:v1.2",
        "etcbc/lingo/heads/tf",
        "etcbc/valence/tf",
        "ch-jensen/participants/actor/tf",
    ),
    hoist=globals(),
)

Semantic actors

Let's find out about actor.

Again, we can click on the triangles and see information about the features. Christian has provided descriptions in the metadata of the features.

And we can look into the data itself.

In [18]:
fl = F.actor.freqList()
len(fl)
Out[18]:
415
In [19]:
fl[0:10]
Out[19]:
(('JHWH', 358),
 ('BN JFR>L', 205),
 ('>JC', 101),
 ('2sm"YOUSgmas"', 67),
 ('MCH', 60),
 ('>RY', 58),
 ('>TM', 45),
 ('>X "YOUSgmas"', 36),
 ('JFR>L', 35),
 ('KHN', 33))

Which nodes have an actor feature?

In [20]:
{F.otype.v(n) for n in N.walk() if F.actor.v(n)}
Out[20]:
{'phrase_atom', 'subphrase'}
In [21]:
results = A.search(
    """
phrase_atom actor
"""
)
  0.12s 2062 results

Let's show some of the rarer actor values:

In [22]:
results = A.search(
    """
phrase_atom actor=KHN
"""
)
  0.17s 30 results
In [23]:
A.table(results)
npphrase_atom
1Leviticus 17:5אֶל־הַכֹּהֵ֑ן
2Leviticus 17:6זָרַ֨ק
3Leviticus 17:6הַכֹּהֵ֤ן
4Leviticus 17:6הִקְטִ֣יר
5Leviticus 19:22כִפֶּר֩
6Leviticus 19:22הַכֹּהֵ֜ן
7Leviticus 21:1אֶל־הַכֹּהֲנִ֖ים
8Leviticus 21:1בְּנֵ֣י אַהֲרֹ֑ן
9Leviticus 21:5יִקְרְח֤וּ
10Leviticus 21:5יְגַלֵּ֑חוּ
11Leviticus 21:5יִשְׂרְט֖וּ
12Leviticus 21:6קְדֹשִׁ֤ים
13Leviticus 21:6יִהְיוּ֙
14Leviticus 21:6יְחַלְּל֔וּ
15Leviticus 21:6הֵ֥ם
16Leviticus 21:6מַקְרִיבִ֖ם
17Leviticus 21:6הָ֥יוּ
18Leviticus 21:6קֹֽדֶשׁ׃
19Leviticus 21:7יִקָּ֔חוּ
20Leviticus 21:7יִקָּ֑חוּ
21Leviticus 22:11כֹהֵ֗ן
22Leviticus 22:11יִקְנֶ֥ה
23Leviticus 22:14לַכֹּהֵ֖ן
24Leviticus 23:10אֶל־הַכֹּהֵֽן׃
25Leviticus 23:11הֵנִ֧יף
26Leviticus 23:11יְנִיפֶ֖נּוּ
27Leviticus 23:11הַכֹּהֵֽן׃
28Leviticus 23:20הֵנִ֣יף
29Leviticus 23:20הַכֹּהֵ֣ן׀
30Leviticus 23:20לַכֹּהֵֽן׃

We see no highlights! That is because phrase atoms are hidden by default. So let's unhide:

In [25]:
A.displaySetup(hiddenTypes="subphrase clause_atom sentence_atom half_verse")

The next calls to show() will work as if hiddenTypes="subphrase clause_atom sentence_atom half_verse" is passed to them.

In [26]:
A.show(results, start=1, end=1)

result 1

verse
sentence
clause
phrase
phrase_atom
actor=BN JFR>L
phrase
phrase
phrase_atom
actor=ZBX BN JFR>L
clause
phrase
phrase_atom
phrase
phrase_atom
actor=BN JFR>L
phrase
phrase_atom
actor=BN JFR>L
clause
phrase
phrase_atom
phrase
phrase_atom
actor=BN JFR>L
phrase
phrase_atom
actor=JHWH
phrase
phrase
phrase_atom
actor=KHN
clause
phrase
phrase_atom
phrase
phrase_atom
actor=BN JFR>L
phrase
phrase

We make the feature sense from the valence module visible:

In [27]:
A.show(results, start=1, end=3, withNodes=True, extraFeatures="sense")

result 1

verse:1417594
sentence:1181377
clause:439665
phrase:688387
phrase_atom:943218
phrase:688388
phrase_atom:943219
actor=BN JFR>L
phrase:688389
phrase_atom:943220
actor=BN JFR>L
phrase:688390
phrase_atom:943221
actor=ZBX BN JFR>L
clause:439666
phrase:688391
phrase_atom:943222
phrase:688392
phrase_atom:943223
actor=BN JFR>L
63102 הֵ֣ם
phrase:688393
phrase_atom:943224
actor=BN JFR>L
sense=-p
phrase:688394
phrase_atom:943225
clause:439667
phrase:688395
phrase_atom:943226
63108 וֶֽ
phrase:688396
phrase_atom:943227
actor=BN JFR>L
phrase:688397
phrase_atom:943228
actor=JHWH
phrase:688398
phrase_atom:943229
actor=PTX >HL MW<D
phrase:688399
phrase_atom:943230
actor=KHN
63116 אֶל־
63117 הַ
clause:439668
phrase:688400
phrase_atom:943231
63119 וְ
phrase:688401
phrase_atom:943232
actor=BN JFR>L
sense=n.
phrase:688402
phrase_atom:943233
phrase_atom:943234
actor=JHWH
phrase:688403
phrase_atom:943235

result 2

verse:1417595
sentence:1181378
clause:439669
phrase:688404
phrase_atom:943236
63126 וְ
phrase:688405
phrase_atom:943237
actor=KHN
sense=dp
phrase:688406
phrase_atom:943238
actor=KHN
phrase:688407
phrase_atom:943239
actor=DM
63130 אֶת־
63131 הַ
phrase:688408
phrase_atom:943240
actor=MZBX JHWH
phrase_atom:943241
actor=PTX >HL MW<D
sentence:1181379
clause:439670
phrase:688409
phrase_atom:943242
63139 וְ
phrase:688410
phrase_atom:943243
actor=KHN
phrase:688411
phrase_atom:943244
63141 הַ
phrase:688412
phrase_atom:943245
phrase_atom:943246
actor=JHWH

result 3

verse:1417595
sentence:1181378
clause:439669
phrase:688404
phrase_atom:943236
63126 וְ
phrase:688405
phrase_atom:943237
actor=KHN
sense=dp
phrase:688406
phrase_atom:943238
actor=KHN
phrase:688407
phrase_atom:943239
actor=DM
63130 אֶת־
63131 הַ
phrase:688408
phrase_atom:943240
actor=MZBX JHWH
phrase_atom:943241
actor=PTX >HL MW<D
sentence:1181379
clause:439670
phrase:688409
phrase_atom:943242
63139 וְ
phrase:688410
phrase_atom:943243
actor=KHN
phrase:688411
phrase_atom:943244
63141 הַ
phrase:688412
phrase_atom:943245
phrase_atom:943246
actor=JHWH

All together!

Here is a query that shows results with all features.

In [28]:
results = A.search(
    """
book book=Leviticus
  phrase sense*
    phrase_atom actor=KHN
  -heads> word
"""
)
  0.80s 30 results
In [29]:
A.displaySetup(
    condensed=True,
    condenseType="verse",
    hiddenTypes="subphrase clause_atom sentence_atom half_verse",
)
A.show(results, start=8, end=8)
A.displaySetup()

verse 8

verse
book=Leviticus
sentence
clause
phrase
phrase_atom
phrase
phrase_atom
actor=KHN
clause
phrase
phrase_atom
phrase
phrase_atom
actor=KHN
phrase
phrase_atom
actor=NPC_3
clause
phrase
phrase_atom
actor=NPC_3
phrase
phrase_atom
actor=NPC_3
phrase
phrase_atom
sentence
clause
phrase
phrase_atom
phrase
phrase_atom
actor=JLJD BJT KHN
clause
phrase
phrase_atom
actor=JLJD BJT KHN
phrase
phrase_atom
actor=JLJD BJT KHN
phrase

Exercise

See whether you can find the quote in the Easter egg that is in etcbc/lingo/easter/tf !

All steps

  • start your first step in mastering the bible computationally
  • display become an expert in creating pretty displays of your text structures
  • search turbo charge your hand-coding with search templates
  • exportExcel make tailor-made spreadsheets out of your results
  • share draw in other people's data and let them use yours
  • export export your dataset as an Emdros database
  • annotate annotate plain text by means of other tools and import the annotations as TF features
  • map map somebody else's annotations to a new version of the corpus
  • volumes work with selected books only
  • trees work with the BHSA data as syntax trees

CC-BY Dirk Roorda