You might want to consider the start of this tutorial first. Short introductions to other TF datasets are available as well.
%load_ext autoreload
%autoreload 2
import os
from tf.app import use
from tf.fabric import Fabric
from tf.volumes import extract, collect
from tf.core.files import unexpanduser as ux
GH = os.path.expanduser("~/github")
BH = f"{GH}/ETCBC/bhsa"
VERSION = "2021"
SOURCE = f"{BH}/tf/{VERSION}"
TARGET = f"{BH}/tf/{VERSION}/_local"
We use the Hebrew Bible as our work. The volumes of a work are lists of its top-level sections; volumes may have a name.
We define three volumes out of the smallest books of the Bible:
VOLUMES = dict(
tiny=("Obadiah", "Nahum", "Haggai", "Habakkuk", "Jonah", "Micah"),
small=("Malachi", "Joel"),
medium=("Ezra",),
)
COLLECTION = "prophets"
We can work with works through several TF APIs: A = use(work), TF = Fabric(locations, modules), and the functions extract() and collect().
We show all ways of doing it.
A = use()
If we load the BHSA with the advanced API, as in A = use("ETCBC/bhsa", ...), we also get some standard modules, such as phono for phonological transcriptions and parallels for cross-references between similar passages.
When we split the BHSA into volumes, these features end up in the volumes as well.
We load the BHSA in the advanced way:
Aw = use("ETCBC/bhsa:clone", checkout="clone")
Locating corpus resources ...
Name | # of nodes | # slots/node | % coverage |
---|---|---|---|
book | 39 | 10938.21 | 100 |
chapter | 929 | 459.19 | 100 |
lex | 9230 | 46.22 | 100 |
verse | 23213 | 18.38 | 100 |
half_verse | 45179 | 9.44 | 100 |
sentence | 63717 | 6.70 | 100 |
sentence_atom | 64514 | 6.61 | 100 |
clause | 88131 | 4.84 | 100 |
clause_atom | 90704 | 4.70 | 100 |
phrase | 253203 | 1.68 | 100 |
phrase_atom | 267532 | 1.59 | 100 |
subphrase | 113850 | 1.42 | 38 |
word | 426590 | 1.00 | 100 |
We check that the features of interest are loaded:
Aw.isLoaded(features="lex phono crossref")
crossref edge (int) 🆗 links between similar passages
lex node (str) ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)
phono node (str) 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)
We can now extract volumes by using the extract() method on the app object, which is held in the variable Aw.

Note: we are going to load several volumes and collections too, so instead of storing the handle to the API in a variable named A, we choose one named Aw.
volumes = Aw.extract(VOLUMES, overwrite=True)
0.00s Check volumes ... | Volume tiny exists and will be recreated | Volume small exists and will be recreated | Volume medium exists and will be recreated | Work consists of 39 books: | book Genesis : with 28764 slots | book Exodus : with 23748 slots | book Leviticus : with 17099 slots | book Numbers : with 23188 slots | book Deuteronomy : with 20128 slots | book Joshua : with 14526 slots | book Judges : with 14086 slots | book 1_Samuel : with 18929 slots | book 2_Samuel : with 15612 slots | book 1_Kings : with 18685 slots | book 2_Kings : with 17307 slots | book Isaiah : with 22931 slots | book Jeremiah : with 29736 slots | book Ezekiel : with 26182 slots | book Hosea : with 3146 slots | book Joel : with 1318 slots | book Amos : with 2780 slots | book Obadiah : with 392 slots | book Jonah : with 985 slots | book Micah : with 1895 slots | book Nahum : with 746 slots | book Habakkuk : with 897 slots | book Zephaniah : with 1037 slots | book Haggai : with 877 slots | book Zechariah : with 4471 slots | book Malachi : with 1187 slots | book Psalms : with 25372 slots | book Job : with 10912 slots | book Proverbs : with 8859 slots | book Ruth : with 1802 slots | book Song_of_songs : with 1682 slots | book Ecclesiastes : with 4233 slots | book Lamentations : with 1945 slots | book Esther : with 4621 slots | book Daniel : with 8072 slots | book Ezra : with 5268 slots | book Nehemiah : with 7842 slots | book 1_Chronicles : with 15566 slots | book 2_Chronicles : with 19764 slots 0.09s volumes ok 0.09s Distribute nodes over volumes ... | 0.00s volume tiny ... | | 0.00s book Obadiah with 392 slots | | 0.00s book Nahum with 746 slots | | 0.00s book Haggai with 877 slots | | 0.00s book Habakkuk with 897 slots | | 0.00s book Jonah with 985 slots | | 0.00s book Micah with 1895 slots | 0.01s volume tiny with 5792 slots and 21779 nodes ... | 0.01s volume small ... | | 0.00s book Malachi with 1187 slots | | 0.00s book Joel with 1318 slots | 0.01s volume small with 2505 slots and 9495 nodes ... | 0.01s volume medium ... | | 0.00s book Ezra with 5268 slots | 0.02s volume medium with 5268 slots and 17286 nodes ... 0.11s distribution done 0.11s Remap features ... | 0.00s volume tiny with 21779 nodes ... | 0.17s volume small with 9495 nodes ... | 0.24s volume medium with 17286 nodes ... 0.45s remapping done 0.45s Write volumes as TF datasets | 0.00s Writing volume tiny | 0.14s Writing volume small | 0.20s Writing volume medium 0.77s writing done 0.77s All done
The extract() method returns basic information about the volumes: their location on disk.
if volumes:
for (name, info) in volumes.items():
loc = info["location"]
new = "(new) " if info["new"] else "(existing)"
print(f"volume {name:<7}: {new} at {ux(loc)}")
else:
print(volumes)
volume medium : (new) at ~/github/ETCBC/bhsa/tf/2021/_local/medium
volume small  : (new) at ~/github/ETCBC/bhsa/tf/2021/_local/small
volume tiny   : (new) at ~/github/ETCBC/bhsa/tf/2021/_local/tiny
We load the volumes separately.
For each volume we get a handle, which we store in a dictionary As, keyed by the volume name.
As = {}
for name in volumes:
As[name] = use("ETCBC/bhsa:clone", checkout="clone", version="2021", volume=name)
Locating corpus resources ...
Name | # of nodes | # slots/node | % coverage |
---|---|---|---|
book | 1 | 5268.00 | 100 |
chapter | 10 | 526.80 | 100 |
verse | 280 | 18.81 | 100 |
sentence | 491 | 10.73 | 100 |
half_verse | 492 | 10.71 | 100 |
sentence_atom | 506 | 10.41 | 100 |
clause | 824 | 6.39 | 100 |
clause_atom | 870 | 6.06 | 100 |
lex | 991 | 5.32 | 100 |
phrase | 2385 | 2.21 | 100 |
phrase_atom | 2730 | 1.93 | 100 |
subphrase | 2438 | 1.40 | 65 |
word | 5268 | 1.00 | 100 |
Locating corpus resources ...
Name | # of nodes | # slots/node | % coverage |
---|---|---|---|
book | 2 | 1252.50 | 100 |
chapter | 7 | 357.86 | 100 |
verse | 128 | 19.57 | 100 |
half_verse | 253 | 9.90 | 100 |
sentence | 450 | 5.57 | 100 |
sentence_atom | 461 | 5.43 | 100 |
clause | 582 | 4.30 | 100 |
lex | 587 | 4.27 | 100 |
clause_atom | 600 | 4.17 | 100 |
phrase | 1641 | 1.53 | 100 |
phrase_atom | 1681 | 1.49 | 100 |
subphrase | 598 | 1.36 | 32 |
word | 2505 | 1.00 | 100 |
Locating corpus resources ...
Name | # of nodes | # slots/node | % coverage |
---|---|---|---|
book | 6 | 965.33 | 100 |
chapter | 20 | 289.60 | 100 |
verse | 315 | 18.39 | 100 |
half_verse | 623 | 9.30 | 100 |
sentence | 1032 | 5.61 | 100 |
sentence_atom | 1046 | 5.54 | 100 |
lex | 1173 | 4.94 | 100 |
clause | 1399 | 4.14 | 100 |
clause_atom | 1426 | 4.06 | 100 |
phrase | 3774 | 1.53 | 100 |
phrase_atom | 3911 | 1.48 | 100 |
subphrase | 1262 | 1.30 | 28 |
word | 5792 | 1.00 | 100 |
We see it reported that single volumes have been loaded instead of the whole work.
The volume info can also be obtained separately by reading the attribute volumeInfo, either on the A object or on the TF object:
for name in volumes:
print(As[name].volumeInfo)
medium:Ezra
small:Malachi-Joel
tiny:Obadiah-Nahum-Haggai-Habakkuk-Jonah-Micah
When volumes are created, some extra features are generated, which have to do with the relation between the original work and the volume, and what happens at the boundaries of volumes.
for name in volumes:
print(name)
for (feat, info) in As[name].isLoaded("owork ointerfrom ointerto", pretty=False).items():
print(f"\t{feat}: {info['meta']['description']}")
medium
	owork: mapping from nodes in the volume to nodes in the work
	ointerfrom: all outgoing inter-volume edges
	ointerto: all incoming inter-volume edges
small
	owork: mapping from nodes in the volume to nodes in the work
	ointerfrom: all outgoing inter-volume edges
	ointerto: all incoming inter-volume edges
tiny
	owork: mapping from nodes in the volume to nodes in the work
	ointerfrom: all outgoing inter-volume edges
	ointerto: all incoming inter-volume edges
owork
Note that each volume has an extra feature: owork. Its value for each node in a volume dataset is the corresponding node in the original work from which the volume is taken.
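A tiny illustration (a sketch, using the volume handles in As defined above): slot 1 of volume tiny is its first word, which lies in Obadiah, and owork tells us which node that word is in the full work.

Ft = As["tiny"].api.F
# owork gives, for a node in the volume, its node number in the full work
print(Ft.owork.v(1))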
If you use the volume to compute annotations,
and you want to publish these annotations against the original work,
the feature owork
provides the necessary information to do so.
Suppose annotvx is a dict, mapping some nodes in the volume x to interesting values.
Then you can apply them to the original work as follows:

{F.owork.v(n): value for (n, value) in annotvx.items()}
There is another important function of owork: when collecting volumes, we may encounter nodes in different volumes that come from a single node in the work. We want to merge these nodes in the collected work, and owork provides the information needed for that.
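We can see this sharing concretely with a small sketch over the volume handles in As: group the lexeme nodes of all volumes by the work node that owork assigns them; every group with more than one member is a lexeme that occurs in several volumes and will be merged upon collecting.

fromWork = {}
for (name, Av) in As.items():
    Fv = Av.api.F
    for lx in Fv.otype.s("lex"):
        # group volume lexeme nodes by their node in the original work
        fromWork.setdefault(Fv.owork.v(lx), []).append((name, lx))
shared = {w: members for (w, members) in fromWork.items() if len(members) > 1}
print(f"{len(shared)} lexemes occur in more than one volume")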
ointerto, ointerfrom
Note that we do have the features ointerto and ointerfrom.
We'll come back to them later.
We can collect volumes into new works by means of the collect() method on Aw.
Let's collect all volumes just created.
Aw.collect(
tuple(volumes),
COLLECTION,
overwrite=True,
)
Collection prophets exists and will be recreated 0.00s Loading volume medium from ~/github/ETCBC/bhsa/tf/2021/_local/medium ... 0.03s Feature overview: 85 for nodes; 3 for edges; 2 configs; 9 computed 0.05s Loading volume small from ~/github/ETCBC/bhsa/tf/2021/_local/small ... 0.02s Feature overview: 85 for nodes; 3 for edges; 2 configs; 9 computed 0.08s Loading volume tiny from ~/github/ETCBC/bhsa/tf/2021/_local/tiny ... 0.04s Feature overview: 85 for nodes; 3 for edges; 2 configs; 9 computed 0.14s inspect metadata ... 0.14s metadata sorted out 0.14s check nodetypes ... | volume medium | volume small | volume tiny 0.14s node types ok 0.14s Collect nodes from volumes ... | 0.00s Check against overlapping slots ... | | medium : 5268 slots | | small : 2505 slots | | tiny : 5792 slots | 0.01s no overlap | 0.01s Group non-slot nodes by type | | medium : 5269- 17286 | | small : 2506- 9495 | | tiny : 5793- 21779 | 0.01s Mapping nodes from volume to/from work ... | | book : 13566 - 13574 | | chapter : 13575 - 13611 | | clause : 13612 - 16416 | | clause_atom : 16417 - 19312 | | half_verse : 19313 - 20680 | | phrase : 20681 - 28480 | | phrase_atom : 28481 - 36802 | | sentence : 36803 - 38775 | | sentence_atom : 38776 - 40788 | | subphrase : 40789 - 45086 | | verse : 45087 - 45809 | | lex : 45810 - 47884 | 0.02s The new work has 47884 nodes of which 13565 slots 0.17s collection done 0.17s remap features ... 0.42s remapping done 0.42s write work as TF data set 0.72s writing done 0.72s done
True
We can load the collection in the same way as a volume, but now using the collection= parameter:
Ac = use("ETCBC/bhsa:clone", checkout="clone", version="2021", collection=COLLECTION)
Locating corpus resources ...
| 0.03s T otype from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.30s T oslots from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@ar from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@he from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T lex from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T qere_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T qere from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T chapter from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T phono from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T g_word from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@ur from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@yo from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@pt from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T verse from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@en from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@am from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T trailer from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@tr from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T g_lex from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@da from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T phono_trailer from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@el from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T voc_lex_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T qere_trailer from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T qere_trailer_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@bn from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@hi from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@ru from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T lex_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@de from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@fa from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T g_word_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@ja from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T g_lex_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@nl from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@id from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@syc from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T g_cons_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@fr from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@pa from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@es from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T g_cons from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@sw from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T trailer_utf8 from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@zh from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@ko from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T book@la from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | | 0.01s C __levels__ from otype, oslots, otext | | 0.23s C __order__ from otype, oslots, __levels__ | | 0.01s C __rank__ from otype, __order__ | | 0.50s C __levUp__ from otype, oslots, __rank__ | | 0.32s C __levDown__ from otype, __levUp__, __rank__ | | 0.04s C __characters__ from otext | | 0.10s C __boundary__ from otype, oslots, __rank__ | | 0.00s C __sections__ from otype, oslots, otext, __levUp__, __levels__, book, chapter, verse | 0.01s T code from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T crossref from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T det from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.01s T domain from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T freq_lex from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.02s T function from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T gloss from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T gn from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.01s T label from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T language from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T ls from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T mother from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.00s T nametype from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T nme from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T nu from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.07s T number from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.13s T ovolume from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.09s T owork from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.01s T pargr from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T pdp from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T pfm from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T prs from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T prs_gn from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T prs_nu from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T prs_ps from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T ps from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T rank_lex from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.05s T rela from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T sp from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T st from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.01s T tab from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.01s T txt from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.05s T typ from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T uvf from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T vbe from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T vbs from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.04s T voc_lex from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T vs from ~/github/ETCBC/bhsa/tf/2021/_local/prophets | 0.03s T vt from ~/github/ETCBC/bhsa/tf/2021/_local/prophets
Name | # of nodes | # slots/node | % coverage |
---|---|---|---|
book | 9 | 1507.22 | 100 |
chapter | 37 | 366.62 | 100 |
verse | 723 | 18.76 | 100 |
half_verse | 1368 | 9.92 | 100 |
sentence | 1973 | 6.88 | 100 |
sentence_atom | 2013 | 6.74 | 100 |
clause | 2805 | 4.84 | 100 |
clause_atom | 2896 | 4.68 | 100 |
lex | 2075 | 3.38 | 52 |
phrase | 7800 | 1.74 | 100 |
phrase_atom | 8322 | 1.63 | 100 |
subphrase | 4298 | 1.36 | 43 |
word | 13565 | 1.00 | 100 |
Which books have we got?
Fc = Ac.api.F
Tc = Ac.api.T
for b in Fc.otype.s("book"):
print(Tc.sectionFromNode(b)[0])
Ezra
Malachi
Joel
Obadiah
Nahum
Haggai
Habakkuk
Jonah
Micah
b = Tc.nodeFromSection(("Obadiah",))
Tc.text(b, fmt="text-trans-plain")
'XZWN <BDJH KH&>MR >DNJ JHWH L>DWM CMW<H CM<NW M>T JHWH WYJR BGWJM CLX QWMW WNQWMH <LJH LMLXMH00 HNH QVN NTTJK BGWJM BZWJ >TH M>D00 ZDWN LBK HCJ>K CKNJ BXGWJ&SL< MRWM CBTW >MR BLBW MJ JWRDNJ >RY00 >M&TGBJH KNCR W>M&BJN KWKBJM FJM QNK MCM >WRJDK N>M&JHWH00 >M&GNBJM B>W&LK >M&CWDDJ LJLH >JK NDMJTH HLW> JGNBW DJM >M&BYRJM B>W LK HLW> JC>JRW <LLWT00 >JK NXPFW <FW NB<W MYPNJW00 <D&HGBWL CLXWK KL >NCJ BRJTK HCJ>WK JKLW LK >NCJ CLMK LXMK JFJMW MZWR TXTJK >JN TBWNH BW00 HLW> BJWM HHW> N>M JHWH WH>BDTJ XKMJM M>DWM WTBWNH MHR <FW00 WXTW GBWRJK TJMN LM<N JKRT&>JC MHR <FW MQVL00 MXMS >XJK J<QB TKSK BWCH WNKRT L<WLM00 BJWM <MDK MNGD BJWM CBWT ZRJM XJLW WNKRJM B>W C<RW W<L&JRWCLM JDW GWRL GM&>TH K>XD MHM00 W>L&TR> BJWM&>XJK BJWM NKRW W>L&TFMX LBNJ&JHWDH BJWM >BDM W>L&TGDL PJK BJWM YRH00 >L&TBW> BC<R&<MJ BJWM >JDM >L&TR> GM&>TH BR<TW BJWM >JDW W>L&TCLXNH BXJLW BJWM >JDW00 W>L&T<MD <L&HPRQ LHKRJT >T&PLJVJW W>L&TSGR FRJDJW BJWM YRH00 KJ&QRWB JWM&JHWH <L&KL&HGWJM K>CR <FJT J<FH LK GMLK JCWB BR>CK00 KJ K>CR CTJTM <L&HR QDCJ JCTW KL&HGWJM TMJD WCTW WL<W WHJW KLW> HJW00 WBHR YJWN THJH PLJVH WHJH QDC WJRCW BJT J<QB >T MWRCJHM00 WHJH BJT&J<QB >C WBJT JWSP LHBH WBJT <FW LQC WDLQW BHM W>KLWM WL>&JHJH FRJD LBJT <FW KJ JHWH DBR00 WJRCW HNGB >T&HR <FW WHCPLH >T&PLCTJM WJRCW >T&FDH >PRJM W>T FDH CMRWN WBNJMN >T&HGL<D00 WGLT HXL&HZH LBNJ JFR>L >CR&KN<NJM <D&YRPT WGLT JRWCLM >CR BSPRD JRCW >T <RJ HNGB00 W<LW MWC<JM BHR YJWN LCPV >T&HR <FW WHJTH LJHWH HMLWKH00 '
First we count the lexeme nodes in the original work, as far as they occur in the books contained in the volumes.
books = set()
for parts in VOLUMES.values():
books |= set(parts)
books
{'Ezra', 'Habakkuk', 'Haggai', 'Joel', 'Jonah', 'Malachi', 'Micah', 'Nahum', 'Obadiah'}
lexNodesWork = set()
Fw = Aw.api.F
Tw = Aw.api.T
Lw = Aw.api.L
for b in Fw.otype.s("book"):
if Tw.sectionFromNode(b)[0] not in books:
continue
for w in Lw.d(b, otype="word"):
lx = Lw.u(w, otype="lex")[0]
lexNodesWork.add(lx)
len(lexNodesWork)
2075
Let's count the lexeme nodes in the individual volumes and add up the numbers. Each volume has its own lexemes, so lexemes that occur in multiple volumes correspond to multiple nodes. We expect more lexeme nodes.
total = 0
for (name, Av) in As.items():
Fv = Av.api.F
nLex = len(Fv.otype.s("lex"))
total += nLex
print(f"{name:<10} has {nLex:>5} lexeme nodes")
print(f"{'Total':<10} {total:>5} lexeme nodes")
medium     has   991 lexeme nodes
small      has   587 lexeme nodes
tiny       has  1173 lexeme nodes
Total       2751 lexeme nodes
Now let's count the lexemes in the new collection.
lexNodesCollection = Fc.otype.s("lex")
len(lexNodesCollection)
2075
Exactly the same amount as in the original work.
Let's make absolutely sure that we have the same lexeme set:
lexNodesWork == lexNodesCollection
False
Of course, because the node numbers in the original work are almost guaranteed to be different from the node numbers in the collection.
But the information attached to the nodes in the collection should be identical to the information attached to the corresponding nodes in the work.
lexemesWork = {Fw.lex.v(n) for n in lexNodesWork}
lexemesCollection = {Fc.lex.v(n) for n in lexNodesCollection}
lexemesWork == lexemesCollection
True
Another way of verifying this is to map the lexeme nodes of the collection back to those of the work and see whether they are equal sets.
lexNodesCollectionToWork = {Fc.owork.v(n) for n in lexNodesCollection}
lexNodesWork == lexNodesCollectionToWork
True
crossrefs
The edge feature crossref has inter-volume edges.
We explore the situation in the original work, inside the volumes, and in the new collection.
We count the incoming and outgoing edges with respect to the nodes in the relevant material.
crossref edges run between verses, so we first collect all relevant verses in the original work.
We want the verses in all the books of all the volumes, and we want those verses per volume.
books = dict(all=set())
for (name, parts) in VOLUMES.items():
partsSet = set(parts)
books[name] = partsSet
books["all"] |= partsSet
books
{'all': {'Ezra', 'Habakkuk', 'Haggai', 'Joel', 'Jonah', 'Malachi', 'Micah', 'Nahum', 'Obadiah'}, 'tiny': {'Habakkuk', 'Haggai', 'Jonah', 'Micah', 'Nahum', 'Obadiah'}, 'small': {'Joel', 'Malachi'}, 'medium': {'Ezra'}}
verseNodesWork = {}
for (name, heads) in books.items():
for b in Fw.otype.s("book"):
if Tw.sectionFromNode(b)[0] not in heads:
continue
for vs in Lw.d(b, otype="verse"):
verseNodesWork.setdefault(name, set()).add(vs)
for (name, verses) in verseNodesWork.items():
print(f"{name:<10} {len(verses):>3} verses")
all        723 verses
tiny       315 verses
small      128 verses
medium     280 verses
Now we determine the number of incoming and outgoing edges w.r.t. these portions, and we split them into inter-portion and intra-portion edges.
Ew = Aw.api.E
incomingWorkTotal = {}
incomingWorkIntra = {}
incomingWorkInter = {}
outgoingWorkTotal = {}
outgoingWorkIntra = {}
outgoingWorkInter = {}
for (name, verses) in verseNodesWork.items():
inct = set()
inca = set()
incr = set()
ougt = set()
ouga = set()
ougr = set()
for vs in verses:
wvs = Ew.crossref.t(vs)
if wvs:
for wv in wvs:
ws = wv[0]
inct.add((ws, vs))
if ws in verses:
inca.add((ws, vs))
else:
incr.add((ws, vs))
wvs = Ew.crossref.f(vs)
if wvs:
for wv in wvs:
ws = wv[0]
ougt.add((vs, ws))
if ws in verses:
ouga.add((vs, ws))
else:
ougr.add((vs, ws))
incomingWorkTotal[name] = inct
incomingWorkIntra[name] = inca
incomingWorkInter[name] = incr
outgoingWorkTotal[name] = ougt
outgoingWorkIntra[name] = ouga
outgoingWorkInter[name] = ougr
for name in verseNodesWork:
print(f"{name:<10}: total: incoming: {len(incomingWorkTotal[name]):>3} outgoing: {len(outgoingWorkTotal[name]):>3}")
print(f"{name:<10}: intra: incoming: {len(incomingWorkIntra[name]):>3} outgoing: {len(outgoingWorkIntra[name]):>3}")
print(f"{name:<10}: inter: incoming: {len(incomingWorkInter[name]):>3} outgoing: {len(outgoingWorkInter[name]):>3}")
all       : total: incoming: 400 outgoing: 400
all       : intra: incoming:  64 outgoing:  64
all       : inter: incoming: 336 outgoing: 336
tiny      : total: incoming: 245 outgoing: 245
tiny      : intra: incoming:   8 outgoing:   8
tiny      : inter: incoming: 237 outgoing: 237
small     : total: incoming:   3 outgoing:   3
small     : intra: incoming:   0 outgoing:   0
small     : inter: incoming:   3 outgoing:   3
medium    : total: incoming: 152 outgoing: 152
medium    : intra: incoming:  56 outgoing:  56
medium    : inter: incoming:  96 outgoing:  96
Ah, the crossref edges are symmetric, so there are as many incoming as outgoing edges.
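We can verify that symmetry quickly (a sketch on the sets just computed): reversing every outgoing pair should reproduce the incoming pairs exactly.

# if crossref is symmetric, the reversed outgoing pairs are the incoming pairs
mirrored = {(t, f) for (f, t) in outgoingWorkTotal["all"]}
print(mirrored == incomingWorkTotal["all"])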
Inside the volumes we only see the intra edges; they should coincide with the incomingWorkIntra[volume] edges.
First the number of edges:
incomingVolumeTotal = {}
outgoingVolumeTotal = {}
for name in volumes:
Av = As[name]
Fv = Av.api.F
Ev = Av.api.E
verses = Fv.otype.s("verse")
inct = set()
ougt = set()
for vs in verses:
wvs = Ev.crossref.t(vs)
if wvs:
for wv in wvs:
ws = wv[0]
inct.add((ws, vs))
wvs = Ev.crossref.f(vs)
if wvs:
for wv in wvs:
ws = wv[0]
ougt.add((vs, ws))
incomingVolumeTotal[name] = inct
outgoingVolumeTotal[name] = ougt
We have gathered the data.
Now we make the comparisons: first the number of edges, then the identity of the edges, modulo the node mapping.
for name in volumes:
Av = As[name]
Fv = Av.api.F
inVolTotal = incomingVolumeTotal[name]
outVolTotal = outgoingVolumeTotal[name]
inWorkIntra = incomingWorkIntra[name]
outWorkIntra = outgoingWorkIntra[name]
print(f"{name:<10}: total: incoming: {len(inVolTotal):>3} outgoing: {len(outVolTotal):>3}")
eqamountIncoming = len(inWorkIntra) == len(inVolTotal)
eqamountOutgoing = len(outWorkIntra) == len(outVolTotal)
print(f"equal amount of incoming inter-edges as in work? {eqamountIncoming}")
print(f"equal amount of outgoing inter-edges as in work? {eqamountOutgoing}")
inVolToWork = {(Fv.owork.v(f), Fv.owork.v(t)) for (f, t) in inVolTotal}
outVolToWork = {(Fv.owork.v(f), Fv.owork.v(t)) for (f, t) in outVolTotal}
sameIncoming = inWorkIntra == inVolToWork
sameOutgoing = outWorkIntra == outVolToWork
print(f"same incoming inter-edges as in work? {sameIncoming}")
print(f"same outgoing inter-edges as in work? {sameOutgoing}")
medium    : total: incoming:  56 outgoing:  56
equal amount of incoming inter-edges as in work? True
equal amount of outgoing inter-edges as in work? True
same incoming inter-edges as in work? True
same outgoing inter-edges as in work? True
small     : total: incoming:   0 outgoing:   0
equal amount of incoming inter-edges as in work? True
equal amount of outgoing inter-edges as in work? True
same incoming inter-edges as in work? True
same outgoing inter-edges as in work? True
tiny      : total: incoming:   8 outgoing:   8
equal amount of incoming inter-edges as in work? True
equal amount of outgoing inter-edges as in work? True
same incoming inter-edges as in work? True
same outgoing inter-edges as in work? True
The final test is whether the collection has the right edges.
When the collection was created, inter-volume edges were added on the basis of the ointerto and ointerfrom features in the individual volumes.
Now we check whether that went well.
Ec = Ac.api.E
verses = Fc.otype.s("verse")
inct = set()
ougt = set()
for vs in verses:
wvs = Ec.crossref.t(vs)
if wvs:
for wv in wvs:
ws = wv[0]
inct.add((ws, vs))
wvs = Ec.crossref.f(vs)
if wvs:
for wv in wvs:
ws = wv[0]
ougt.add((vs, ws))
incomingCollectionTotal = inct
outgoingCollectionTotal = ougt
We have gathered the data.
Now we make the comparisons: first the number of edges, then the identity of the edges, modulo the node mapping.
inColTotal = incomingCollectionTotal
outColTotal = outgoingCollectionTotal
inWorkIntra = incomingWorkIntra["all"]
outWorkIntra = outgoingWorkIntra["all"]
print(f"collection: total: incoming: {len(inColTotal):>3} outgoing: {len(outColTotal):>3}")
eqamountIncoming = len(inWorkIntra) == len(inColTotal)
eqamountOutgoing = len(outWorkIntra) == len(outColTotal)
print(f"equal amount of incoming inter-edges as in work? {eqamountIncoming}")
print(f"equal amount of outgoing inter-edges as in work? {eqamountOutgoing}")
inColToWork = {(Fc.owork.v(f), Fc.owork.v(t)) for (f, t) in inColTotal}
outColToWork = {(Fc.owork.v(f), Fc.owork.v(t)) for (f, t) in outColTotal}
sameIncoming = inWorkIntra == inColToWork
sameOutgoing = outWorkIntra == outColToWork
print(f"same incoming inter-edges as in work? {sameIncoming}")
print(f"same outgoing inter-edges as in work? {sameOutgoing}")
collection: total: incoming:  64 outgoing:  64
equal amount of incoming inter-edges as in work? True
equal amount of outgoing inter-edges as in work? True
same incoming inter-edges as in work? True
same outgoing inter-edges as in work? True
We have seen that when we collect volumes, the identification of lexeme nodes of the same lexeme across volumes works out perfectly.
The collection of inter-volume edges works as well!
TF=Fabric()
We now load the data through Fabric().
You do not have to load the work before extracting volumes, but you may do so. The advantage of pre-loading is that after the extraction of volumes you still have a handle to the work.
TFw = Fabric(locations=SOURCE)
apiw = TFw.loadAll()
apiw.makeAvailableIn(globals())
2.20s Feature overview: 109 for nodes; 6 for edges; 1 configs; 9 computed
[('Computed', 'computed-data', ('C Computed', 'Call AllComputeds', 'Cs ComputedString')), ('Features', 'edge-features', ('E Edge', 'Eall AllEdges', 'Es EdgeString')), ('Fabric', 'loading', ('TF',)), ('Locality', 'locality', ('L Locality',)), ('Nodes', 'navigating-nodes', ('N Nodes',)), ('Features', 'node-features', ('F Feature', 'Fall AllFeatures', 'Fs FeatureString')), ('Search', 'search', ('S Search',)), ('Text', 'text', ('T Text',))]
We use the same specification as before.
volumes = TFw.extract(VOLUMES, overwrite=True)
0.00s Check volumes ... | Volume tiny exists and will be recreated | Volume small exists and will be recreated | Volume medium exists and will be recreated | Work consists of 39 books: | book Genesis : with 28764 slots | book Exodus : with 23748 slots | book Leviticus : with 17099 slots | book Numbers : with 23188 slots | book Deuteronomy : with 20128 slots | book Joshua : with 14526 slots | book Judges : with 14086 slots | book 1_Samuel : with 18929 slots | book 2_Samuel : with 15612 slots | book 1_Kings : with 18685 slots | book 2_Kings : with 17307 slots | book Isaiah : with 22931 slots | book Jeremiah : with 29736 slots | book Ezekiel : with 26182 slots | book Hosea : with 3146 slots | book Joel : with 1318 slots | book Amos : with 2780 slots | book Obadiah : with 392 slots | book Jonah : with 985 slots | book Micah : with 1895 slots | book Nahum : with 746 slots | book Habakkuk : with 897 slots | book Zephaniah : with 1037 slots | book Haggai : with 877 slots | book Zechariah : with 4471 slots | book Malachi : with 1187 slots | book Psalms : with 25372 slots | book Job : with 10912 slots | book Proverbs : with 8859 slots | book Ruth : with 1802 slots | book Song_of_songs : with 1682 slots | book Ecclesiastes : with 4233 slots | book Lamentations : with 1945 slots | book Esther : with 4621 slots | book Daniel : with 8072 slots | book Ezra : with 5268 slots | book Nehemiah : with 7842 slots | book 1_Chronicles : with 15566 slots | book 2_Chronicles : with 19764 slots 0.09s volumes ok 0.09s Distribute nodes over volumes ... | 0.00s volume tiny ... | | 0.00s book Obadiah with 392 slots | | 0.00s book Nahum with 746 slots | | 0.00s book Haggai with 877 slots | | 0.00s book Habakkuk with 897 slots | | 0.00s book Jonah with 985 slots | | 0.00s book Micah with 1895 slots | 0.01s volume tiny with 5792 slots and 21779 nodes ... | 0.01s volume small ... | | 0.00s book Malachi with 1187 slots | | 0.00s book Joel with 1318 slots | 0.01s volume small with 2505 slots and 9495 nodes ... | 0.01s volume medium ... | | 0.00s book Ezra with 5268 slots | 0.02s volume medium with 5268 slots and 17286 nodes ... 0.11s distribution done 0.11s Remap features ... | 0.00s volume tiny with 21779 nodes ... | 0.25s volume small with 9495 nodes ... | 0.35s volume medium with 17286 nodes ... 0.60s remapping done 0.60s Write volumes as TF datasets | 0.00s Writing volume tiny | 0.20s Writing volume small | 0.30s Writing volume medium 1.07s writing done 1.07s All done
TFs = {}
for name in volumes:
TFs[name] = Fabric(locations=SOURCE, volume=name)
TFs[name].loadAll(silent="deep")
for name in volumes:
TFv = TFs[name]
Fsv = TFv.api.Fs
print(TFv.volumeInfo)
for (feat, info) in TFv.isLoaded("owork ointerfrom ointerto", pretty=False).items():
n = 0
for x in Fsv(feat).items():
n += 1
print(f" {feat:<10}: {n:>7} values\n {info['meta']['description']}")
medium:Ezra
 owork     :   17286 values
 mapping from nodes in the volume to nodes in the work
 ointerfrom:       0 values
 all outgoing inter-volume edges
 ointerto  :       0 values
 all incoming inter-volume edges
small:Malachi-Joel
 owork     :    9495 values
 mapping from nodes in the volume to nodes in the work
 ointerfrom:       0 values
 all outgoing inter-volume edges
 ointerto  :       0 values
 all incoming inter-volume edges
tiny:Obadiah-Nahum-Haggai-Habakkuk-Jonah-Micah
 owork     :   21779 values
 mapping from nodes in the volume to nodes in the work
 ointerfrom:       0 values
 all outgoing inter-volume edges
 ointerto  :       0 values
 all incoming inter-volume edges
ointerto, ointerfrom
Note that in our volumes the features ointerfrom and ointerto are empty.
These features collect edge data for edges between a node inside the volume and a node outside the volume.
In this work we do not have such edges, because we did not load the parallels module explicitly,
and Fabric(locations, modules) only looks in the directories specified in its locations and modules parameters.
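If you do want the cross-references in a Fabric-based workflow, you can pass the module's data directory as an extra location. A minimal sketch, assuming you have cloned the ETCBC/parallels repository so that its TF data sits at ~/github/ETCBC/parallels/tf/2021:

# assumed location of a local clone of the parallels module data
PARALLELS = f"{GH}/ETCBC/parallels/tf/{VERSION}"
TFwp = Fabric(locations=[SOURCE, PARALLELS])
apiwp = TFwp.loadAll()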
We use the same collection specification as before.
TFw.collect(
tuple(volumes),
COLLECTION,
overwrite=True,
)
Collection prophets exists and will be recreated 0.00s Loading volume medium from ~/github/ETCBC/bhsa/tf/2021/_local/medium ... 0.04s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.08s Loading volume small from ~/github/ETCBC/bhsa/tf/2021/_local/small ... 0.02s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.13s Loading volume tiny from ~/github/ETCBC/bhsa/tf/2021/_local/tiny ... 0.04s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.22s inspect metadata ... 0.22s metadata sorted out 0.22s check nodetypes ... | volume medium | volume small | volume tiny 0.22s node types ok 0.22s Collect nodes from volumes ... | 0.00s Check against overlapping slots ... | | medium : 5268 slots | | small : 2505 slots | | tiny : 5792 slots | 0.00s no overlap | 0.01s Group non-slot nodes by type | | medium : 5269- 17286 | | small : 2506- 9495 | | tiny : 5793- 21779 | 0.01s Mapping nodes from volume to/from work ... | | book : 13566 - 13574 | | chapter : 13575 - 13611 | | clause : 13612 - 16416 | | clause_atom : 16417 - 19312 | | half_verse : 19313 - 20680 | | phrase : 20681 - 28480 | | phrase_atom : 28481 - 36802 | | sentence : 36803 - 38775 | | sentence_atom : 38776 - 40788 | | subphrase : 40789 - 45086 | | verse : 45087 - 45809 | | lex : 45810 - 47884 | 0.02s The new work has 47884 nodes of which 13565 slots 0.24s collection done 0.24s remap features ... 0.61s remapping done 0.61s write work as TF data set 1.06s writing done 1.06s done
True
TFc = Fabric(locations=SOURCE, collection=COLLECTION)
TFc.loadAll(silent="deep")
<tf.core.api.Api at 0x315ed1b90>
We can pass either the location of a work, or the API handle to the loaded features of a work. We do the latter here.
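For completeness, a sketch of the former variant: if you omit api=, extract() loads the work itself from the location you pass.

# extract() will load the work from SOURCE since no api= is given
volumesByLocation = extract(SOURCE, TARGET, VOLUMES, overwrite=True)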
VOLUMES_WRONG = dict(
tiny=("Obadiah", "Nahum", "Haggai", "Habakkuk", "Jonah", "Micah"),
small=("Obadiah", "Malachi", "Joel"),
medium=("Ezra",),
)
This will turn out to be wrong because there is a book that occurs in several volumes.
volumes = extract(SOURCE, TARGET, VOLUMES_WRONG, api=apiw, overwrite=True)
0.00s Check volumes ...
| 17s Section Obadiah of volume tiny reoccurs in volume small
It is not allowed to extract volumes that have material in common!
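You can also catch such an overlap yourself before calling extract(); a minimal sketch in plain Python:

from collections import Counter

# count how often each book is claimed by a volume in the specification
bookCounts = Counter(book for parts in VOLUMES_WRONG.values() for book in parts)
print([book for (book, n) in bookCounts.items() if n > 1])  # expect: ['Obadiah']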
volumes = extract(SOURCE, TARGET, VOLUMES, api=apiw, overwrite=True)
0.00s Check volumes ... | Volume tiny exists and will be recreated | Volume small exists and will be recreated | Volume medium exists and will be recreated | Work consists of 39 books: | book Genesis : with 28764 slots | book Exodus : with 23748 slots | book Leviticus : with 17099 slots | book Numbers : with 23188 slots | book Deuteronomy : with 20128 slots | book Joshua : with 14526 slots | book Judges : with 14086 slots | book 1_Samuel : with 18929 slots | book 2_Samuel : with 15612 slots | book 1_Kings : with 18685 slots | book 2_Kings : with 17307 slots | book Isaiah : with 22931 slots | book Jeremiah : with 29736 slots | book Ezekiel : with 26182 slots | book Hosea : with 3146 slots | book Joel : with 1318 slots | book Amos : with 2780 slots | book Obadiah : with 392 slots | book Jonah : with 985 slots | book Micah : with 1895 slots | book Nahum : with 746 slots | book Habakkuk : with 897 slots | book Zephaniah : with 1037 slots | book Haggai : with 877 slots | book Zechariah : with 4471 slots | book Malachi : with 1187 slots | book Psalms : with 25372 slots | book Job : with 10912 slots | book Proverbs : with 8859 slots | book Ruth : with 1802 slots | book Song_of_songs : with 1682 slots | book Ecclesiastes : with 4233 slots | book Lamentations : with 1945 slots | book Esther : with 4621 slots | book Daniel : with 8072 slots | book Ezra : with 5268 slots | book Nehemiah : with 7842 slots | book 1_Chronicles : with 15566 slots | book 2_Chronicles : with 19764 slots 0.10s volumes ok 0.10s Distribute nodes over volumes ... | 0.00s volume tiny ... | | 0.00s book Obadiah with 392 slots | | 0.00s book Nahum with 746 slots | | 0.00s book Haggai with 877 slots | | 0.00s book Habakkuk with 897 slots | | 0.00s book Jonah with 985 slots | | 0.00s book Micah with 1895 slots | 0.01s volume tiny with 5792 slots and 21779 nodes ... | 0.01s volume small ... | | 0.00s book Malachi with 1187 slots | | 0.00s book Joel with 1318 slots | 0.01s volume small with 2505 slots and 9495 nodes ... | 0.01s volume medium ... | | 0.00s book Ezra with 5268 slots | 0.02s volume medium with 5268 slots and 17286 nodes ... 0.12s distribution done 0.12s Remap features ... | 0.00s volume tiny with 21779 nodes ... | 0.23s volume small with 9495 nodes ... | 0.33s volume medium with 17286 nodes ... 0.60s remapping done 0.60s Write volumes as TF datasets | 0.00s Writing volume tiny | 0.20s Writing volume small | 0.30s Writing volume medium 1.06s writing done 1.06s All done
Now we make the same collection as before, but first we make a few deliberate mistakes.
collect(
(("tiny", f"{TARGET}/tiny"), ("tiny", f"{TARGET}/small")),
f"{TARGET}/bible",
overwrite=True,
)
Collection bible exists and will be recreated
25s Volume tiny is already part of the collection
False
collect(
(("tiny", f"{TARGET}/tiny"), ("small", f"{TARGET}/tiny")),
f"{TARGET}/bible",
overwrite=True,
)
28s Volume tiny at location ~/github/ETCBC/bhsa/tf/2021/_local/tiny reoccurs as volume small
False
collect(
{name: info["location"] for (name, info) in volumes.items()},
f"{TARGET}/bible",
overwrite=True,
)
Collection bible exists and will be recreated 0.00s Loading volume medium from ~/github/ETCBC/bhsa/tf/2021/_local/medium ... 0.04s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.08s Loading volume small from ~/github/ETCBC/bhsa/tf/2021/_local/small ... 0.03s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.13s Loading volume tiny from ~/github/ETCBC/bhsa/tf/2021/_local/tiny ... 0.04s Feature overview: 112 for nodes; 4 for edges; 1 configs; 9 computed 0.22s inspect metadata ... 0.22s metadata sorted out 0.22s check nodetypes ... | volume medium | volume small | volume tiny 0.22s node types ok 0.22s Collect nodes from volumes ... | 0.00s Check against overlapping slots ... | | medium : 5268 slots | | small : 2505 slots | | tiny : 5792 slots | 0.01s no overlap | 0.01s Group non-slot nodes by type | | medium : 5269- 17286 | | small : 2506- 9495 | | tiny : 5793- 21779 | 0.01s Mapping nodes from volume to/from work ... | | book : 13566 - 13574 | | chapter : 13575 - 13611 | | clause : 13612 - 16416 | | clause_atom : 16417 - 19312 | | half_verse : 19313 - 20680 | | phrase : 20681 - 28480 | | phrase_atom : 28481 - 36802 | | sentence : 36803 - 38775 | | sentence_atom : 38776 - 40788 | | subphrase : 40789 - 45086 | | verse : 45087 - 45809 | | lex : 45810 - 47884 | 0.02s The new work has 47884 nodes of which 13565 slots 0.24s collection done 0.24s remap features ... 0.62s remapping done 0.62s write work as TF data set 1.07s writing done 1.07s done
True
CC-BY Dirk Roorda