You might want to consider the start of this tutorial.

Short introductions to other TF datasets:

or the

Quran

Rich display¶

Text-Fabric offers pretty and plain displays of textual objects.

A plain display of an object is a simple reference to that object if it is big, or the text of that object if it is small.

A pretty display of an object is a representation of the structure of that object. It contains text and features of sub objects. Provided the object is not too big.

In [1]:

%load_ext autoreload
%autoreload 2

Incantation¶

The ins and outs of installing Text-Fabric, getting the corpus, and initializing a notebook are explained in the start tutorial.

In [2]:

from tf.app import use

In [3]:

A = use("ETCBC/bhsa", hoist=globals())

Locating corpus resources ...

app: ~/text-fabric-data/github/ETCBC/bhsa/app

data: ~/text-fabric-data/github/ETCBC/bhsa/tf/2021

data: ~/text-fabric-data/github/ETCBC/phono/tf/2021

data: ~/text-fabric-data/github/ETCBC/parallels/tf/2021

Text-Fabric: Text-Fabric API 12.0.4, ETCBC/bhsa/app v3, Search Reference
Data: ETCBC - bhsa 2021, Character table, Feature docs

Node types

Name	# of nodes	# slots/node	% coverage
book	39	10938.21	100
chapter	929	459.19	100
lex	9230	46.22	100
verse	23213	18.38	100
half_verse	45179	9.44	100
sentence	63717	6.70	100
sentence_atom	64514	6.61	100
clause	88131	4.84	100
clause_atom	90704	4.70	100
phrase	253203	1.68	100
phrase_atom	267532	1.59	100
subphrase	113850	1.42	38
word	426590	1.00	100

Sets: no custom sets
Features:

Parallel Passages

crossref

int

🆗 links between similar passages

BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis

book

str

✅ book name in Latin (Genesis; Numeri; Reges1; ...)

book@ll

str

✅ book name in amharic (ኣማርኛ)

chapter

int

✅ chapter number (1; 2; 3; ...)

code

int

✅ identifier of a clause atom relationship (0; 74; 367; ...)

det

str

✅ determinedness of phrase(atom) (det; und; NA.)

domain

str

✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)

freq_lex

int

✅ frequency of lexemes

function

str

✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)

g_cons

str

✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)

g_cons_utf8

str

✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)

g_lex

str

✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)

g_lex_utf8

str

✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)

g_word

str

✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)

g_word_utf8

str

✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)

gloss

str

🆗 english translation of lexeme (beginning create god(s))

gn

str

✅ grammatical gender (m; f; NA; unknown.)

label

str

✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)

language

str

✅ of word or lexeme (Hebrew; Aramaic.)

lex

str

✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)

lex_utf8

str

✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)

ls

str

✅ lexical set, subclassification of part-of-speech (card; ques; mult)

nametype

str

⚠️ named entity type (pers; mens; gens; topo; ppde.)

nme

str

✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)

nu

str

✅ grammatical number (sg; du; pl; NA; unknown.)

number

int

✅ sequence number of an object within its context

otype

str

pargr

str

🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)

pdp

str

✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)

pfm

str

✅ preformative consonantal-transliterated (absent; n/a; J, ...)

prs

str

✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)

prs_gn

str

✅ pronominal suffix gender (m; f; NA; unknown.)

prs_nu

str

✅ pronominal suffix number (sg; du; pl; NA; unknown.)

prs_ps

str

✅ pronominal suffix person (p1; p2; p3; NA; unknown.)

ps

str

✅ grammatical person (p1; p2; p3; NA; unknown.)

qere

str

✅ word pointed-transliterated masoretic reading correction

qere_trailer

str

✅ interword material -pointed-transliterated (Masoretic correction)

qere_trailer_utf8

str

✅ interword material -pointed-transliterated (Masoretic correction)

qere_utf8

str

✅ word pointed-Hebrew masoretic reading correction

rank_lex

int

✅ ranking of lexemes based on freqnuecy

rela

str

✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)

sp

str

✅ part-of-speech (art; verb; subs; nmpr, ...)

st

str

✅ state of a noun (a (absolute); c (construct); e (emphatic).)

tab

int

✅ clause atom: its level in the linguistic embedding

trailer

str

✅ interword material pointed-transliterated (& 00 05 00_P ...)

trailer_utf8

str

✅ interword material pointed-Hebrew (־ ׃)

txt

str

✅ text type of clause and surrounding (repetion of ? N D Q as in feature domain)

typ

str

✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)

uvf

str

✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)

vbe

str

✅ verbal ending consonantal-transliterated (n/a; W; ...)

vbs

str

✅ root formation consonantal-transliterated (absent; n/a; H; ...)

verse

int

✅ verse number

voc_lex

str

✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)

voc_lex_utf8

str

✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)

vs

str

✅ verbal stem (qal; piel; hif; apel; pael)

vt

str

✅ verbal tense (perf; impv; wayq; infc)

mother

none

✅ linguistic dependency between textual objects

oslots

none

Phonetic Transcriptions

phono

str

🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)

phono_trailer

str

🆗 interword material in phonological transcription

Settings:

specified

apiVersion: 3
appName: ETCBC/bhsa
appPath: /Users/me/text-fabric-data/github/ETCBC/bhsa/app
commit: gd905e3fb6e80d0fa537600337614adc2af157309
css: ''
dataDisplay:
- exampleSectionHtml:
  <code>Genesis 1:1</code> (use <a href="https://github.com/{org}/{repo}/blob/master/tf/{version}/book%40en.tf" target="_blank">English book names</a>)
- excludedFeatures:
  - g_uvf_utf8
  - g_vbs
  - kq_hybrid
  - languageISO
  - g_nme
  - lex0
  - is_root
  - g_vbs_utf8
  - g_uvf
  - dist
  - root
  - suffix_person
  - g_vbe
  - dist_unit
  - suffix_number
  - distributional_parent
  - kq_hybrid_utf8
  - crossrefSET
  - instruction
  - g_prs
  - lexeme_count
  - rank_occ
  - g_pfm_utf8
  - freq_occ
  - crossrefLCS
  - functional_parent
  - g_pfm
  - g_nme_utf8
  - g_vbe_utf8
  - kind
  - g_prs_utf8
  - suffix_gender
  - mother_object_type
- noneValues:
  - none
  - unknown
  - no value
  - NA
docs:
- docBase: {docRoot}/{repo}
- docExt: ''
- docPage: ''
- docRoot: https://{org}.github.io
- featurePage: 0_home
interfaceDefaults: {}
isCompatible: True
local: local
localDir: /Users/me/text-fabric-data/github/ETCBC/bhsa/_temp
provenanceSpec:
- corpus: BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis
- doi: 10.5281/zenodo.1007624
- moduleSpecs:
  - :
    backend: no value
    corpus: Phonetic Transcriptions
    docUrl:
    https://nbviewer.jupyter.org/github/etcbc/phono/blob/master/programs/phono.ipynb
    doi: 10.5281/zenodo.1007636
    org: ETCBC
    relative: /tf
    repo: phono
  - :
    backend: no value
    corpus: Parallel Passages
    docUrl:
    https://nbviewer.jupyter.org/github/ETCBC/parallels/blob/master/programs/parallels.ipynb
    doi: 10.5281/zenodo.1007642
    org: ETCBC
    relative: /tf
    repo: parallels
- org: ETCBC
- relative: /tf
- repo: bhsa
- version: 2021
- webBase: https://shebanq.ancient-data.org/hebrew
- webHint: Show this on SHEBANQ
- webLang: la
- webLexId: True
- webUrl:
  {webBase}/text?book=<1>&chapter=<2>&verse=<3>&version={version}&mr=m&qw=q&tp=txt_p&tr=hb&wget=v&qget=v&nget=vt
- webUrlLex: {webBase}/word?version={version}&id=<lid>
release: v1.8
typeDisplay:
- clause:
  - label: {typ} {rela}
  - style: ''
- clause_atom:
  - hidden: True
  - label: {code}
  - level: 1
  - style: ''
- half_verse:
  - hidden: True
  - label: {label}
  - style: ''
  - verselike: True
- lex:
  - featuresBare: gloss
  - label: {voc_lex_utf8}
  - lexOcc: word
  - style: orig
  - template: {voc_lex_utf8}
- phrase:
  - label: {typ} {function}
  - style: ''
- phrase_atom:
  - hidden: True
  - label: {typ} {rela}
  - level: 1
  - style: ''
- sentence:
  - label: {number}
  - style: ''
- sentence_atom:
  - hidden: True
  - label: {number}
  - level: 1
  - style: ''
- subphrase:
  - hidden: True
  - label: {number}
  - style: ''
- word:
  - features: pdp vs vt
  - featuresBare: lex:gloss
writing: hbo

Text-Fabric API: names N F E L T S C TF Fs Fall Es Eall Cs Call directly usable

Arbitrary nodes¶

We pretty-print some (arbitrary) nodes.

The first verse.

In [4]:

v1 = A.nodeFromSectionStr("Genesis 1:1")
v1

Out[4]:

In [5]:

A.pretty(v1)

Genesis 1:1

verse

sentence 1

clause xQtX NA

phrase PP Time

phrase VP Pred

phrase NP Subj

phrase PP Objc

With standard features displayed:

In [6]:

A.pretty(v1, standardFeatures=True)

Genesis 1:1

verse

sentence 1

clause xQtX NA

phrase PP Time

בְּ

inpdp=prep

רֵאשִׁ֖ית

beginningpdp=subs

phrase VP Pred

בָּרָ֣א

createpdp=verbvs=qalvt=perf

phrase NP Subj

אֱלֹהִ֑ים

god(s)pdp=subs

phrase PP Objc

thepdp=art

heavenspdp=subs

andpdp=conj

thepdp=art

earthpdp=subs

Now a phrase. We display it with little and with much information.

In [7]:

phrase = 651605
A.pretty(phrase, withNodes=False, prettyTypes=False)
A.pretty(phrase, withNodes=True, standardFeatures=True, hideTypes=False)

phrase:651605 PP Cmpl

phrase_atom:904808 PP NA

subphrase:1300549

51 בֵּ֥ין

intervalpdp=prep

52 הָ

thepdp=art

53 אֹ֖ור

lightpdp=subs

54 וּ

andpdp=conj

subphrase:1300550

55 בֵ֥ין

intervalpdp=prep

56 הַ

thepdp=art

57 חֹֽשֶׁךְ׃

darknesspdp=subs

If we want to see the subphrases but not the phrase atoms:

In [8]:

A.pretty(phrase, withNodes=True, standardFeatures=True, hiddenTypes="phrase_atom")

Genesis 1:4

phrase:651605 PP Cmpl

subphrase:1300549

51 בֵּ֥ין

intervalpdp=prep

52 הָ

thepdp=art

53 אֹ֖ור

lightpdp=subs

54 וּ

andpdp=conj

subphrase:1300550

55 בֵ֥ין

intervalpdp=prep

56 הַ

thepdp=art

57 חֹֽשֶׁךְ׃

darknesspdp=subs

Use the following to find out which display options are available and what their current values are.

In [9]:

A.displayShow()

current display options

1. baseTypes

word

2. colorMap

None

3. condenseType

verse

4. condensed

False

5. end

None

6. extraFeatures

()
{}

7. fmt

None

8. full

False

9. hiddenTypes

clause_atom
half_verse
phrase_atom
sentence_atom
subphrase

10. hideTypes

True

11. highlights

{}

12. lineNumbers

None

13. noneValues

none
unknown
None
NA

14. plainGaps

True

15. prettyTypes

True

16. queryFeatures

True

17. showGraphics

None

18. skipCols

set()

19. standardFeatures

False

20. start

None

21. suppress

set()

22. tupleFeatures

()

23. withNodes

False

24. withPassage

True

25. withTypes

False

Where is this phrase on SHEBANQ? You can click on the passage reference.

You can generate a link that points to where a node is on SHEBANQ as follows:

In [10]:

A.webLink(phrase)

Genesis 1:4

If you want just the URL:

In [11]:

A.webLink(phrase, urlOnly=True)

Out[11]:

'https://shebanq.ancient-data.org/hebrew/text?book=Genesis&chapter=1&verse=4&version=2021&mr=m&qw=q&tp=txt_p&tr=hb&wget=v&qget=v&nget=vt'

A link to another passage:

In [12]:

z = A.nodeFromSectionStr("Ezra 3:4")

In [13]:

A.webLink(z)

Ezra 3:4

Plain¶

We can represent a node in plain representation and highlight specific portions.

In [14]:

firstVerse = F.otype.s("verse")[0]
allPhrases = F.otype.s("phrase")
phrases = {allPhrases[1], allPhrases[3]}
words = (2, 4, 6, 9)

In [15]:

firstSentence = F.otype.s("sentence")[0]
A.plain(firstSentence)