Print the syntax tree of a specific verse (N1904-TF)¶

Table of content (ToC)¶

1 - Introduction
2 - Load Text-Fabric app and data
3 - Performing the queries
4 - Notebook version details

1 - Introduction ¶

Back to ToC ¶

This Jupyter Notebook demonstrates how to display the syntax tree for a specific verse. We will begins by explaining how you can use Text-Fabric to select a particular verse from the Greek New Testament. Additionally, it utilizes the A.viewType() function to showcase the differences between the two types of syntax tree presentations included in the dataset.

2 - Load Text-Fabric app and data ¶

Back to ToC ¶

In [1]:

%load_ext autoreload
%autoreload 2

In [2]:

# Loading the Text-Fabric code
# Note: it is assumed Text-Fabric is installed in your environment
from tf.fabric import Fabric
from tf.app import use

In [3]:

# load the N1904 app and data
N1904 = use ("CenterBLC/N1904", version="1.0.0", hoist=globals())

Locating corpus resources ...

app: ~/text-fabric-data/github/CenterBLC/N1904/app

data: ~/text-fabric-data/github/CenterBLC/N1904/tf/1.0.0

TF: TF API 12.6.1, CenterBLC/N1904/app v3, Search Reference
Data: CenterBLC - N1904 1.0.0, Character table, Feature docs

Node types

Name	# of nodes	# slots / node	% coverage
book	27	5102.93	100
chapter	260	529.92	100
verse	7944	17.34	100
sentence	8011	17.20	100
group	8945	7.01	46
clause	42506	8.36	258
wg	106868	6.88	533
phrase	69007	1.90	95
subphrase	116178	1.60	135
word	137779	1.00	100

Sets: no custom sets
Features:

Nestle 1904 Greek New Testament

after

str

material after the end of the word

appositioncontainer

int

1 if it is an apposition container

articular

int

1 if the sentence, group, clause, phrase or wg has an article

before

str

this is XML attribute before

betacode

str

Betacode representation of the unicode surface word

book

str

book name (full name)

bookshort

str

book name (abbreviated) from ref attribute in xml

case

str

grammatical case

chapter

int

chapter number, from ref attribute in xml

clausetype

str

clause type

cls

str

this is XML attribute cls

cltype

str

clause type

criticalsign

str

this is XML attribute criticalsign

crule

str

clause rule (from xml attribute Rule)

degree

str

grammatical degree

discontinuous

int

1 if the word is out of sequence in the xml

domain

str

domain

framespec

str

this is XML attribute framespec

function

str

this is XML attribute function

gender

str

grammatical gender

gloss

str

English gloss (BGVB)

id

str

xml id

junction

str

type of junction

lang

str

language the text is in

lemma

str

lexical lemma

lemmatranslit

str

transliteration of the word lemma

ln

str

ln

mood

str

verbal mood

morph

str

morphological code

nodeid

str

node id (as in the XML source data)

normalized

str

lemma normalized

note

str

annotation of linguistic nature

num

int

generated number (not in xml): book: (Matthew=1, Mark=2, ..., Revelation=27); sentence: numbered per chapter; word: numbered per verse.

number

str

grammatical number

otype

str

person

str

grammatical person

punctuation

str

punctuation found after a word

ref

str

biblical reference with word counting

referent

str

number of referent

rela

str

this is XML attribute rela

role

str

role

rule

str

syntactical rule

sp

str

part-of-speach

strong

int

strong number

subjrefspec

str

this is XML attribute subjrefspec

tense

str

verbal tense

text

str

the text of a word

trailer

str

material after the end of the word (excluding critical signs)

trans

str

translation of the word surface text according to the Berean Interlinear Bible

translit

str

transliteration of the word surface text

typ

str

syntactical type (on sentence, group, clause or phrase)

typems

str

morphological type (on word), syntactical type (on sentence, group, clause, phrase or wg)

unaccent

str

word in unicode characters without accents and diacritical markers

unicode

str

word in unicode characters plus material after it

variant

str

this is XML attribute variant

verse

int

verse number, from ref attribute in xml

voice

str

verbal voice

frame

str

frame

oslots

none

parent

none

parent relationship between words

sibling

int

this is XML attribute sibling

subjref

none

number of subject referent

Settings:

specified

apiVersion: 3
appName: CenterBLC/N1904
appPath: C:/Users/tonyj/text-fabric-data/github/CenterBLC/N1904/app
commit: gdb630837ae89b9468c9e50d13bda05cfd3de4f18
css: ''
dataDisplay:
- excludedFeatures: []
- noneValues:
  - none
  - unknown
  - no value
  - NA
- sectionSep1:
- sectionSep2: :
- textFormat: text-orig-full
docs:
- docBase: https://github.com/CenterBLC/N1904/tree/main/docs
- docPage: about
- docRoot: https://github.com/CenterBLC/N1904
- featureBase:
  https://github.com/CenterBLC/N1904/blob/main/docs/features/<feature>.md
- featurePage: README
interfaceDefaults: {fmt: text-orig-full}
isCompatible: True
local: local
localDir:
C:/Users/tonyj/text-fabric-data/github/CenterBLC/N1904/_temp
provenanceSpec:
- branch: main
- corpus: Nestle 1904 Greek New Testament
- doi: 10.5281/zenodo.13117910
- moduleSpecs: []
- org: CenterBLC
- relative: /tf
- repo: N1904
- repro: N1904
- version: 1.0.0
- webBase: https://learner.bible/text/show_text/nestle1904/
- webHint: Show this on the website
- webLang: en
- webUrl:
  https://learner.bible/text/show_text/nestle1904/<1>/<2>/<3>
- webUrlLex: {webBase}/word?version={version}&id=<lid>
release: 1.0.0
typeDisplay:
- clause:
  - condense: True
  - label: {typ} {function} {rela} \\ {cls} {role} {junction}
  - style: ''
- group:
  - label: {typ} {function} {rela} \\ {typems} {role} {rule}
  - style: ''
- phrase:
  - condense: True
  - label: {typ} {function} {rela} \\ {typems} {role} {rule}
  - style: ''
- sentence:
  - label: {typ} {function} {rela} \\ {role} {rule}
  - style: ''
- subphrase:
  - label: {typ} {function} {rela} \\ {typems} {role} {rule}
  - style: ''
- verse:
  - condense: True
  - label: {book} {chapter}:{verse}
  - style: ''
- wg:
  - condense: True
  - label: {typems} {role} {rule} {junction}
  - style: ''
- word:
  - features:
    lemma
    sp
  - featuresBare: [gloss]
writing: grc

TF API: names N F E L T S C TF Fs Fall Es Eall Cs Call directly usable

Display is setup for viewtype syntax-view

See here for more information on viewtypes

In [4]:

# The following will push the Text-Fabric stylesheet to this notebook (to facilitate proper display with notebook viewer)
N1904.dh(N1904.getCss())

3 - Performing the queries ¶

Back to ToC ¶

3.1 - Show a specific verse ¶

The following example demonstrates a query for a specific verse (Mark 1:1). As expected, the query returns a single result.

In [6]:

# Define the query template
VerseQuery = '''
book book=Mark
  chapter chapter=1
      verse verse=1
'''

In [7]:

# The following will create a list containing ordered tuples consisting of node numbers of the items as they appear in the query
VerseResult = N1904.search(VerseQuery)

  0.01s 1 result

The result stored in object VerseResult is a list of tuples. In this example, the list contains only one tuple. Each tuple corresponds to the nodes retrieved based on the query template, with the number of nodes in the tuple matching the three specified in the query.

You can inspect the contents of VerseResult using the following print statement:

print (VerseResult)

This will display the numeric values for the selected book, chapter, and verse nodes, which are in this example:

[(137781, 137835, 383782)]

Next we will print the syntax tree for the obtained results:

In [36]:

# Print the result
N1904.show(VerseResult,queryFeatures=False)

verse 1

Mark 1:1

verse Mark 1:1

sentence

phrase NP PreC \\ modifier-scope p NPofNP

subphrase \\ common

Ἀρχὴ

subphrase NP \\ modifier-scope DetNP

subphrase \\

τοῦ

subphrase NP \\ modifier-scope NPofNP

subphrase \\ common

εὐαγγελίου

subphrase \\ group Np-Appos

subphrase \\ proper

Ἰησοῦ

subphrase \\ proper

Χριστοῦ

subphrase NP Appo \\ modifier-scope apposition NPofNP

subphrase \\ common

(Υἱοῦ

subphrase \\ common

Θεοῦ).

3.1.1 - Alternative methods ¶

In addition to the straightforward example provided earlier, there are other, more advanced methods for selecting a specific verse in Text-Fabric. In this section two will be briefly described.

Firstly, the code from the previous cells can be combined into a single, compact, and efficient line of code. This approach yields a list of tuples, with each tuple containing only one element (namely a verse node), which is then used as argument for the A.show() function:

N1904.show(N1904.search('verse book=Mark chapter=1 verse=1'))

Another method takes advantage of the fact that A.show() expects a list of tuples. This is achieved by encapsulating the numeric value of the verse node in a list, using square brackets [ ], and making the integer part of a tupe, using parentheses ( , ). Additionally, the function T.nodeFromSection() expects a tuple as input, which is created using ('Mark',1,1). Combining these steps results in the following construction:

N1904.show([(T.nodeFromSection(('Mark',1,1)), )])

3.2 - Selecting individual words of the verse ¶

A similar (but still different) result can be obtained by selecting all words from the verse individualy. Since each word is counted as a separate result, the total number of results is higher — in this case, seven. Additionally, note that the found items (i.e., individual words) are highlighted in yellow. Using the argument condensed=Truecombines all the found items, limiting the display to a single instance of the verse, as all results come from the same verse. If the argument condensed=False were supplied, the verse would be displayed seven times, with each instance highlighting the next consecutive word in yellow.

In [37]:

# Define the query template
AltVerseQuery = '''
word book=Mark chapter=1 verse=1
'''

# The following will create a list containing ordered tuples consisting of node numbers of the items as they appear in the query
AltVerseResult = N1904.search(AltVerseQuery)

# Print some of the results
N1904.show(AltVerseResult, start=1, end=15, condensed=True, queryFeatures=False)

  0.10s 7 results

verse 1

Mark 1:1

verse Mark 1:1

sentence

phrase NP PreC \\ modifier-scope p NPofNP

subphrase \\ common

Ἀρχὴ

subphrase NP \\ modifier-scope DetNP

subphrase \\

τοῦ

subphrase NP \\ modifier-scope NPofNP

subphrase \\ common

εὐαγγελίου

subphrase \\ group Np-Appos

subphrase \\ proper

Ἰησοῦ

subphrase \\ proper

Χριστοῦ

subphrase NP Appo \\ modifier-scope apposition NPofNP

subphrase \\ common

(Υἱοῦ

subphrase \\ common

Θεοῦ).

3.3 - Available text output formats ¶

Text-Fabric's data design enables a very flexible representation of the corpus text (see this section). If no specific format is defined, the default format will be used, which was set during the dataset's creation (for this dataset, the default is text-orig-full). Additionally, the dataset includes several other formats that are particularly relevant to the Greek New Testament corpus.

To view the available formats for displaying the text in this dataset:

In [40]:

T.formats

Out[40]:

{'lex-orig-plain': 'word',
 'lex-translit-plain': 'word',
 'text-orig-full': 'word',
 'text-orig-plain': 'word',
 'text-translit-plain': 'word',
 'text-unaccent-plain': 'word'}

This list reveals that all defined formats are based on word nodes. This means that the output for any given format is generated using a specific set of features associated with word nodes. The exact combination of features used for each text format can be examined by running the following command:

In [42]:

N1904.showFormats()

format	level	template
`lex-orig-plain`	word	`{lemma}{trailer}`
`lex-translit-plain`	word	`{lemmatranslit}{trailer}`
`text-orig-full`	word	`{before}{text}{after}`
`text-orig-plain`	word	`{text}{trailer}`
`text-translit-plain`	word	`{translit}{trailer}`
`text-unaccent-plain`	word	`{unaccent}{trailer}`

Remark regarding data origin¶

This data originates from file otext.tf:

@config
...
@fmt:text-orig-full={before}{text}{after}
...

3.4 - Compare text output formats ¶

Using the example verse, we can illustrate how the different formats in this dataset influence the presentation of Mark 1:1. To do this, we iterate over the defined text formats and display the text associated with the verse node in each format.

In [43]:

# note: node 383782 is of type 'verse' and associated to Mark 1:1 
for formats in T.formats:
    print(f'fmt={formats}\t: {T.text(383782,formats)}')

fmt=lex-orig-plain	: ἀρχή ὁ εὐαγγέλιον Ἰησοῦς Χριστός υἱός θεός. 
fmt=lex-translit-plain	: arkhe o euaggelion Iesous Khristos uios theos. 
fmt=text-orig-full	: Ἀρχὴ τοῦ εὐαγγελίου Ἰησοῦ Χριστοῦ (Υἱοῦ Θεοῦ). 
fmt=text-orig-plain	: Ἀρχὴ τοῦ εὐαγγελίου Ἰησοῦ Χριστοῦ Υἱοῦ Θεοῦ. 
fmt=text-translit-plain	: Arkhe tou euaggeliou Iesou Khristou Uiou Theou. 
fmt=text-unaccent-plain	: Αρχη του ευαγγελιου Ιησου Χριστου Υιου Θεου.

4 - Notebook version details ¶

Back to ToC ¶

Author	Tony Jurg
Version	1.1
Date	9 October 2024

Print the syntax tree of a specific verse (N1904-TF)¶

Table of content (ToC)¶

1 - Introduction ¶

Back to ToC¶

2 - Load Text-Fabric app and data ¶

Back to ToC¶

3 - Performing the queries ¶

Back to ToC¶

3.1 - Show a specific verse¶

3.1.1 - Alternative methods ¶

3.2 - Selecting individual words of the verse ¶

3.3 - Available text output formats ¶

Remark regarding data origin¶

3.4 - Compare text output formats ¶

4 - Notebook version details¶

Back to ToC¶

Back to ToC ¶

Back to ToC ¶

Back to ToC ¶

3.1 - Show a specific verse ¶

4 - Notebook version details ¶

Back to ToC ¶