Notebook

Identifying use of critical signs in the text (N1904GBI)¶

Table of content ¶

1 - Introduction]
2 - Load Text-Fabric app and data
3 - Performing the queries
- 3.1 - Getting an overview of leading critical signs
- 3.2 - Query for all words that contain some critical marks

1 - Introduction ¶

Back to TOC ¶

This Jupyter Notebook investigates the pressense of 'odd' values for feature 'after'.

2 - Load Text-Fabric app and data ¶

Back to TOC ¶

In [1]:

%load_ext autoreload
%autoreload 2

In [1]:

# Loading the New Testament TextFabric code
# Note: it is assumed Text-Fabric is installed in your environment.

from tf.fabric import Fabric
from tf.app import use

In [2]:

# load the app and data
N1904 = use ("tonyjurg/Nestle1904GBI", version="0.4", hoist=globals())

Locating corpus resources ...

The requested app is not available offline
	~/text-fabric-data/github/tonyjurg/Nestle1904GBI/app not found

Status: latest release online 0.4 versus None locally

downloading app, main data and requested additions ...

app: ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/app

The requested data is not available offline
	~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4 not found

Status: latest release online 0.4 versus None locally

downloading app, main data and requested additions ...

data: ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4

   |     0.22s T otype                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     1.91s T oslots               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.56s T after                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.57s T chapter              from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.67s T book                 from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.66s T word                 from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.58s T verse                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |      |     0.05s C __levels__           from otype, oslots, otext
   |      |     1.53s C __order__            from otype, oslots, __levels__
   |      |     0.07s C __rank__             from otype, __order__
   |      |     2.45s C __levUp__            from otype, oslots, __rank__
   |      |     1.50s C __levDown__          from otype, __levUp__, __rank__
   |      |     0.06s C __characters__       from otext
   |      |     0.99s C __boundary__         from otype, oslots, __rank__
   |      |     0.04s C __sections__         from otype, oslots, otext, __levUp__, __levels__, book, chapter, verse
   |      |     0.24s C __structure__        from otype, oslots, otext, __rank__, __levUp__, book, chapter, verse
   |     0.56s T booknum              from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.65s T bookshort            from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.53s T case                 from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.54s T clause               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.07s T clauserule           from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.02s T clausetype           from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.47s T degree               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.57s T formaltag            from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.58s T functionaltag        from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.63s T gloss                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.53s T gn                   from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.63s T lemma                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.58s T lex_dom              from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.60s T ln                   from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.50s T monad                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.51s T mood                 from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.71s T nodeID               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.66s T normalized           from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.56s T nu                   from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.55s T number               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.50s T person               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.79s T phrase               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.30s T phrasefunction       from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.31s T phrasefunctionlong   from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.30s T phrasetype           from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.51s T sentence             from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.57s T sp                   from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.57s T splong               from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.59s T strongs              from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.50s T subj_ref             from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.50s T tense                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.51s T type                 from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4
   |     0.50s T voice                from ~/text-fabric-data/github/tonyjurg/Nestle1904GBI/tf/0.4

Text-Fabric: Text-Fabric API 11.4.10, tonyjurg/Nestle1904GBI/app v3, Search Reference
Data: tonyjurg - Nestle1904GBI 0.4, Character table, Feature docs

Node types

Name	# of nodes	# slots/node	% coverage
book	27	5102.93	100
chapter	260	529.92	100
sentence	5720	24.09	100
verse	7943	17.35	100
clause	16124	8.54	100
phrase	72674	1.90	100
word	137779	1.00	100

Sets: no custom sets
Features:

Nestle 1904 (GBI nodes)

after

str

Character after the word (space or punctuation)

book

str

Book name (fully spelled out)

booknum

int

NT book number (Matthew=1, Mark=2, ..., Revelation=27)

bookshort

str

Book name (abbreviated)

case

str

Gramatical case (Nominative, Genitive, Dative, Accusative, Vocative)

chapter

int

Chapter number inside book

clause

int

Clause number (counted per chapter)

clauserule

str

Clause rule

clausetype

str

Clause type

degree

str

Degree (e.g. Comparitative, Superlative)

formaltag

str

Formal tag (Sandborg-Petersen morphology)

functionaltag

str

gloss

str

English gloss

str

Gramatical gender (Masculine, Feminine, Neuter)

lemma

str

Lexeme (lemma)

lex_dom

str

Lexical domain according to Semantic Dictionary of Biblical Greek, SDBG

str

Lauw-Nida lexical classification

monad

int

Sequence number of the smallest meaningful unit of text (single word)

mood

str

Gramatical mood of the verb (passive, etc)

nodeID

str

Node ID (as in the XML source data)

normalized

str

Surface word stripped of punctations

str

Gramatical number (Singular, Plural)

number

str

Gramatical number of the verb

otype

str

person

str

Gramatical person of the verb (first, second, third)

phrase

int

Phrase number (counted per chapter)

phrasefunction

str

Phrase function (abbreviated)

phrasefunctionlong

str

Phrase function (long description)

phrasetype

str

Phrase type information

sentence

int

Sentence number (counted per chapter)

str

Speech Part (abbreviated)

splong

str

Speech Part (long description)

strongs

str

Strongs number

subj_ref

str

Subject reference (to nodeID in XML source data)

tense

str

Gramatical tense of the verb (e.g. Present, Aorist)

type

str

Gramatical type of noun or pronoun (e.g. Common, Personal)

verse

int

Verse number inside chapter

voice

str

Gramatical voice of the verb

word

str

Word as it appears in the text

oslots

none

Text-Fabric API: names N F E L T S C TF directly usable

3 - Performing the queries ¶

Back to TOC ¶

3.1 - Getting an overview of leading critical signs ¶

Back to TOC ¶

First get a list of all unique words in unicode (including punctuations, critical signs and trailing spaces):

In [4]:

unicodeList = F.word.freqList()
print ('Number of results:',len(unicodeList))

Number of results: 19506

Now just look at the first character:

In [8]:

# Initialize an empty dictionary to store the frequencies
frequencyList = {}
criticalSignsList ={}
criticalSigns={"—","[","("}
              
# Iterate through the list (which is a list of ordered tuples)
for item in unicodeList:
    # Get the first character of the item
    firstChar = item[0][0]

    # Update the frequency in the dictionary for the full list
    frequencyList[firstChar] = frequencyList.get(firstChar, 0) + 1
    # add to other list if critical sign
    if firstChar in criticalSigns:
        criticalSignsList[firstChar]=criticalSignsList.get(firstChar, 0) + 1
    

print("Frequency list of all first character:")
print(frequencyList)
print("\nFrequency list of critical character:")
print(criticalSignsList)

Frequency list of all first character:
{'κ': 1477, 'ὁ': 100, 'ἐ': 2305, 'δ': 1009, 'τ': 533, 'ε': 485, 'α': 197, 'ὅ': 47, 'ο': 187, 'ἡ': 96, 'γ': 389, 'μ': 707, 'Θ': 47, 'π': 2261, 'ἵ': 7, 'ὑ': 323, 'ὡ': 16, 'ἀ': 2000, 'Ἰ': 114, 'σ': 1113, 'Κ': 134, 'λ': 409, 'ἢ': 1, 'ἦ': 16, 'Χ': 34, 'ὃ': 3, 'ἂ': 1, 'ἕ': 39, 'ἔ': 358, 'ν': 208, 'ἰ': 88, 'Ὁ': 7, 'ᾧ': 1, 'ἄ': 211, 'Τ': 99, 'ἃ': 2, 'Π': 193, 'ἣ': 2, 'Υ': 9, 'ὄ': 61, 'ὧ': 3, 'ὥ': 12, 'Μ': 115, 'ὀ': 181, 'ἑ': 146, 'β': 362, 'Ο': 38, 'ἤ': 66, 'χ': 277, 'ἁ': 124, 'Ε': 57, 'ζ': 131, 'θ': 317, 'Ἐ': 149, 'Φ': 55, 'ἧ': 2, 'φ': 398, 'ἓ': 2, 'Σ': 158, 'Ἀ': 216, 'ᾖ': 2, 'Ἁ': 7, 'ἥ': 23, 'ὢ': 1, 'ᾗ': 1, 'Ἱ': 14, 'Γ': 62, 'υ': 16, 'Δ': 90, 'ψ': 78, 'Ἅ': 11, 'Ἠ': 32, 'ἱ': 71, 'Ὡ': 8, 'ὕ': 25, 'ῥ': 76, 'Ἄ': 41, 'ἠ': 157, 'ἴ': 32, 'Β': 81, 'Ν': 50, 'Ὑ': 16, 'ὤ': 11, 'Ὅ': 14, 'ὦ': 5, 'Ἴ': 3, 'Ῥ': 34, 'ἅ': 44, 'ᾔ': 6, 'Ἦ': 6, 'Α': 26, 'Ἔ': 44, 'Ὦ': 1, 'Ζ': 24, 'Λ': 56, 'Ὃ': 3, 'Ὕ': 3, 'ξ': 30, 'Ὥ': 2, 'η': 13, 'ἆ': 5, 'Ἕ': 13, 'Ἢ': 2, 'ᾐ': 7, 'Ι': 2, 'Ὀ': 15, 'ὠ': 38, 'ᾠ': 6, '(': 7, 'Ἑ': 33, 'Ὧ': 2, '[': 6, 'Ἃ': 1, 'Ἆ': 3, 'Ἵ': 2, 'ᾄ': 2, 'Η': 2, 'Ἓ': 2, 'Ἤ': 9, 'Ἥ': 4, 'ἶ': 1, 'Ὄ': 5, '—': 15, 'Ξ': 1, 'Ψ': 2, 'ὂ': 1, 'Ὗ': 1, 'Ὤ': 1, 'ᾑ': 1}

Frequency list of critical character:
{'(': 7, '[': 6, '—': 15}

3.2 - Query for all words that contain some critical marks ¶

Back to TOC ¶

In [10]:

# Library to format table
from tabulate import tabulate

# The actual query
SearchLeadingCriticalMarks = '''
word word~[(\(\[—\)\])]
    '''
MarksList = N1904.search(SearchLeadingCriticalMarks)

# Postprocess the query results
Results=[]
for tuple in MarksList:
    node=tuple[0]
    location="{} {}:{}".format(F.book.v(node),F.chapter.v(node),F.verse.v(node))
    result=(location,F.word.v(node),F.after.v(node))
    Results.append(result)
      
# Produce the table
headers = ["location","unicode","word","after"]
print(tabulate(Results, headers=headers, tablefmt='fancy_grid'))

  0.11s 55 results
╒═════════════════════╤══════════════════╤════════╕
│ location            │ unicode          │ word   │
╞═════════════════════╪══════════════════╪════════╡
│ Mark 1:1            │ (Υἱοῦ            │        │
├─────────────────────┼──────────────────┼────────┤
│ Mark 1:1            │ Θεοῦ)            │ .      │
├─────────────────────┼──────────────────┼────────┤
│ Mark 16:9           │ [[Ἀναστὰς        │        │
├─────────────────────┼──────────────────┼────────┤
│ Mark 16:20          │ σημείων.]]       │        │
├─────────────────────┼──────────────────┼────────┤
│ Mark 16:99          │ [[Πάντα          │        │
├─────────────────────┼──────────────────┼────────┤
│ Mark 16:99          │ σωτηρίας.]]      │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 2:35           │ —καὶ             │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 2:35           │ ῥομφαία—         │ ,      │
├─────────────────────┼──────────────────┼────────┤
│ Luke 23:51          │ —οὗτος           │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:12          │ [[Ὁ              │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:12          │ γεγονός.]]       │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:36          │ [[καὶ            │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:36          │ ὑμῖν]]           │ .      │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:40          │ [[καὶ            │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:40          │ πόδας.]]         │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:51          │ [[καὶ            │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:51          │ οὐρανόν]]        │ .      │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:52          │ [[προσκυνήσαντες │        │
├─────────────────────┼──────────────────┼────────┤
│ Luke 24:52          │ αὐτὸν]]          │        │
├─────────────────────┼──────────────────┼────────┤
│ John 1:38           │ (ὃ               │        │
├─────────────────────┼──────────────────┼────────┤
│ John 1:38           │ Διδάσκαλε,)      │        │
├─────────────────────┼──────────────────┼────────┤
│ John 1:41           │ (ὅ               │        │
├─────────────────────┼──────────────────┼────────┤
│ John 1:41           │ Χριστός)         │ .      │
├─────────────────────┼──────────────────┼────────┤
│ John 1:42           │ (ὃ               │        │
├─────────────────────┼──────────────────┼────────┤
│ John 1:42           │ Πέτρος)          │ .      │
├─────────────────────┼──────────────────┼────────┤
│ John 4:2            │ —καίτοιγε        │        │
├─────────────────────┼──────────────────┼────────┤
│ John 5:3            │ ξηρῶν(           │ ,      │
├─────────────────────┼──────────────────┼────────┤
│ John 5:4            │ νοσήματι)        │ .      │
├─────────────────────┼──────────────────┼────────┤
│ John 7:22           │ —οὐχ             │        │
├─────────────────────┼──────────────────┼────────┤
│ John 7:53           │ (Καὶ             │        │
├─────────────────────┼──────────────────┼────────┤
│ John 8:11           │ ἁμάρτανε.)       │        │
├─────────────────────┼──────────────────┼────────┤
│ John 9:7            │ (ὃ               │        │
├─────────────────────┼──────────────────┼────────┤
│ John 9:7            │ Ἀπεσταλμένος)    │ .      │
├─────────────────────┼──────────────────┼────────┤
│ John 20:16          │ (ὃ               │        │
├─────────────────────┼──────────────────┼────────┤
│ John 20:16          │ Διδάσκαλε)       │ .      │
├─────────────────────┼──────────────────┼────────┤
│ Acts 22:2           │ —ἀκούσαντες      │        │
├─────────────────────┼──────────────────┼────────┤
│ Romans 4:16         │ (ὅς              │        │
├─────────────────────┼──────────────────┼────────┤
│ Romans 4:17         │ σε,)             │        │
├─────────────────────┼──────────────────┼────────┤
│ Romans 15:25        │ —νυνὶ            │        │
├─────────────────────┼──────────────────┼────────┤
│ I_Corinthians 9:15  │ —τὸ              │        │
├─────────────────────┼──────────────────┼────────┤
│ II_Corinthians 6:2  │ —λέγει           │        │
├─────────────────────┼──────────────────┼────────┤
│ II_Corinthians 12:2 │ —εἴτε            │        │
├─────────────────────┼──────────────────┼────────┤
│ II_Corinthians 12:2 │ —ἁρπαγέντα       │        │
├─────────────────────┼──────────────────┼────────┤
│ II_Corinthians 12:3 │ —εἴτε            │        │
├─────────────────────┼──────────────────┼────────┤
│ Galatians 2:6       │ —ὁποῖοί          │        │
├─────────────────────┼──────────────────┼────────┤
│ Ephesians 1:1       │ [ἐν              │        │
├─────────────────────┼──────────────────┼────────┤
│ Ephesians 1:1       │ Ἐφέσῳ]           │        │
├─────────────────────┼──────────────────┼────────┤
│ Ephesians 5:9       │ —ὁ               │        │
├─────────────────────┼──────────────────┼────────┤
│ Ephesians 5:10      │ —δοκιμάζοντες    │        │
├─────────────────────┼──────────────────┼────────┤
│ Colossians 4:10     │ (περὶ            │        │
├─────────────────────┼──────────────────┼────────┤
│ Colossians 4:10     │ αὐτόν,)          │        │
├─────────────────────┼──────────────────┼────────┤
│ I_Timothy 3:5       │ (εἰ              │        │
├─────────────────────┼──────────────────┼────────┤
│ I_Timothy 3:5       │ ἐπιμελήσεται;)   │        │
├─────────────────────┼──────────────────┼────────┤
│ Hebrews 7:20        │ —οἱ              │        │
├─────────────────────┼──────────────────┼────────┤
│ Hebrews 7:22        │ —κατὰ            │        │
╘═════════════════════╧══════════════════╧════════╛

Note: The following site can be used to build and verify a regular expression: regex101.com (choose the 'Pyton flavor')

In [ ]:

Identifying use of critical signs in the text (N1904GBI)¶

Table of content ¶

1 - Introduction ¶

Back to TOC¶

2 - Load Text-Fabric app and data ¶

Back to TOC¶

3 - Performing the queries ¶

Back to TOC¶

3.1 - Getting an overview of leading critical signs¶

Back to TOC¶

3.2 - Query for all words that contain some critical marks¶

Back to TOC¶

Back to TOC ¶

Back to TOC ¶

Back to TOC ¶

3.1 - Getting an overview of leading critical signs ¶

Back to TOC ¶

3.2 - Query for all words that contain some critical marks ¶

Back to TOC ¶