Quads¶

When simple signs are stacked, we get composite signs, which we call quads here. There are several ways to compose quads from sub-quads, and an operator is always involved. A composition can itself be subjected to another composition, and so on.

In [2]:

%load_ext autoreload
%autoreload 2

In [3]:

from tf.app import use

In [4]:

A = use("Nino-cunei/uruk", hoist=globals())

TF-app: ~/text-fabric-data/Nino-cunei/uruk/app

data: ~/text-fabric-data/Nino-cunei/uruk/tf/uruk/1.0

This is Text-Fabric 9.2.2
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html

33 features found and 0 ignored

Text-Fabric: Text-Fabric API 9.2.2, Nino-cunei/uruk/app v3, Search Reference
Data: URUK, Character table, Feature docs
Features:

Uruk IV/III: Proto-cuneiform tablets

catalogId

str

identifier of tablet in catalog (http://www.flutopedia.com/tablets.htm)

dataset:

uruk

datasetName:

Cuneiform tablets from the Uruk IV-III period

dateWritten:

2018-05-01T14:10:47Z

email1:

https://www.universiteitleiden.nl/en/staffmembers/cale-johnson#tab-1

email2:

dirk.roorda@dans.knaw.nl

encoders:

CDLI (transcription),Cale Johnson (expertise)and Dirk Roorda (TF)

source:

CLDI

sourceUrl:

https://cdli.ucla.edu

version:

1.0

writtenBy:

Text-Fabric

crossref

str

damage

int

indicates damage of signs or quads,corresponds to #-flag in transcription

depth

int

excavation

str

excavation number of tablet

fragment

str

level between tablet and face

fullNumber

str

the combination of face type and column number on columns

grapheme

str

name of a grapheme (glyph)

identifier

str

additional information pertaining to the name of a face

modifier

str

indicates modification of a sign; corresponds to sign@letter in transcription. If the grapheme is a repeat, the modification applies to the whole repeat.

modifierFirst

str

indicates the order between modifiers and variants on the same object; if 1, modifiers come before variants

modifierInner

str

indicates modification of a sign within a repeat; corresponds to sign@letter in transcription

name

str

name of tablet

number

str

number of a column or line or case

otype

str

period

str

period that characterises the tablet corpus

prime

int

indicates the presence/multiplicity of a prime (single quote)

remarkable

int

corresponds to ! flag in transcription

repeat

int

number indicating the number of repeats of a grapheme,especially in numerals; -1 comes from repeat N in transcription

srcLn

str

transcribed line

srcLnNum

int

line number in transcription file

terminal

str

text

str

text of comment nodes

type

str

type of a face; type of a comment; type of a cluster;type of a sign

uncertain

int

corresponds to ?-flag in transcription

variant

str

allograph for a sign, corresponds to ~x in transcription

variantOuter

str

allograph for a quad, corresponds to ~x in transcription

written

str

corresponds to !(xxx) flag in transcription

comments

none

links comment nodes to their targets

op

str

operator connecting left to right operand in a quad

oslots

none

sub

none

connects line or case with sub-cases, quad with sub-quads; clusters with sub-clusters

Text-Fabric API: names N F E L T S C TF directly usable

data: ~/text-fabric-data/Nino-cunei/uruk/sources/cdli/images

Found 2095 ideograph linearts

Found 2724 tablet linearts

Found 5495 tablet photos

We need our example tablet (again). It is particularly relevant to this chapter in our tutorial: it contains the most deeply nested quad in the whole corpus.

In [5]:

pNum = "P005381"
query = f"""
tablet catalogId={pNum}
"""
results = A.search(query)
A.lineart(results[0][0], width=200)
A.show(results, withNodes=True)

  0.01s 1 result

P005381

result 1

P005381

tablet:148166 P005381

MSVO 3, 70 uruk-iii catalogId=P005381

comment:178162

atf: lang qpc

face:156932 obverse

column:190362 1

line:254173 1

case:167736 1a

106585 2(N14)

106586 SZE~a

106587 SAL

106588 TUR3~a

106589 NUN~a

case:167737 1b

106590 3(N19)

quad:143013

106591 GISZ

.

106592 TE

line:254174 2

106593 1(N14)

106594 NAR

106595 NUN~a

106596 SIG7

line:254175 3

106597 2(N04)#

106598 PIRIG~b1

106599 SIG7

106600 URI3~a

106601 NUN~a

column:190363 2

line:254176 1

106602 3(N04)

quad:143014

106603 GISZ

.

106604 TE

106605 GAR

quad:143015

106606 SZU2

.

quad:143016

quad:143017

106607 HI

+

106608 1(N57)

+

quad:143018

106609 HI

+

106610 1(N57)

106611 GI4~a

line:254177 2

106612 GU7

106613 AZ

106614 SI4~f

face:156933 reverse

column:190364 1

line:254178 1

106615 3(N14)

106616 SZE~a

line:254179 2

106617 3(N19)

106618 5(N04)

line:254180 3

106619 GU7

column:190365 2

line:254181 1

106620 AZ

106621 SI4~f

The components of quads are either sub-quads or signs. Sub-quads are also quads in TF, and they are always a composition. Whenever a member of a sub-quad is no longer a composition, it is a sign.

Let's try to unravel the structure of the biggest quad in this tablet.

Find the quad¶

First we need to get the node of this quad. Above we have seen the source code of the tablet in which it occurs. From that we can pick the node of the case that contains it:

In [6]:

case = A.nodeFromCase(("P005381", "obverse:2", "1"))
print(A.getSource(case))
A.pretty(case, withNodes=True)

['1. 3(N04) , |GISZ.TE| GAR |SZU2.((HI+1(N57))+(HI+1(N57)))| GI4~a ']

P005381 obverse:2:1

line:254176 1

106602 3(N04)

quad:143014

106603 GISZ

.

106604 TE

106605 GAR

quad:143015

106606 SZU2

.

quad:143016

quad:143017

106607 HI

+

106608 1(N57)

+

quad:143018

106609 HI

+

106610 1(N57)

106611 GI4~a

We can easily read off the node number of this big quad.

But we can also do it programmatically.

In order to identify our super-quad, we list all quad nodes that are part of this case. For every quad we list the node numbers of the signs contained in it.

In order to know what signs are contained in any given node, we use the feature oslots. Like the feature otype, this is a standard feature that is always available in a TF dataset.

Unlike otype, oslots is an edge feature: there is an edge between every node and every slot contained in it.

Whereas you use F to do stuff with node features, you use E to do business with edge features.

And whereas you use F.feature.v(node) to get the feature value of a node, you use E.oslots.s(node) to get the slots contained in node, i.e. the nodes that are reached by an oslots edge starting at node.

In [7]:

for node in L.d(case, otype="quad"):
    print(f"{node:>6} {E.oslots.s(node)}")

143014 array('I', [106603, 106604])
143015 array('I', [106606, 106607, 106608, 106609, 106610])
143016 array('I', [106607, 106608, 106609, 106610])
143017 array('I', [106607, 106608])
143018 array('I', [106609, 106610])

We see what the biggest quad is. We could have been a bit more friendly to ourselves by showing the actual graphemes in the quads.

In [8]:

for node in L.d(case, otype="quad"):
    print(f'{node:>6} {" ".join(F.grapheme.v(s) for s in E.oslots.s(node))}')

143014 GISZ TE
143015 SZU2 HI N57 HI N57
143016 HI N57 HI N57
143017 HI N57
143018 HI N57
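The containment idea behind oslots can be mimicked outside Text-Fabric with a plain dictionary. The sketch below is only a toy stand-in: the name `oslots` and the helper `contains` are assumptions made for illustration, and the numbers are copied from the output above. It shows that a sub-quad is recognisable by the fact that all its slots also belong to the enclosing quad.

```python
# Toy stand-in for the oslots edges printed above: quad node -> slots it contains.
# This is NOT the Text-Fabric API; the dictionary only mirrors the cell output.
oslots = {
    143014: (106603, 106604),
    143015: (106606, 106607, 106608, 106609, 106610),
    143016: (106607, 106608, 106609, 106610),
    143017: (106607, 106608),
    143018: (106609, 106610),
}


def contains(outer, inner):
    """Return True if every slot of `inner` also belongs to `outer` (hypothetical helper)."""
    return set(oslots[inner]) <= set(oslots[outer])


# all quads embedded in quad 143015, excluding the quad itself
nested = [q for q in oslots if q != 143015 and contains(143015, q)]
print(nested)
```

With the real data you would read the slots from E.oslots.s(node) instead of from a hand-written dictionary.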

So let us get the node of the biggest quad.

In [9]:

# pick the quad in this case that contains the most slots (signs)
bigQuad = max(L.d(case, otype="quad"), key=lambda q: len(E.oslots.s(q)))
bigQuad

Out[9]:

143015

Lo and behold, it is precisely the big quad.

This is what we are talking about:

In [10]:

A.lineart(bigQuad)

|SZU2.((HI+1(N57))+(HI+1(N57)))|

Quad structure¶

Now we are going to retrieve its components by following edges.

When we converted the data to Text-Fabric, we made edges from quad nodes to the nodes of their component quads and signs.

We also made edges between sibling quads and signs.

We can distinguish between kinds of edges by means of edge features.

The edges that go down in a structure have a feature sub.

In order to follow the sub edges from a node, you use

E.sub.f(node).

This will give you a list of nodes that can be reached from node by following a sub edge.

Edges can be traveled in the opposite direction as well:

E.sub.t(node).

This will give you the nodes from which there is a sub edge to node.

In [11]:

E.sub.f(bigQuad)

Out[11]:

(106606, 143016)

or, more friendly:

In [12]:

for node in E.sub.f(bigQuad):
    print(f'{node:>6} {" ".join(F.grapheme.v(s) for s in E.oslots.s(node))}')

106606 SZU2
143016 HI N57 HI N57
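To make the two directions of travel concrete, here is a small sketch that models sub edges as plain Python pairs. Everything in it is an assumption for illustration: the edge list is reconstructed from the tablet display above, and `sub_f` and `sub_t` are hypothetical helpers that merely imitate the behaviour described for E.sub.f and E.sub.t.

```python
from collections import defaultdict

# Hypothetical toy data: (parent, child) pairs reconstructed from the display above.
sub_edges = [
    (143015, 106606),
    (143015, 143016),
    (143016, 143017),
    (143016, 143018),
    (143017, 106607),
    (143017, 106608),
    (143018, 106609),
    (143018, 106610),
]

# Index the edges once in both directions.
forward = defaultdict(tuple)
backward = defaultdict(tuple)
for parent, child in sub_edges:
    forward[parent] += (child,)
    backward[child] += (parent,)


def sub_f(node):
    """Nodes reached by following a sub edge from `node` (going down)."""
    return forward.get(node, ())


def sub_t(node):
    """Nodes from which a sub edge leads to `node` (going up)."""
    return backward.get(node, ())


print(sub_f(143015))
print(sub_t(143016))
```

Signs have no outgoing sub edges in this toy model, so `sub_f` returns an empty tuple for them.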

Let us unravel the whole structure by means of a function:

In [13]:

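def unravelQuad(quad):
    # a sign is a leaf: represent it by its grapheme
    if F.otype.v(quad) == "sign":
        return F.grapheme.v(quad)
    # otherwise follow the sub edges down and unravel each component
    subQuads = E.sub.f(quad)
    unraveledSubQuads = [unravelQuad(subQuad) for subQuad in subQuads]
    return f'<{", ".join(unraveledSubQuads)}>'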


unravelQuad(bigQuad)

Out[13]:

'<SZU2, <<HI, N57>, <HI, N57>>>'
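The same recursion can also measure how deeply a quad is nested. The sketch below works on a toy dictionary of sub edges reconstructed from the output above; the names `sub` and `nesting_depth` are assumptions for illustration and are not part of Text-Fabric or of this dataset.

```python
# Hypothetical toy data: quad node -> its components, as seen in the display above.
sub = {
    143015: (106606, 143016),
    143016: (143017, 143018),
    143017: (106607, 106608),
    143018: (106609, 106610),
}


def nesting_depth(node):
    """Number of composition levels below `node`; a sign has depth 0."""
    children = sub.get(node, ())
    if not children:
        return 0
    return 1 + max(nesting_depth(child) for child in children)


print(nesting_depth(143015))
```

In this toy model the big quad has three levels of composition: the quad itself, the sub-quad after the dot, and the two innermost sub-quads.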

Operators¶

Where have the operators gone?

They are present as a feature op of edges between sibling quads and signs.

In [14]:

for child in E.sub.f(bigQuad):
    for (right, op) in E.op.f(child):
        print(child, op, right)

106606 . 143016

Note that whereas E.sub.f yields nodes, E.op.f yields pairs (node, op-value), because the op edges carry a value.

The best way to find this out is to consult the Feature docs. The link is always present below the cell where you called use() for the first time.
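The difference between plain edges and edges that carry a value can be sketched with ordinary Python data. The dictionary below is a toy reconstruction from the outputs in this chapter, and `op_f` and `op_t` are hypothetical helpers; they only imitate the shape of the results, not the Text-Fabric implementation.

```python
# Toy stand-in for valued edges: left operand -> ((right operand, operator), ...).
# Reconstructed from the outputs in this chapter; not read from the real dataset.
op_edges = {
    106606: ((143016, "."),),
    143017: ((143018, "+"),),
    106607: ((106608, "+"),),
    106609: ((106610, "+"),),
}


def op_f(node):
    """Pairs (target, value) for the op edges leaving `node`."""
    return op_edges.get(node, ())


def op_t(node):
    """Pairs (source, value) for the op edges arriving at `node`."""
    return tuple(
        (left, value)
        for left, pairs in op_edges.items()
        for right, value in pairs
        if right == node
    )


# list every sibling pair together with its operator
for left in op_edges:
    for right, value in op_f(left):
        print(left, value, right)
```

Because each result is a pair, the loop has to unpack it into a node and a value, just as the cell above does with `(right, op)`.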

Can we try to adapt the unravel function above to get the operators?

Yes:

In [15]:

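def unravelQuad(quad):
    # a sign is a leaf: represent it by its grapheme
    if F.otype.v(quad) == "sign":
        return F.grapheme.v(quad)
    subQuads = E.sub.f(quad)
    result = "<"
    for sq in subQuads:
        # an op edge leads from a left operand to its right sibling;
        # this simple version is written with two operands per quad in mind
        for (rq, operator) in E.op.f(sq):
            leftRep = unravelQuad(sq)
            rightRep = unravelQuad(rq)
            result += f"{leftRep} {operator} {rightRep}"
    result += ">"
    return result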


unravelQuad(bigQuad)

Out[15]:

'<SZU2 . <<HI + N57> + <HI + N57>>>'

This technique is employed fully in the function A.atfFromQuad():

In [16]:

print(A.atfFromQuad(bigQuad))

|SZU2.((HI+1(N57))+(HI+1(N57)))|

We have tested the function A.atfFromQuad() on all quads in the corpus, and it regenerates the exact ATF transliterations for them, except for two cases where the ATF has unnecessary brackets. See checks.
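To give an idea of what such a regeneration involves, here is a minimal sketch that rebuilds the ATF string of the big quad from toy data. It is not the implementation of A.atfFromQuad(): the dictionaries `sub`, `op`, `grapheme` and `repeat` and the function `render` are assumptions for illustration, filled with values read off from the outputs in this chapter, and features such as variants, modifiers and flags are ignored.

```python
# Hypothetical toy data, reconstructed from the outputs in this chapter.
sub = {
    143015: (106606, 143016),
    143016: (143017, 143018),
    143017: (106607, 106608),
    143018: (106609, 106610),
}
# operator that follows each left operand
op = {106606: ".", 143017: "+", 106607: "+", 106609: "+"}
grapheme = {106606: "SZU2", 106607: "HI", 106608: "N57", 106609: "HI", 106610: "N57"}
# repeat counts of numerals, as in 1(N57)
repeat = {106608: 1, 106610: 1}


def render(node, outer=True):
    """Build an ATF-like string for a sign or quad in the toy data."""
    if node not in sub:
        # a sign: numerals are written as repeat(grapheme)
        count = repeat.get(node)
        name = grapheme[node]
        return name if count is None else f"{count}({name})"
    parts = []
    for child in sub[node]:
        parts.append(render(child, outer=False))
        if child in op:
            # the operator sits between this child and its right sibling
            parts.append(op[child])
    body = "".join(parts)
    # the outermost quad is wrapped in pipes, nested quads in brackets
    return f"|{body}|" if outer else f"({body})"


print(render(143015))
```

Appending each child once and then its operator, rather than printing left and right operands per edge, also keeps chains of more than two operands intact.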

Next¶

jumps

Leap to the next level ...

CC-BY Dirk Roorda