To get started: consult start

Explore additional data¶

Once you analyse a corpus, it is likely that you produce data that others can reuse. Maybe you have defined a set of proper name occurrences, or special numerals, or you have computed part-of-speech assignments.

It is possible to turn these insights into new features, i.e. new .tf files with values assigned to specific nodes.

Make your own data¶

New data is a product of your own methods and computations in the first place. But how do you turn that data into new TF features? It turns out that the last step is not that difficult.

If you can shape your data as a mapping (dictionary) from node numbers (integers) to values (strings or integers), then TF can turn that data into a feature file for you with one command.

You can then easily share your new features on GitHub, so that your colleagues everywhere can try it out for themselves.

You can add such data on the fly, by passing a mod={org}/{repo}/{path} parameter, or a bunch of them separated by commas.

If the data is there, it will be auto-downloaded and stored on your machine.

Let's do it.

In [1]:

%load_ext autoreload
%autoreload 2

In [2]:

import collections
import os

from tf.app import use

In [3]:

A = use("Nino-cunei/oldbabylonian", hoist=globals())

TF-app: ~/text-fabric-data/Nino-cunei/oldbabylonian/app

data: ~/text-fabric-data/Nino-cunei/oldbabylonian/tf/1.0.6

This is Text-Fabric 9.2.2
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html

67 features found and 0 ignored

Text-Fabric: Text-Fabric API 9.2.2, Nino-cunei/oldbabylonian/app v3, Search Reference
Data: OLDBABYLONIAN, Character table, Feature docs
Features:

Old Babylonian Letters 1900-1600: Cuneiform tablets

ARK

str

persistent identifier of type ARK from metadata field "UCLA Library ARK"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

after

str

what comes after a sign or word (- or space)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

afterr

str

what comes after a sign or word (- or space); between adjacent signs a ␣ is inserted

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

afteru

str

what comes after a sign when represented as unicode (space)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atf

str

full atf of a sign (without cluster chars) or word (including cluster chars)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atfpost

str

atf of cluster closings at sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atfpre

str

atf of cluster openings at sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

author

str

author from metadata field "Author(s)"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

col

int

ATF column number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

collated

int

whether a sign is collated (*)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

collection

str

collection of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

comment

str

$ comment to line or inline comment to slot ($ and $)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

damage

int

whether a sign is damaged

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

det

int

whether a sign is a determinative gloss - between braces { }

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

docnote

str

additional remarks in the document identification

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

docnumber

str

number of a document within a collection-volume

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

excavation

str

excavation number from metadata field "Excavation no."

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

excised

int

whether a sign is excised - between double angle brackets << >>

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

face

str

full name of a face including the enclosing object

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

flags

str

sequence of flags after a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

fraction

str

fraction of a numeral

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

genre

str

genre from metadata field "Genre"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

grapheme

str

grapheme of a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

graphemer

str

grapheme of a sign using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

graphemeu

str

grapheme of a sign using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lang

str

language of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

langalt

int

1 if a sign is in the alternate language (i.e. Sumerian) - between underscores _ _

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

ln

int

ATF line number of a numbered line, without prime

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lnc

str

ATF line identification of a comment line ($)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lnno

str

ATF line number, may be $ or #, with prime; column number prepended

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

material

str

material indication from metadata field "Material"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

missing

int

whether a sign is missing - between square brackets [ ]

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

museumcode

str

museum code from metadata field "Museum no."

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

museumname

str

museum name from metadata field "Collection"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

object

str

name of an object of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operator

str

the ! or x in a !() or x() construction

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operatorr

str

the ! or x in a !() or x() construction, represented as =, ␣

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operatoru

str

the ! or x in a !() or x() construction, represented as =, ␣

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

otype

str

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

period

str

period indication from metadata field "Period"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

pnumber

str

P number of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

primecol

int

whether a prime is present on a column number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

primeln

int

whether a prime is present on a line number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

pubdate

str

publication date from metadata field "Publication date"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

question

int

whether a sign has the question flag (?)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

reading

str

reading of a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

readingr

str

reading of a sign using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

readingu

str

reading of a sign using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

remarkable

int

whether a sign is remarkable (!)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

remarks

str

# comment to line

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

repeat

int

repeat of a numeral; the value n (unknown) is represented as -1

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcLn

str

full line in source file

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcLnNum

int

line number in source file

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcfile

str

source file name of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

subgenre

str

genre from metadata field "Sub-genre"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

supplied

int

whether a sign is supplied - between angle brackets < >

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

sym

str

essential part of a sign or of a word

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

symr

str

essential part of a sign or of a word using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

symu

str

essential part of a sign or of a word using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

trans

int

whether a line has a translation

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

transcriber

str

person who did the encoding into ATF from metadata field "ATF source"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

translation@ll

str

translation of line in language en = English

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

type

str

name of a type of cluster or kind of sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

uncertain

int

whether a sign is uncertain - between brackets ( )

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

volume

int

volume of a document within a collection

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

oslots

none

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

Text-Fabric API: names N F E L T S C TF directly usable

Making data¶

We illustrate the data creation part by creating a new feature, ummama. The idea is that we mark every sign reading that occurs between um-ma and ma some where in the first 3 lines of a face. We want to mark every occurrence of such signs elsewhere in the corpus with ummama=1.

We only do it if the sign between the um-ma and ma (which must be on the same line) is not missing, damaged, or questionable.

The easiest way to get started is to run a query:

In [4]:

query = """
line ln<4
  =: sign reading=um missing# damage# question#
  <: sign reading=ma missing# damage# question#
% the next sign is the one that we are after
  < sign missing# damage# question#
  < sign reading=ma missing# damage# question#
"""

In [5]:

results = A.search(query)

  1.29s 3466 results

In [6]:

A.table(results, end=10)

n	p	line	sign	sign	sign	sign
1	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	_{d}	ma
2	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	en-	ma
3	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	lil2_-	ma
4	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	sza-	ma
5	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	du-	ma
6	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	u2-	ma
7	P509373 obverse:3	um-ma _{d}en-lil2_-sza-du-u2-ni-ma	um-	ma	ni-	ma
8	P481190 obverse:3	um-ma nu#-ur2#-i3-li2-szu-ma	um-	ma	i3-	ma
9	P481190 obverse:3	um-ma nu#-ur2#-i3-li2-szu-ma	um-	ma	li2-	ma
10	P481190 obverse:3	um-ma nu#-ur2#-i3-li2-szu-ma	um-	ma	szu-	ma

Observe how the signs between um-ma and ma are picked up, except the damaged nu and ur2.

First we are collect these readings, and survey the frequencies in the result.

Some signs do not have a reading, but then they have a grapheme. If they do not have a grapheme, they might be comment signs, and we skip them.

In [7]:

umaReadings = collections.Counter()

# collect

for (line, um, ma1, sign, ma2) in results:
    reading = F.reading.v(sign) or F.grapheme.v(sign)
    if not reading:
        continue
    umaReadings[reading] += 1

# show

print(f"Found {len(umaReadings)} distinct readings")
limit = 20

for (reading, amount) in sorted(
    umaReadings.items(),
    key=lambda x: (-x[1], x[0]),
)[0:limit]:
    print(f"{reading:<6} {amount:>4} x")
print(f" ... and {len(umaReadings) - limit} more ...")

Found 249 distinct readings
d       324 x
a       133 x
ra      128 x
mu      123 x
am      112 x
ha       99 x
na       95 x
pi2      94 x
suen     78 x
i        66 x
ni       66 x
szu      66 x
utu      61 x
um       59 x
li2      55 x
tum      55 x
ma       50 x
marduk   50 x
bi       46 x
nu       43 x
 ... and 229 more ...

Now we visit all signs in the whole corpus and check whether their reading or grapheme is in this set. If so, we give that sign a value 1 in the dictionary ummama.

In [8]:

ummama = {}

allSigns = F.otype.s("sign")

for s in allSigns:
    reading = F.reading.v(s) or F.grapheme.v(s)
    if not reading:
        continue
    if reading in umaReadings:
        ummama[s] = 1

print(f"Assigned `ummama=1` to {len(ummama)} sign occurrences out of {len(allSigns)}")

Assigned `ummama=1` to 182221 sign occurrences out of 203219

Note that the majority of all signs also occurs between um-ma and ma at the start of a document.

Maybe this is an indication that we are not capturing the idea of selecting specific signs, we may have to strengthen our search criterion.

But that is beyond this tutorial. We suppose these ummama words form a valuable set that we want to share.

Saving data¶

The documentation explains how to save this data into a text-fabric data file.

We choose a location where to save it, the exercises repository in the Nino-cunei organization, in the folder analysis.

In order to do this, we restart the TF API, but now with the desired output location in the locations parameter.

In [9]:

GITHUB = os.path.expanduser("~/github")
ORG = "Nino-cunei"
REPO = "exercises"
PATH = "bab-analysis"
VERSION = A.version

Note the version: we have built the version against a specific version of the data:

In [10]:

A.version

Out[10]:

'1.0.6'

Later on, we pass this version on, so that users of our data will get the shared data in exactly the same version as their core data.

We have to specify a bit of metadata for this feature:

In [11]:

metaData = {
    "ummama": dict(
        valueType="int",
        description="reading occurs somewhere between um-ma and ma",
        creator="Dirk Roorda",
    ),
}

Now we can give the save command:

In [12]:

TF.save(
    nodeFeatures=dict(ummama=ummama),
    metaData=metaData,
    location=f"{GITHUB}/{ORG}/{REPO}/{PATH}/tf",
    module=VERSION,
)

  0.00s Exporting 1 node and 0 edge and 0 config features to ~/github/Nino-cunei/exercises/bab-analysis/tf/1.0.6:
   |     0.15s T ummama               to ~/github/Nino-cunei/exercises/bab-analysis/tf/1.0.6
  0.16s Exported 1 node features and 0 edge features and 0 config features to ~/github/Nino-cunei/exercises/bab-analysis/tf/1.0.6

Out[12]:

True

How to share your own data is explained in the documentation.

Here we show it step by step for the ummama feature.

If you commit your changes to the exercises repo, and have done a git push origin master, you already have shared your data!

If you want to make a stable release, so that you can keep developing, while your users fall back on the stable data, you can make a new release.

Go to the GitHub website for that, go to your repo, and click Releases and follow the nudges.

If you want to make it even smoother for your users, you can zip the data and attach it as a binary to the release just created.

We need to zip the data in exactly the right directory structure. Text-Fabric can do that for us:

In [13]:

%%sh

text-fabric-zip Nino-cunei/exercises/bab-analysis/tf

This is a TF dataset
Create release data for Nino-cunei/exercises/bab-analysis/tf
Found 1 versions
zip files end up in ~/Downloads/Nino-cunei-release/exercises
zipping Nino-cunei/exercises      1.0.6 with   1 features ==> bab-analysis-tf-1.0.6.zip

All versions have been zipped, but it works OK if you only attach the newest version to the newest release.

If a user asks for an older version in this release, the system can still find it.

Here is the result for our case

ummama

Use the data¶

We can use the data by calling it up when we say use('Nino-cunei/oldbabylonian', ...).

Here is how:

In [14]:

A = use(
    "Nino-cunei/oldbabylonian:clone",
    checkout="clone",
    hoist=globals(),
    mod="Nino-cunei/exercises/bab-analysis/tf:clone",
)

TF-app: ~/github/Nino-cunei/oldbabylonian/app

data: ~/github/Nino-cunei/oldbabylonian/tf/1.0.6

data: ~/github/Nino-cunei/exercises/bab-analysis/tf/1.0.6

This is Text-Fabric 9.2.2
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html

68 features found and 0 ignored
   |      |     0.34s C __characters__       from otext
   |     0.74s T ummama               from ~/github/Nino-cunei/exercises/bab-analysis/tf/1.0.6

Text-Fabric: Text-Fabric API 9.2.2, Nino-cunei/oldbabylonian/app v3, Search Reference
Data: OLDBABYLONIAN, Character table, Feature docs
Features:

Nino-cunei/exercises/bab-analysis/tf

ummama

int

reading occurs somewhere between um-ma and ma

creator:

Dirk Roorda

dateWritten:

2022-01-31T10:31:57Z

writtenBy:

Text-Fabric

Old Babylonian Letters 1900-1600: Cuneiform tablets

ARK

str

persistent identifier of type ARK from metadata field "UCLA Library ARK"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

after

str

what comes after a sign or word (- or space)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

afterr

str

what comes after a sign or word (- or space); between adjacent signs a ␣ is inserted

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

afteru

str

what comes after a sign when represented as unicode (space)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atf

str

full atf of a sign (without cluster chars) or word (including cluster chars)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atfpost

str

atf of cluster closings at sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:07Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

atfpre

str

atf of cluster openings at sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

author

str

author from metadata field "Author(s)"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

col

int

ATF column number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

collated

int

whether a sign is collated (*)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

collection

str

collection of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

comment

str

$ comment to line or inline comment to slot ($ and $)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

damage

int

whether a sign is damaged

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

det

int

whether a sign is a determinative gloss - between braces { }

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

docnote

str

additional remarks in the document identification

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

docnumber

str

number of a document within a collection-volume

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

excavation

str

excavation number from metadata field "Excavation no."

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

excised

int

whether a sign is excised - between double angle brackets << >>

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

face

str

full name of a face including the enclosing object

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

flags

str

sequence of flags after a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

fraction

str

fraction of a numeral

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

genre

str

genre from metadata field "Genre"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

grapheme

str

grapheme of a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

graphemer

str

grapheme of a sign using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

graphemeu

str

grapheme of a sign using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lang

str

language of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

langalt

int

1 if a sign is in the alternate language (i.e. Sumerian) - between underscores _ _

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

ln

int

ATF line number of a numbered line, without prime

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lnc

str

ATF line identification of a comment line ($)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

lnno

str

ATF line number, may be $ or #, with prime; column number prepended

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

material

str

material indication from metadata field "Material"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

missing

int

whether a sign is missing - between square brackets [ ]

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

museumcode

str

museum code from metadata field "Museum no."

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

museumname

str

museum name from metadata field "Collection"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

object

str

name of an object of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operator

str

the ! or x in a !() or x() construction

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operatorr

str

the ! or x in a !() or x() construction, represented as =, ␣

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

operatoru

str

the ! or x in a !() or x() construction, represented as =, ␣

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

otype

str

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

period

str

period indication from metadata field "Period"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

pnumber

str

P number of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

primecol

int

whether a prime is present on a column number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

primeln

int

whether a prime is present on a line number

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

pubdate

str

publication date from metadata field "Publication date"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

question

int

whether a sign has the question flag (?)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

reading

str

reading of a sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

readingr

str

reading of a sign using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

readingu

str

reading of a sign using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:08Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

remarkable

int

whether a sign is remarkable (!)

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

remarks

str

# comment to line

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

repeat

int

repeat of a numeral; the value n (unknown) is represented as -1

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcLn

str

full line in source file

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcLnNum

int

line number in source file

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

srcfile

str

source file name of a document

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

subgenre

str

genre from metadata field "Sub-genre"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

supplied

int

whether a sign is supplied - between angle brackets < >

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

sym

str

essential part of a sign or of a word

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

symr

str

essential part of a sign or of a word using non-ascii characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

symu

str

essential part of a sign or of a word using cuneiform unicode characters

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:09Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

trans

int

whether a line has a translation

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

transcriber

str

person who did the encoding into ATF from metadata field "ATF source"

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

translation@ll

str

translation of line in language en = English

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

type

str

name of a type of cluster or kind of sign

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

uncertain

int

whether a sign is uncertain - between brackets ( )

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

volume

int

volume of a document within a collection

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

oslots

none

converters:

Cale Johnson, Dirk Roorda

dateWritten:

2020-06-26T09:20:10Z

editor:

Cale Johnson et al.

institute:

CDL

name:

AbB Old Babylonian Cuneiform

writtenBy:

Text-Fabric

Text-Fabric API: names N F E L T S C TF directly usable

Above you see a new section in the feature list: Nino-cunei/exercises/analysis/tf with our foreign feature in it: ummama.

Now, suppose did not know much about this feature, then we would like to do a few basic checks:

In [15]:

F.ummama.freqList()

Out[15]:

((1, 182221),)

We see that the feature has only one value, 1, and that 182222 nodes have it.

Which nodes have a ummama feature?

In [16]:

{F.otype.v(n) for n in N.walk() if F.ummama.v(n)}

Out[16]:

{'sign'}

Only signs have the feature.

Let's have a look at a table of some ummama signs.

In [17]:

results = A.search(
    """
sign ummama
"""
)

  0.23s 182221 results

In [18]:

A.table(results, start=1, end=20)

n	p	sign
1	P509373 obverse:1	[a-
2	P509373 obverse:1	na]
3	P509373 obverse:1	_{d}
4	P509373 obverse:1	suen_-
5	P509373 obverse:1	i-
6	P509373 obverse:1	[din-
7	P509373 obverse:1	nam]
8	P509373 obverse:2	qi2-
9	P509373 obverse:2	[ma]
10	P509373 obverse:3	um-
11	P509373 obverse:3	ma
12	P509373 obverse:3	_{d}
13	P509373 obverse:3	en-
14	P509373 obverse:3	lil2_-
15	P509373 obverse:3	sza-
16	P509373 obverse:3	du-
17	P509373 obverse:3	u2-
18	P509373 obverse:3	ni-
19	P509373 obverse:3	ma
20	P509373 obverse:4	_{d}

Now let's get some non-ummama signs:

In [19]:

results = A.search(
    """
sign ummama#
"""
)

  0.12s 20998 results

In [20]:

A.table(results, start=1, end=20)

n	p	sign
1	P509373 obverse:2	bi2-
2	P509373 obverse:5	t,u2-
3	P509373 obverse:6	a2-
4	P509373 obverse:6	gal2
5	P509373 obverse:9	2(esze3)
6	P509373 obverse:9	gud_
7	P509373 obverse:10	gar3_
8	P509373 obverse:10	ag-
9	P509373 obverse:10	_uru_
10	P509373 obverse:11	kam_
11	P509373 obverse:12	_uru_
12	P509373 obverse:12	ak-
13	P509373 obverse:13	2(esze3)
14	P509373 obverse:13	szuku_
15	P509373 obverse:13	_nagar-
16	P509373 obverse:14	gar3
17	P509373 obverse:14	uru_
18	P509373 obverse:14	[...]
19	P509373 obverse:15	[...]
20	P509373 obverse:$a	$ rest broken

Let's get lines with both ummama and non-ummama signs:

In [21]:

results = A.search(
    """
line
  sign ummama
  sign ummama#
"""
)

  0.58s 133413 results

In [22]:

A.table(results, start=1, end=2, condensed=True)

n	p	line	sign	sign	sign
1	P509373 obverse:2	qi2-bi2-[ma]	qi2-	bi2-	[ma]
2	P509373 obverse:5	li-ba-al-li-t,u2-u2-ka	li-	ba-	al-	li-	t,u2-	u2-	ka

With highlights:

In [23]:

highlights = {}

for s in F.otype.s("sign"):
    color = "lightsalmon" if F.ummama.v(s) else "mediumaquamarine"
    highlights[s] = color

In [24]:

A.table(
    results, start=1, end=10, baseTypes="sign", condensed=True, highlights=highlights
)

n	p	line	sign	sign	sign
1	P509373 obverse:2	qi2-bi2-[ma]	qi2-	bi2-	[ma]
2	P509373 obverse:5	li-ba-al-li-t,u2-u2-ka	li-	ba-	al-	li-	t,u2-	u2-	ka
3	P509373 obverse:6	{disz}sze-ep-_{d}suen a2-gal2 [dumu] um-mi-a-mesz_	{disz}	sze-	ep-	_{d}	suen	a2-	gal2	[dumu]	um-	mi-	a-	mesz_
4	P509373 obverse:9	2(esze3) _a-sza3_ s,i-[bi]-it {disz}[ku]-un-zu-lum _sza3-gud_	2(esze3)	_a-	sza3_	s,i-	[bi]-	it	{disz}	[ku]-	un-	zu-	lum	_sza3-	gud_
5	P509373 obverse:10	_a-sza3 a-gar3_ na-ag-[ma-lum] _uru_ x x x{ki}	[ma-	lum]	_uru_	x	x	x	{ki}	_a-	sza3	a-	gar3_	na-	ag-
6	P509373 obverse:11	sza _{d}utu_-ha-zi-[ir] isz-tu _mu 7(disz) kam_ id-di-nu-szum	sza	_{d}	utu_-	ha-	zi-	[ir]	isz-	tu	_mu	7(disz)	kam_	id-	di-	nu-	szum
7	P509373 obverse:12	u3 i-na _uru_ x-szum{ki} sza-ak-nu id-di-a-am-ma	id-	di-	a-	am-	ma	u3	i-	na	_uru_	x-	szum	{ki}	sza-	ak-	nu
8	P509373 obverse:13	2(esze3) _a-sza3 szuku_ i-li-ib-bu s,i-bi-it _nagar-mesz_	2(esze3)	_a-	sza3	szuku_	i-	li-	ib-	bu	s,i-	bi-	it	_nagar-	mesz_
9	P509373 obverse:14	_a-sza3 a-gar3 uru_ ra-bu-um x [...]	_a-	sza3	a-	gar3	uru_	ra-	bu-	um	x	[...]
10	P509373 obverse:15	x x x x x x [...]	x	x	[...]	x	x	x	x

If we do a pretty display, the ummama feature shows up.

In [25]:

A.show(
    results,
    start=1,
    end=3,
    baseTypes="sign",
    condensed=True,
    withNodes=True,
    highlights=highlights,
)

line 1

P509373 obverse:2

line:230789

word:258165 qi2-bi2-[ma]

8 qi2-

ummama=1

9 bi2-

cluster:203224

10 [ma]

ummama=1

line 2

P509373 obverse:5

line:230792

word:258173 li-ba-al-li-t,u2-u2-ka

32 li-
ummama=1
33 ba-
ummama=1
34 al-
ummama=1
35 li-
ummama=1
36 t,u2-
37 u2-
ummama=1
38 ka
ummama=1

line 3

P509373 obverse:6

line:230793

word:258174 {disz}sze-ep-_{d}suen

cluster:203233

39 {disz}

ummama=1

40 sze-

ummama=1

41 ep-

ummama=1

cluster:203234

cluster:203235

42 _{d}

ummama=1

43 suen

ummama=1

word:258175 a2-gal2

cluster:203234

44 a2-

45 gal2

word:258176 [dumu]

cluster:203234

cluster:203236

46 [dumu]

ummama=1

word:258177 um-mi-a-mesz_

cluster:203234

47 um-
ummama=1
48 mi-
ummama=1
49 a-
ummama=1
50 mesz_
ummama=1

Or in the context of a whole face:

In [26]:

A.show(
    results,
    start=1,
    end=1,
    condensed=True,
    condenseType="face",
    withNodes=False,
    highlights=highlights,
)

face 1

P509373 obverse

face P509373 obverse

line

word [a-na]

word _{d}suen_-i-[din-nam]

line

word qi2-bi2-[ma]

line

word um-ma

word _{d}en-lil2_-sza-du-u2-ni-ma

line

word _{d}utu_

word u3

word _{d}[marduk]_

word a-na

word da-ri-a-[tim]

line

word li-ba-al-li-t,u2-u2-ka

line

word {disz}sze-ep-_{d}suen

word a2-gal2

word [dumu]

word um-mi-a-mesz_

line

word ki-a-am

word u2-lam-mi-da-an-ni

word um-[ma]

word szu-u2-[ma]

line

word {disz}sa-am-su-ba-ah-li

word sza-pi2-ir

word ma-[tim]

line

word 2(esze3)

word _a-sza3_

word s,i-[bi]-it

word {disz}[ku]-un-zu-lum

word _sza3-gud_

line

word _a-sza3

word a-gar3_

word na-ag-[ma-lum]

word _uru_

word x

word x{ki}

line

word sza

word _{d}utu_-ha-zi-[ir]

word isz-tu

word _mu

word 7(disz)

word kam_

word id-di-nu-szum

line

word u3

word i-na

word _uru_

word x-szum{ki}

word sza-ak-nu

word id-di-a-am-ma

line

word 2(esze3)

word _a-sza3

word szuku_

word i-li-ib-bu

word s,i-bi-it

word _nagar-mesz_

line

word _a-sza3

word a-gar3

word uru_

word ra-bu-um

word x

word [...]

line

word x

word [...]

line

$ rest broken

All together!¶

If more researchers have shared data modules, you can draw them all in.

Then you can design queries that use features from all these different sources.

In that way, you build your own research on top of the work of others.

Hover over the features to see where they come from, and you'll see they come from your local GitHub repo.

All chapters:

start become an expert in creating pretty displays of your text structures
display become an expert in creating pretty displays of your text structures
search turbo charge your hand-coding with search templates
exportExcel make tailor-made spreadsheets out of your results
share draw in other people's data and let them use yours
similarLines spot the similarities between lines

See the cookbook for recipes for small, concrete tasks.

CC-BY Dirk Roorda

Sharing data features¶

Explore additional data¶

Make your own data¶

Share your new data¶

Making data¶

Saving data¶

Sharing data¶

Use the data¶

All together!¶