To get started, consult the start tutorial.
When you analyse a corpus, you are likely to produce data that others can reuse. Maybe you have identified a set of proper-name occurrences, or special numerals, or you have computed part-of-speech assignments.
It is possible to turn these insights into new features, i.e. new .tf files with values assigned to specific nodes.
In the first place, new data is a product of your own methods and computations. But how do you turn that data into new TF features? It turns out that the last step is not that difficult.
If you can shape your data as a mapping (dictionary) from node numbers (integers) to values (strings or integers), then TF can turn that data into a feature file for you with one command.
You can then easily share your new features on GitHub, so that your colleagues everywhere can try it out for themselves.
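Concretely, such a mapping is plain Python. Here is a hypothetical example (the feature name isRoman and the node numbers are made up for illustration):

```python
# A node feature is nothing but a dict from node numbers (integers) to
# values (strings or integers); these node numbers are invented.
isRoman = {
    100733: "XIV",
    100856: "III",
}

# A single TF.save(...) call (shown later in this tutorial) turns such a
# dict into an isRoman.tf feature file.
print(len(isRoman))  # 2
```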
You can add such data on the fly, by passing a mod={org}/{repo}/{path} parameter, or several of them separated by commas.
If the data is there, it will be auto-downloaded and stored on your machine.
Let's do it.
%load_ext autoreload
%autoreload 2
import re
import collections
import os
from tf.app import use
A = use("CLARIAH/wp6-missieven", hoist=globals())
VERSION = A.version
We illustrate the data creation part by creating a new feature, number.
The idea is that we compute a number value for each word that looks like a number but contains OCR errors.
We keep things simple.
We are interested in words that contain only digits and letters, where the number of digits is greater than the number of letters. We exclude words that consist of digits only.
We work only in the original letter content.
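The criterion can be written as a small standalone predicate. This is a sketch; like the corpus loop below, it treats every non-digit character as a "letter":

```python
import re

digitRe = re.compile(r"[0-9]")


def isMangledNumber(chars):
    """Mostly digits, with at least one non-digit mixed in."""
    (rest, nDigits) = digitRe.subn("", chars)
    nOther = len(chars) - nDigits
    return bool(nOther) and nDigits > nOther


print(isMangledNumber("0001b"))  # True: 4 digits, 1 letter
print(isMangledNumber("1681"))   # False: digits only
print(isMangledNumber("ab12"))   # False: digits not in the majority
```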
Let's find them with some hand-written code.
results = []
digitRe = re.compile(r"[0-9]")

for w in F.otype.s("word"):
    chars = F.transo.v(w)
    if not chars:
        continue
    (letters, nDigits) = digitRe.subn("", chars)
    nLetters = len(chars) - nDigits
    if nLetters and nDigits > nLetters:
        results.append(w)
print(results[0:10])
len(results)
[11761, 28520, 30481, 31702, 36287, 37982, 37988, 106832, 112548, 119347]
4727
It happens quite a bit.
Let's have a quick look at the text of the results.
print("\n".join(sorted(F.transo.v(w) for w in results)[0:20]))
0001b 0001b 0001b 0001b 0001b 000© 006½ 022H 024½ 03£ 042V2 051| 052J 053f 053f 062| 0753A 084| 086j 087|
We want to map characters to digits. To get a feel for that, let's make an inventory of the characters that occur in these words.
For each character, count how often it occurs and give at most 10 examples.
inventory = collections.defaultdict(list)

for w in results:
    for c in (trans := F.transo.v(w)):
        if not c.isdigit():
            inventory[c].append(trans)
len(inventory)
61
Quite a few different characters.
for c in sorted(inventory):
    examples = inventory[c]
    n = len(examples)
    showExamples = ", ".join(sorted(examples)[0:10])
    print(f"{c} ({n:>4}x) {showExamples}")
? (  15x) 12?, 144?, 1617?, 16?, 18?, 19?, 286?, 29?, 31?, 413?
A (   9x) 0753A, 13A, 273A, 343A, 3933A, 423A, 43A, 4743A, 553A
C (   1x) 540C3
D (   1x) 1685De
E (   1x) 194845En
H (   4x) 022H, 22H, 2328H, 252H
I (   5x) 217IM, I299v, I85, I85, I85
J (  96x) 052J, 1079J, 1092J, 10J, 10J, 110J, 115J, 1191J, 11J, 121378J
M (   3x) 217IM, 4047M, 564M
O (   4x) 1671Op, 27O4508, O86V2, ÏO011
P (   1x) P10
S (   1x) 16S6
U (   1x) 1U8
V (  76x) 042V2, 1014V2, 1019V2, 1062V4, 1062V4, 10V5, 12V2, 1364V2, 13V2, 14V2
a (   5x) 10a, 11a, 11a, 13a, 1684dat
b (  26x) 0001b, 0001b, 0001b, 0001b, 0001b, 1156bls, 121b, 121b, 121b, 121b
c (  59x) 10c, 12c, 12c, 13c, 13c, 13c, 14c, 14c, 14c, 15c
d (  14x) 100d, 14101de, 1684dat, 29d, d08, d08, d08, d08, d08, d08
e (2952x) 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e
f (  58x) 053f, 053f, 09f, 102f, 108f, 121f, 1222f, 137f, 14f, 14f
g (   9x) 16g, 22g, 28g, 36g, 430g, 6000g, 600g, 705g, 74g
h (   2x) 42h, 605h
i (   4x) 302061in, 496159in, 7897io, 8337tis
j (  24x) 086j, 1023j, 12j, 14j, 14j, 16j, 176j, 236j, 30j, 31j
l (   1x) 1156bls
m (   1x) 366m
n (  44x) 10n, 10n, 10n, 10n, 14n, 14n, 150599en, 15n, 15n, 15n
o (   9x) 24o, 24o, 36o, 36o, 438834V36o, 48o, 5622tot, 5957V3óo, 7897io
p (   2x) 1419p, 1671Op
q (   1x) 2901§§q
r (  24x) 128r, 1300r, 1394rv, 1427r, 149r, 189r, 202r, 20r, 2182r, 256r
s (   6x) 1156bls, 167s, 336s, 4395Vs, 50s, 8337tis
t (   8x) 1684dat, 22t, 4t0, 520t, 5622tot, 5622tot, 6t0, 8337tis
u (   2x) 21u, 417u
v (  20x) 124v, 1394rv, 1426v, 148v, 15v, 15v, 16v, 19v, 212v, 286v
x (   4x) 10x, 18x, 31x, 34x
| ( 232x) 051|, 062|, 084|, 087|, 1034|, 104|, 104|, 106|, 108|, 10|
£ (  51x) 03£, 10£, 10£, 10£, 11£, 12£, 14£, 14£, 14£, 16£
§ (  49x) 090§, 10§, 10§, 10§, 1216§, 1372|§, 139§, 146§, 14§, 166§
© (   1x) 000©
® (  25x) 1000®, 10®, 10®, 125®, 15®, 16®, 1719®, 1®11, 2000®, 20®
° (  10x) 16°, 17°, 20°, 24°, 24°, 25°, 28°, 30°, 51°, °1677
± (   4x) 16±, 28±, 32±, 97±
¼ (   1x) 254¼
½ (   7x) 006½, 024½, 117½, 144½, 22½, 27½, 699½
Ï (   2x) 143Ï, ÏO011
Ö (   1x) Ö00
Ü (   4x) 2328Ü, 516Ü, 659Ü, 929Ü
è (   2x) è60, è70
ï (   8x) 10ï, 166ï, 24ï, 28ï, 292ï, 29ï, 42ï, 8ï4
ó (   3x) 169Vó, 25ó, 5957V3óo
ö (   2x) 189öf, 2ö00
ƒ ( 765x) 12ƒ, 14ƒ, 1753ƒ, 17ƒ, 19ƒ, 8ƒ294, ƒ10, ƒ10, ƒ100, ƒ1002
— (   2x) 1—151, 568—
‘ (   1x) 6440‘
’ (  68x) 36’, d’480, ’19, ’20, ’29, ’34, ’34, ’35, ’35, ’35
“ (   1x) 29“
” (   1x) 1681”
„ (  36x) 12143„, 13757„, 1637„, 3096„, 3546„, 44246„, 615„, „10, „114, „116
™ (   1x) 30™
⌊ (   1x) 1706⌊
We decide to translate a few characters to numerals:
charMapping = {
    "o": 0,
    "ó": 0,
    "ö": 0,
    "Ö": 0,
    "I": 1,
    "J": 1,
    "ï": 1,
    "è": 6,
}
Now we translate all these words with this mapping; if the result is numeric and does not start with a 0, we store the result in a mapping from nodes to numbers.
def cmap(chars):
    n = "".join(str(charMapping.get(c, c)) for c in chars)
    return int(n) if not n.startswith("0") and n.isdigit() else None


number = {w: n for w in results if (n := cmap(F.transo.v(w)))}
len(number)
114
print(number)
{11761: 1151, 368089: 670, 379197: 94001, 379568: 131, 396613: 141, 396656: 20621, 407164: 121, 430354: 121, 432757: 128181, 432879: 1241, 434920: 141, 462917: 621, 464624: 1241, 465415: 631, 472907: 3191, 473135: 9581, 483858: 8191, 486913: 10791, 498619: 8541, 533953: 261, 533968: 331, 535684: 6121, 557983: 77841, 618358: 261, 618871: 4021, 618877: 501, 627195: 261, 653407: 1741, 667437: 15301, 675324: 65931, 750255: 3231, 750445: 5021, 1019955: 10921, 1047395: 1371, 1068377: 52141, 1070934: 49141, 1079667: 2000, 1080766: 72771, 1118656: 4061, 1173348: 161, 1178433: 101, 1196647: 191, 1200319: 201, 1211567: 660, 1230723: 3501, 1234154: 171, 1237203: 111, 1237391: 141, 1250144: 8421, 1253186: 32091, 1271818: 121, 1282202: 75621, 1327325: 121, 1346403: 131, 1352127: 421, 1352309: 421, 1372543: 371, 1379628: 161, 1393864: 2228491, 1443457: 161, 1443464: 361, 1443641: 361, 1443657: 361, 1443666: 101, 1451420: 2981, 1548082: 1101, 1554393: 421, 1653139: 2501, 1669175: 151, 1682688: 4041, 1682700: 1441, 1714540: 721, 1833190: 1213781, 1851679: 1441, 1877221: 98771, 1877228: 977381, 1877230: 167081, 1948091: 925981, 1957857: 15361, 1965567: 181, 2089027: 541, 2126313: 701, 2126473: 621, 2126645: 901, 2126699: 731, 2126709: 911, 2126717: 761, 2126753: 561, 2207671: 1321, 2207675: 361, 2207742: 361, 2351417: 151, 2379398: 121, 2945183: 240, 2968542: 480, 2968588: 240, 2993386: 360, 2993418: 360, 3037496: 250, 3704420: 185, 3820516: 9961, 4086262: 101, 4131362: 185, 4188174: 241, 4262757: 2991, 4277217: 281, 4355285: 291, 4355770: 2921, 4394040: 421, 4412289: 814, 4522464: 1661, 4792505: 121, 4993558: 185, 5146359: 11911}
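To get a feel for what cmap does, here is a self-contained rerun on a few made-up inputs (these strings are invented for illustration; the charMapping is the one defined above):

```python
charMapping = {"o": 0, "ó": 0, "ö": 0, "Ö": 0, "I": 1, "J": 1, "ï": 1, "è": 6}


def cmap(chars):
    # map known OCR misreadings to digits, keep all other characters
    n = "".join(str(charMapping.get(c, c)) for c in chars)
    # accept only fully numeric results without a leading zero
    return int(n) if not n.startswith("0") and n.isdigit() else None


print(cmap("1I5J"))  # 1151: I and J are both read as 1
print(cmap("Ö00"))   # None: maps to 000, which has a leading zero
print(cmap("022H"))  # None: leading zero, and H is not mapped
```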
GITHUB = os.path.expanduser("~/github")
ORG = A.context.org
REPO = A.context.repo
PATH = "exercises/numerics"
Later on, we pass this version on, so that users of our data will get the shared data in exactly the same version as their core data.
We have to specify a bit of metadata for this feature:
metaData = {
    "number": dict(
        valueType="int",
        description="numeric value of corrected number-like strings",
        creator="Dirk Roorda",
    ),
}
Now we can give the save command:
location = f"{GITHUB}/{ORG}/{REPO}/{PATH}/tf"

TF.save(
    nodeFeatures=dict(number=number),
    metaData=metaData,
    location=location,
    module=VERSION,
    silent="auto",
)
0.00s Exporting 1 node and 0 edge and 0 config features to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0:
   |     0.00s T number to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0
  0.00s Exported 1 node features and 0 edge features and 0 config features to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0
True
Here is the data in Text-Fabric format: a feature file.
with open(f"{location}/{VERSION}/number.tf") as fh:
    print(fh.read())
@node
@creator=Dirk Roorda
@description=numeric value of corrected number-like strings
@valueType=int
@writtenBy=Text-Fabric
@dateWritten=2022-10-11T14:56:42Z

11761 1151
368089 670
379197 94001
379568 131
396613 141
396656 20621
407164 121
430354 121
432757 128181
432879 1241
434920 141
462917 621
464624 1241
465415 631
472907 3191
473135 9581
483858 8191
486913 10791
498619 8541
533953 261
533968 331
535684 6121
557983 77841
618358 261
618871 4021
618877 501
627195 261
653407 1741
667437 15301
675324 65931
750255 3231
750445 5021
1019955 10921
1047395 1371
1068377 52141
1070934 49141
1079667 2000
1080766 72771
1118656 4061
1173348 161
1178433 101
1196647 191
1200319 201
1211567 660
1230723 3501
1234154 171
1237203 111
1237391 141
1250144 8421
1253186 32091
1271818 121
1282202 75621
1327325 121
1346403 131
1352127 421
1352309 421
1372543 371
1379628 161
1393864 2228491
1443457 161
1443464 361
1443641 361
1443657 361
1443666 101
1451420 2981
1548082 1101
1554393 421
1653139 2501
1669175 151
1682688 4041
1682700 1441
1714540 721
1833190 1213781
1851679 1441
1877221 98771
1877228 977381
1877230 167081
1948091 925981
1957857 15361
1965567 181
2089027 541
2126313 701
2126473 621
2126645 901
2126699 731
2126709 911
2126717 761
2126753 561
2207671 1321
2207675 361
2207742 361
2351417 151
2379398 121
2945183 240
2968542 480
2968588 240
2993386 360
2993418 360
3037496 250
3704420 185
3820516 9961
4086262 101
4131362 185
4188174 241
4262757 2991
4277217 281
4355285 291
4355770 2921
4394040 421
4412289 814
4522464 1661
4792505 121
4993558 185
5146359 11911
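The format is simple enough to read by hand for a file like this. Here is a minimal parser as a sketch (real .tf files have further conventions, such as tab separators, implicit successor nodes, and edge features, which TF itself handles; this only covers the explicit node-value layout shown above):

```python
def parseSimpleTf(text):
    """Parse a simple .tf node feature file: @key=value header lines,
    a blank line, then lines of 'node value' pairs."""
    meta = {}
    data = {}
    (header, body) = text.split("\n\n", 1)
    for line in header.splitlines():
        if line.startswith("@") and "=" in line:
            (key, value) = line[1:].split("=", 1)
            meta[key] = value
    for line in body.splitlines():
        parts = line.split(maxsplit=1)
        if len(parts) == 2:
            data[int(parts[0])] = int(parts[1])
    return (meta, data)


sample = "@node\n@valueType=int\n\n11761 1151\n368089 670\n"
(meta, data) = parseSimpleTf(sample)
print(meta)  # {'valueType': 'int'}
print(data)  # {11761: 1151, 368089: 670}
```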
How to share your own data is explained in the documentation.
Here we show it step by step for the number feature.
If you commit your changes to the exercises repo and have done a git push origin master, you have already shared your data!
For small feature datasets, you are done.
If it gets serious, there is support for releases and efficient data transfer. Here is how:
Note (releases)
If you want to make a stable release, so that you can keep developing, while your users fall back on the stable data, you can make a new release.
Go to the GitHub website, navigate to your repo, click Releases, and follow the prompts.
Note (release binaries)
If you want to make it even smoother for your users, you can zip the data and attach it as a binary to the release just created.
We need to zip the data in exactly the right directory structure. Text-Fabric can do that for us.
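Under the hood, this amounts to walking a version directory and storing its files with the right relative paths. Here is a rough sketch using Python's zipfile module, assuming the directory layout used above; the text-fabric-zip command below does this, and more, for you:

```python
import os
import zipfile


def zipVersion(baseDir, version, dest):
    """Zip one version directory so that it unpacks as {version}/..."""
    src = os.path.join(baseDir, version)
    with zipfile.ZipFile(dest, "w", zipfile.ZIP_DEFLATED) as zf:
        for (root, dirs, files) in os.walk(src):
            for name in files:
                full = os.path.join(root, name)
                # store paths relative to baseDir, so the version
                # directory is preserved inside the zip
                zf.write(full, os.path.relpath(full, baseDir))
```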
%%sh
text-fabric-zip CLARIAH/wp6-missieven/exercises/numerics/tf
This is a TF dataset
Create release data for CLARIAH/wp6-missieven/exercises/numerics/tf
Found 2 versions
zip files end up in ~/Downloads/None/CLARIAH-release/wp6-missieven
zipping CLARIAH/wp6-missieven 0.9.1 with 1 features ==> exercises-numerics-tf-0.9.1.zip
zipping CLARIAH/wp6-missieven 1.0 with 1 features ==> exercises-numerics-tf-1.0.zip
All versions have been zipped, but it is fine to attach only the newest version to the newest release.
If a user asks for an older version in this release, the system can still find it.
We can use the data by calling it up when we say use("CLARIAH/wp6-missieven", ...), where we pass the data modules in the mod argument.
We will also call up the entity data we created in the annotate chapter.
Note that for each module we can specify a flag: :latest, :hot, or :clone.
If you are the author of the data and want to test it, use :clone: it takes the data from the place where you saved it.
If you are a new user of the data, use :hot (get the latest commit) or :latest (get the latest release) to download the data.
If you have downloaded the data before, leave out the flag.
A = use(
    "CLARIAH/wp6-missieven",
    hoist=globals(),
    mod=(
        "CLARIAH/wp6-missieven/exercises/entities/tf",
        "CLARIAH/wp6-missieven/exercises/numerics/tf",
    ),
    version=VERSION,
    silent=False,
)
This is Text-Fabric 10.2.6
Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html
49 features found and 0 ignored
4.57s All features loaded/computed - for details use TF.isLoaded()
0.42s All additional features loaded - for details use TF.isLoaded()
Above you see new sections in the feature list, which you can expand to see which features each module contributed.
Now, suppose we did not know much about these features; then we would like to do a few basic checks.
A good start is to inspect a frequency list of the values of each new feature, and then to run a query for the nodes that carry these features.
We do that for the entity features and for the number feature.
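A frequency list is conceptually just a count of values, sorted by descending frequency. Here is a sketch of what F.someFeature.freqList() computes, on toy values that are not from the corpus:

```python
import collections


def freqList(values):
    """Pairs (value, frequency), most frequent first, ties by value."""
    counts = collections.Counter(values)
    return tuple(sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])))


print(freqList(["Person", "GPE", "Person", "Organization", "Person"]))
# (('Person', 3), ('GPE', 1), ('Organization', 1))
```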
F.entityId.freqList()
(('T11', 6), ('T2', 5), ('T13', 3), ('T16', 3), ('T8', 3), ('T9', 3), ('T10', 2), ('T15', 2), ('T17', 2), ('T3', 2), ('T5', 2), ('T1', 1), ('T12', 1), ('T4', 1), ('T6', 1), ('T7', 1))
F.entityKind.freqList()
(('Person', 18), ('GPE', 15), ('Organization', 5))
F.entityComment.freqList()
(('Ternate', 5), ('Amboina', 2))
Let's query all words that have an entity annotation:
query = """
word entityId entityKind* entityComment*
"""
results = A.search(query)
4.40s 23 results
Here we query all words where the entityId feature is present.
We also mention the entityKind and entityComment features, but with a * behind them.
That is a criterion which is always true, so these mentions do not alter the result list.
But now these features occur in the query, so when we show results, they will be displayed as well.
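In plain Python terms, a trailing * turns a feature into a condition that always holds, while still registering the feature for display. A toy illustration (invented node data, not how the search engine is actually implemented):

```python
# toy feature data per node; the values are made up for illustration
nodes = {
    1: {"entityId": "T1", "entityKind": "Person"},
    2: {"entityId": "T2"},  # entityKind absent
    3: {},                  # no entity features at all
}

# entityId without * is a real condition: the feature must be present;
# entityKind* is always satisfied and only marks the feature for display
results = [n for (n, fs) in nodes.items() if "entityId" in fs]
print(results)  # [1, 2]
```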
A.show(results, condensed=True)
(rendered display of 12 result lines omitted)
Observation
It is not only words that have entity features; the lines themselves have also received such annotations.
It turns out that annotating lines with entities in this way is not very useful. It would be better to annotate them with the number of entities they contain. That is our feedback to the creator of these annotations, and because we know the GitHub repo they come from, we can file an issue!
F.number.freqList()
((121, 6), (361, 5), (421, 4), (101, 3), (141, 3), (161, 3), (185, 3), (261, 3), (131, 2), (151, 2), (240, 2), (360, 2), (621, 2), (1241, 2), (1441, 2), (111, 1), (171, 1), (181, 1), (191, 1), (201, 1), (241, 1), (250, 1), (281, 1), (291, 1), (331, 1), (371, 1), (480, 1), (501, 1), (541, 1), (561, 1), (631, 1), (660, 1), (670, 1), (701, 1), (721, 1), (731, 1), (761, 1), (814, 1), (901, 1), (911, 1), (1101, 1), (1151, 1), (1321, 1), (1371, 1), (1661, 1), (1741, 1), (2000, 1), (2501, 1), (2921, 1), (2981, 1), (2991, 1), (3191, 1), (3231, 1), (3501, 1), (4021, 1), (4041, 1), (4061, 1), (5021, 1), (6121, 1), (8191, 1), (8421, 1), (8541, 1), (9581, 1), (9961, 1), (10791, 1), (10921, 1), (11911, 1), (15301, 1), (15361, 1), (20621, 1), (32091, 1), (49141, 1), (52141, 1), (65931, 1), (72771, 1), (75621, 1), (77841, 1), (94001, 1), (98771, 1), (128181, 1), (167081, 1), (925981, 1), (977381, 1), (1213781, 1), (2228491, 1))
We see the values that we generated before.
Let's show the original and the number side by side.
results = A.search(
    """
word number transo*
"""
)
1.87s 114 results
A.show(results, start=1, end=10)
(rendered display of results 1-10 omitted)
If more researchers have shared data modules, you can draw them all in.
Then you can design queries that use features from all these different sources.
In that way, you build your own research on top of the work of others.
Hover over the features to see where they come from: in this case, your local GitHub clone.
See the next tutorial in this series for how you can draw in and make use of additional features, produced by a serious algorithm to detect named entities.
CC-BY Dirk Roorda