Before the hands-on session, install all of the following:
Make sure Python 3.6 or higher is installed.
Then install Text-Fabric and Jupyter by following the instructions in the Text-Fabric docs.
Finally, install seaborn, a Python library for drawing plots, with either
pip install seaborn
or
pip3 install seaborn
(If you have installed Text-Fabric, you already know which of the two applies on your system.)
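To confirm that the installation steps above worked, a small check like the following may help. It is only a sketch: the import names "tf" (for Text-Fabric) and "notebook" (for Jupyter) are assumptions about how these packages expose themselves, and the helper names python_ok and missing_modules are made up for this illustration.

```python
import importlib.util
import sys

# Minimum Python version required by this tutorial.
MIN_PYTHON = (3, 6)

# Assumed import names: "tf" for Text-Fabric, "notebook" for Jupyter.
REQUIRED_MODULES = ["tf", "notebook", "seaborn"]


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


def missing_modules(names):
    """Return the top-level module names that the import system cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


print("Python version OK:", python_ok())
print("Missing modules:", missing_modules(REQUIRED_MODULES))
```

If a module is reported missing, running the install through the interpreter you just tested (python -m pip install seaborn, for example) avoids having to guess between pip and pip3.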
If you followed the link to this tutorial, then you see its Jupyter notebooks in read-only mode. You cannot execute code cells.
In order to compute with these notebooks, you have to download them.
Here is the
Unzip it, open a terminal or command prompt, navigate to the new folder, and give the command
Your browser starts up and presents you with a local computing environment where you can run Python programs. You see this start notebook and several notebooks, each dedicated to a specific corpus. Click one of those notebooks.
You see cells like the one below, where you can type programming statements and execute them by pressing
The list of available corpora is here.
There are hands-on guides for
Each hands-on guide takes you through the activities of browsing the corpus, searching it, and collecting information that cannot easily be expressed as a search instruction.
These three corpora differ in section structure, granularity, and the richness of their data features. That is why the toy problems in these hands-on guides are different. It is worth studying all three of them.
Head over to the hands-on guide for the corpus of your prime interest by clicking one of the three links above.
These guides will let you download the corpus data.
Please do that before the hands-on session.