This set of notebooks is meant to encourage the exploration of data from the world around us. Notebooks should:
to the tools and ideas of data analysis; and
gallery analysing public datasets.
This is a jupyter notebook, where you can have text "cells" (like this text here) and code "cells" i.e. boxes where you can write python code to be executed (like the one below). No need to install anything (if you run this on http://mybinder.org/) or find compilers, it is all done for you in the background.
For "explore open data", we use python as programming language: it is easy to get started, just type, for example, 1 + 1 in the cell below and click on "Run" icon above.
Now try something more advanced, for example sqrt(4)
That failed: basic python can do some operations but for anything more complex, we need additional software packages or "modules".
That's what we will import here:
import pandas as pd
# pandas is a data analysis package, a bit like Excel
import numpy as np
# numpy is for scientific computing
import matplotlib.pyplot as plt
# matplotlib is for plotting
Now, you want to try whether sqrt(4) works now? No, it does not... you will have to tell jupyter that you want to take the function from numpy (which for brevity was named np above). So try np.sqrt(4)...
Note that you can modify this page at any time, it does no harm. You can add cells (Insert) and change their type from Code (default) to text (i.e. "Markdown") under Cell -> Cell type.