This notebook was put together by Jake Vanderplas for PyCon 2015. Source and license info is on GitHub.
Outline:
9:00 - 9:15 Preliminaries: Setup & introduction
9:15 - 10:00 Basic Principles of Machine Learning and the Scikit-learn Interface
10:00 - 10:45 Supervised learning in-depth
10:45 - 11:00: break
11:00 - 11:45 Unsupervised learning in-depth
11:45 - 12:20 Validation and Model Selection
This tutorial requires the following packages:
numpy
version 1.5 or later: http://www.numpy.org/scipy
version 0.10 or later: http://www.scipy.org/matplotlib
version 1.3 or later: http://matplotlib.org/scikit-learn
version 0.14 or later: http://scikit-learn.orgipython
version 2.0 or later, with notebook support: http://ipython.orgseaborn
: version 0.5 or later, used mainly for plot stylingThe easiest way to get these is to use the conda environment manager. I suggest downloading and installing miniconda.
The following command will install all required packages:
$ conda install numpy scipy matplotlib scikit-learn ipython-notebook
Alternatively, you can download and install the (very large) Anaconda software distribution, found at https://store.continuum.io/.
You can run the following code to check the versions of the packages on your system:
(in IPython notebook, press shift
and return
together to execute the contents of a cell)
from __future__ import print_function
import IPython
print('IPython:', IPython.__version__)
import numpy
print('numpy:', numpy.__version__)
import scipy
print('scipy:', scipy.__version__)
import matplotlib
print('matplotlib:', matplotlib.__version__)
import sklearn
print('scikit-learn:', sklearn.__version__)
import seaborn
print('seaborn', seaborn.__version__)
IPython: 2.4.1 numpy: 1.9.2 scipy: 0.15.1 matplotlib: 1.4.3 scikit-learn: 0.15.2 seaborn 0.5.1