%pylab inline
%load_ext rmagic
Populating the interactive namespace from numpy and matplotlib
recorded data plotted over time. This scrutiny often suggests the method of analysis (e.g. time domain vs. frequency domain) as well as statistics that will be of use in summarizing the information in the data.
%%R
## Load "Applied Statistical Time Series Analysis" companion package;
## it supplies every dataset used below (jj, gtemp, speech, nyse, soi, rec, fmri1, EQ5, EXP6).
library("astsa")
%%R
## Johnson & Johnson quarterly earnings per share — trend plus growing seasonal variation.
data(jj)
plot(jj, type="o", ylab="quarterly earnings per share")
%%R
## Yearly global temperature deviations — a smooth upward-trending series.
data(gtemp)
plot.ts(gtemp, type="o", ylab="Global Temperature Deviations")
aaa ... hhh
aaa ... hhh
%%R
## Speech recording ("aaa...hhh") — short, rapidly repeating signal bursts.
data(speech)
plot(speech, main = "speech")
%%R
## Daily NYSE returns — mean near zero with volatility clustering.
data(nyse)
plot(nyse, ylab = "NYSE Returns")
%%R
## Two related series stacked vertically for visual comparison:
## Southern Oscillation Index and fish Recruitment.
data(soi)
data(rec)
par(mfrow = c(2, 1))  # 2 rows x 1 column layout
plot(soi, main="Southern Oscillation Index")
plot(rec, main = "Recruitment")
%%R
## fMRI BOLD signal: columns 2-5 are cortex series, 6-9 are thalamus/cerebellum
## (column 1 is time — see the printed colnames below).
data(fmri1)
par(mfrow=c(2, 1), mar=c(3, 2, 1, 0)+.5, mgp = c(1.5, .6, 0))  # tighten margins for the stacked panels
ts.plot(fmri1[,2:5], lty=1:4, ylab="BOLD", main="Cortex")
ts.plot(fmri1[,6:9], lty=1:4, ylab="BOLD", main="Thalamus & Cerebellum")
mtext("time (1pt = 2sec)", side = 1, line = 2)
print(colnames(fmri1))
[1] "time" "cort1" "cort2" "cort3" "cort4" "thal1" "thal2" "cere1" "cere2"
%%R
## Seismic traces: an earthquake vs. an explosion — visually distinguished
## by the relative smoothness/roughness of the two phases.
data(EQ5)
data(EXP6)
par(mfrow = c(2, 1))
plot(EQ5, main = "Earthquake")
plot(EXP6, main = "Explosion")
The fundamental visual characteristic distinguishing the different series above is their differing degrees of smoothness.
A time series can be defined as a collection of random variables indexed according to the order in which they are obtained.
Its smoothness is induced by the supposition that adjacent points in time are correlated.
some fundamental models (templates) for time series include
If all time series could be modelled as white noise, classical statistical methods would suffice. Two ways of introducing serial correlation and more smoothness into a series are the moving average and autoregression.
$$v_t = \frac{1}{3} (w_{t-1}+w_t+w_{t+1})$$
You might start to notice the similarity between moving averages of white noise and the SOI or fMRI series.
$$x_t = x_{t-1} - .9x_{t-2} + w_t$$ The equation above represents a regression or prediction of the current value $x_t$ of a time series as a function of the past two values of the series, and, hence, the term autoregression is suggested.
And in the plot, we notice that it starts to display periodic behavior, which is similar to that displayed by the speech data.
$$x_t = \delta + x_{t-1} + w_{t}$$ where $w_t$ is the white noise and $\delta$ is a constant called drift. and when $\delta=0$, it is simply called a random walk. Note that we may rewrite the recursion function for random walk as the cumulative sum of white noise variable, e.g. $$x_t = {\delta}t+\sum_{j=1}^{t}{w_j}$$
$$x_t = 2\cos(2{\pi}t/50+.6{\pi})+w_t$$ where the signal part is a sinusoidal waveform that can be generally written as $$A\cos(2{\pi}{\omega}t+\phi)$$ where $A$ is the amplitude, $\omega$ is the frequency of oscillation and $\phi$ is a phase shift. e.g. $\omega=1/50$ above means one cycle per 50 time units (points)
%%R
## Simulate the fundamental model templates: white noise, moving average,
## autoregression, random walk with drift, and signal-plus-noise.
## WHITE NOISE
wn <- rnorm(500, 0, 1) # 500 N(0, 1) variates
## MOVING AVERAGE by filtering
## sides = 2 gives a centered filter, producing NA at both ends
mv <- filter(wn, sides = 2, rep(1/3, 3)) # 1/3[w(t-1)+w(t)+w(t+1)]
## - a linear combination of time series values is referred to as a filtered series
## AUTOREGRESSION
## recursive filter with coefficients c(1, -.9) realizes x_t = x_{t-1} - .9 x_{t-2} + w_t;
## the first 50 values are discarded as burn-in
w <- rnorm(550, 0, 1) # 50 extra to avoid startup problems
ar <- filter(w, filter = c(1, -.9), method ="recursive")[-(1:50)]
## NOTE(review): no par(mfrow=...) is set before these plot.ts calls, so on a
## standard device each plot replaces the previous one — confirm this is the
## intended notebook behavior (each figure rendered separately).
plot.ts(wn, main="white noise")
plot.ts(mv, main="moving average")
## Random Walk with Drift
set.seed(1)  # fix the RNG so the two walks are reproducible
w <- rnorm(200, 0, 1); x <- cumsum(w) # drift = 0
wd <- w + .2; xd <- cumsum(wd) # drift = .2
plot.ts(ar, main="autoregression")
plot.ts(xd, ylim = c(-5, 55), main="Random Walk with Drift", col=1)
lines(x, col=2)  # overlay the zero-drift walk in red
legend("topleft", c("drift=0.2", "drift=0"), col=c(1, 2), lty=1)
## Signal in Noise
## sinusoid with amplitude 2, frequency 1/50 (one cycle per 50 points), phase .6*pi
cs <- 2*cos(2*pi*(1:500)/50 + .6*pi)
w <- rnorm(500, 0, 1)
par(mfrow=c(3, 1), mar=c(3, 2, 2, 1), cex.main=1.5)
plot.ts(cs, main = expression(2*cos(2*pi*t/50+.6*pi)))
plot.ts(cs+w, main = expression(2*cos(2*pi*t/50+.6*pi) + N(0, 1)))
plot.ts(cs+5*w, main = expression(2*cos(2*pi*t/50+.6*pi) + N(0, 25)))  # stronger noise buries the signal
## the method parameter to filter - "recursive" for autoregression
## - "convolution" for moving average
the series $x_t$ is said to lead $y_t$ for $l>0$ and is said to lag $y_t$ for $l<0$
The concept of weak stationarity forms the basis for much of the analysis performed with time series
## ACF of typical time series (outline of the series examined in the next cell)
## white noise - wn
#Acf(wn)
## autoregression - ar
## random walk with drift - xd
## signal in noise - cs + 5*w (the sinusoid plus scaled white noise)
%%R
## Sample ACFs of the simulated series from the earlier cell.
library(forecast)  # Acf() omits the trivial lag-0 spike that base acf() draws
par(mfrow=c(3, 2))
Acf(wn, main = "white noise")                          # no serial structure
Acf(na.omit(mv), main = "three-point moving average")  # drop the NAs produced by the centered filter
Acf(ar, main = "autoregression")                       # damped oscillating decay
Acf(xd, main = "random walk with drift = 0.2")         # slow near-linear decay (nonstationary)
## fix: the title promises "signal with random noise", but cs+5 only shifts the
## sinusoid by a constant — which leaves its ACF identical to that of cs alone.
## The noisy signal (as plotted earlier) is cs + 5*w.
Acf(cs + 5*w, main = "signal with random noise", lag.max = 50)
This is forecast 4.8
%%R
## As a comparison, the ACF of some real data
par(mfrow=c(2, 2))
## periodic spikes in the ACF reveal the repetition period of each series
Acf(speech, main = "speech aaa...hhh", lag.max=150)
mtext(side=1, line = 3, "repeating peaks spaced at about 106-109\n it contains a series of repeating short signals")
Acf(soi, main = "SOI Temp", lag.max=50)
mtext(side = 1, line = 3, "repeating peaks located at around 12")
Acf(rec, main = "Rec Fish Amount", lag.max=50)
## cross-correlation: a peak at negative lag means soi leads rec
ccf(soi, rec, lag.max=50, main = "ccf between SOI and Rec")
mtext(side = 1, line = 3, "repeating peak located at around -6 \n so soi is -6 leading of rec")
%%R
par(mfrow=c(2, 1))
## Cross-Correlation Function between pairs of the simulated series
## white noise vs. its own three-point moving average (drop the filter() NAs)
ccf(wn, na.omit(mv), main = "white noise vs 3point moving average")
## moving average vs. autoregression
## fix: corrected the misspelled plot title ("autoregresion" -> "autoregression")
ccf(na.omit(mv), ar, main = "3point moving average vs autoregression")
The ACF conveys information about the signatures of a time series, but it does not necessarily tell whether a time series is stationary or not, as the correlation plot is not against time.