Reading in Climate Data with `xarray`¶

Xarrays is a new-ish Python library that makes working with n-dimensional data relatively painless.

More info: http://xarray.pydata.org/en/stable/

In [ ]:

#Import libraries
import xarray as xr
import numpy as np
import pandas as pd
from matplotlib import pyplot as plt

In [ ]:

#Load the netCDF dataset into an `xarray` object
fileName = 'macav2livneh_pr_bcc-csm1-1_r1i1p1_historical_1990_2005_CONUS_monthly.nc'
ds = xr.open_dataset(fileName)

In [ ]:

#Extract the precipitation data into an xarray dataset object
ds = xr.open_dataset(fileName)

In [ ]:

#Save the precipitation values into a single data array
arrPrecip = ds['precipitation']

In [ ]:

#Display information about the arrPrecip data array
print(arrPrecip)

In [ ]:

#Extract the time dimension from the data array
arrTime = arrPrecip.time

In [ ]:

#Display the first 5 records of the arrTime data array
arrTime[:5].data

In [ ]:

#Extract the lat and long values
arrLat = arrPrecip.lat
arrLon = arrPrecip.lon

In [ ]:

arrLat[30].data,arrLon[30].data

Extract data for one time-location combination¶

Use the xarray dataset's sel function to select the datum nearest the specified time and location

In [ ]:

theTime = np.datetime64('1990-03-15')
theLat = 36.005
theLon = 360-78.942

In [ ]:

#Extract the value corresponding to the point nearest to the specified time & location
theResult = ds.sel(lat=theLat,lon=theLon,time=theTime,method='nearest')

In [ ]:

#Show the precipitation at that point
theResult.precipitation.data

Plot a time series for one location¶

Dropping one of the dimensions in the sel statement retrieves all data in that dimension fitting the criteria specified in the other dimensions. Here, we omit the time constraint.

In [ ]:

#Select only on lat and lon and we get all precip data for all times
theTimeSeries = ds.sel(lat=theLat,lon=theLon,method='nearest')

In [ ]:

#Extract the precipitation data array from the filtered dataset
ts_Precip = theTimeSeries.precipitation

In [ ]:

#Plot the time series
fig = plt.figure(figsize=(15,5))
ts_Precip.plot.line();

Map precipitation for one time period¶

In [ ]:

#Drop the lat and lon filters to grab data for all locations
theMapResult = ds.sel(time=theTime).precipitation

In [ ]:

#Plot the data
fig = plt.figure(figsize=(12,5))
theMapResult.plot(cmap="YlGnBu", #Specifies the colors to use
                  robust=True);  #Drops outliers (<2%,>98%) from plot

Create a spatial subset¶

We uses "slices" to extract subsets of data. Here we subset the data spatially and compute the mean

In [ ]:

#Specify the spatial slices to grab
theLats = slice(25,36.5)
theLons = slice(360-91,360-76)

In [ ]:

#Extract the spatial subset into its own data array
theSubset = ds.sel(lon=theLons,lat=theLats).precipitation

In [ ]:

#Display the shape of the returned data array
theSubset.shape

In [ ]:

#Plot a histogram of the data within the subset
theSubset.plot();

In [ ]:

#Compute the mean across the time axis and show a map
theSubsetAvg = theSubset.mean(axis=0)
theSubsetAvg.plot(cmap="YlGnBu");

In [ ]:

#Fancier plots
plt.figure(figsize=(15,3))
plt.subplot(1,3,1)
theSubset.min(axis=0).plot(cmap="YlGnBu")
plt.subplot(1,3,2)
theSubset.mean(axis=0).plot(cmap="YlGnBu")
plt.subplot(1,3,3)
theSubset.max(axis=0).plot(cmap="YlGnBu")
plt.show();

Calculate summer (JJA) average¶

The xarray package supports seasons to make easy seasonal averages.

In [ ]:

#Create a new data array by converting the dates in the `time` array to seasons
arrSeason = ds['time'].dt.season

In [ ]:

#Replace the time dimension with seasons
ds['time'] = ds['time'].dt.season

In [ ]:

#Extract precipitation for just the summer months; we have 48 summers of data
summerPrecip = ds.sel(time='JJA',lat=theLats,lon=theLons).precipitation
summerPrecip.shape

In [ ]:

summerPrecip.plot();

In [ ]:

#Fancier plots
plt.figure(figsize=(15,5))
plt.subplot(1,2,1)
summerPrecip.mean(axis=0).plot(cmap="YlGnBu")
plt.subplot(1,2,2)
summerPrecip.std(axis=0).plot(cmap="YlGnBu")
#plt.subplot(1,3,3)
#summerPrecip.max(axis=0).plot(cmap="YlGnBu")
plt.show();

Reading in Climate Data with xarray¶

Extract data for one time-location combination¶

Plot a time series for one location¶

Map precipitation for one time period¶

Create a spatial subset¶

Calculate summer (JJA) average¶

Reading in Climate Data with `xarray`¶