Metocean altimetry track comparison for global wave model¶

This notebook demonstrates the use of ModelSkill on a large dataset: more than 9 million satellite altimetry track observation points compared with a global wave model.

Note: run download.ipynb first to fetch the data files used below.

In [1]:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import modelskill as ms
from matplotlib_inline.backend_inline import set_matplotlib_formats
set_matplotlib_formats('png')

Big data¶

The data files loaded in this section come from download.ipynb, so that notebook must have been run beforehand.

In [2]:

fn = '../data/SW_gwm_3a_extracted_2018.dfs0'
mr = ms.model_result(fn, name='GWM', item='Sign. Wave Height', gtype="track")
mr

Out[2]:

<TrackModelResult> 'GWM' (n_points: 9141126)

In [3]:

o1 = ms.TrackObservation('../data/altimetry_3a_2018_filter1.dfs0', item=2, name='3a')

In [4]:

cmp = ms.match(o1, mr)

In [5]:

# Skill for the subset of the data up to 15 January 2018
cmp.sel(end='2018-1-15').skill()

Out[5]:

	n	bias	rmse	urmse	mae	cc	si	r2
observation
3a	372356	-0.475229	0.633093	0.418287	0.510757	0.940399	0.124015	0.72003

In [6]:

cmp.skill()

Out[6]:

	n	bias	rmse	urmse	mae	cc	si	r2
observation
3a	9105388	-0.489389	0.646012	0.421699	0.520279	0.943032	0.122967	0.720777
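The columns in the skill tables above are common error statistics. The sketch below computes a few of them with NumPy on synthetic data, assuming the usual textbook definitions and a residual defined as model minus observation. It is not ModelSkill's implementation, and the helper name basic_skill and the synthetic values are made up for illustration.

```python
import numpy as np

def basic_skill(obs, mod):
    """Common error statistics for paired observation/model values."""
    obs = np.asarray(obs, dtype=float)
    mod = np.asarray(mod, dtype=float)
    res = mod - obs                             # residual: model minus observation (assumed convention)
    bias = res.mean()
    rmse = np.sqrt(np.mean(res**2))
    urmse = np.sqrt(np.mean((res - bias)**2))   # RMSE after removing the mean error
    mae = np.mean(np.abs(res))
    cc = np.corrcoef(obs, mod)[0, 1]            # Pearson correlation coefficient
    return dict(n=obs.size, bias=bias, rmse=rmse, urmse=urmse, mae=mae, cc=cc)

rng = np.random.default_rng(0)
obs = rng.gamma(shape=4.0, scale=0.6, size=10_000)       # synthetic "observed" Hm0 in m
mod = 0.9 * obs - 0.2 + rng.normal(0.0, 0.3, obs.size)   # synthetic "model" that reads too low

s = basic_skill(obs, mod)
print({k: round(float(v), 3) for k, v in s.items()})

# With these definitions, rmse**2 equals bias**2 + urmse**2 up to rounding
print(np.isclose(s["rmse"]**2, s["bias"]**2 + s["urmse"]**2))
```

The values in Out[6] are consistent with this decomposition: 0.489389² + 0.421699² is approximately 0.646012². Here most of the squared error comes from the negative bias rather than from scatter.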

Gridded skill¶

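Gridded skill with 1 deg bins. The bin edges are passed explicitly as a (longitude, latitude) pair; note that np.arange excludes its stop value, so the edges below end at 179 and 89. Cells with fewer than n_min=20 points are not reported.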

In [7]:

gs = cmp.gridded_skill(metrics=['bias'], bins=(np.arange(-180,180,1), np.arange(-90,90,1)), n_min=20)
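To make the idea of a gridded metric concrete, the following is a minimal pandas sketch of binning point residuals into longitude/latitude cells and computing the count and bias per cell. It is an illustration under stated assumptions, not how ModelSkill implements gridded_skill: the data are synthetic, the cells are deliberately coarse, and the column names (x, y, res, xbin, ybin) are hypothetical.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)
n = 5_000
df = pd.DataFrame({
    "x": rng.uniform(-180, 180, n),     # longitude in degrees
    "y": rng.uniform(-90, 90, n),       # latitude in degrees
    "res": rng.normal(-0.5, 0.4, n),    # synthetic residual (model minus observation) in m
})

# Coarse 60 x 45 degree cells; stop values of 181 and 91 make arange include +180 and +90
x_edges = np.arange(-180, 181, 60)
y_edges = np.arange(-90, 91, 45)

# Assign each point to a cell; right=False gives left-closed cells [edge_i, edge_i+1)
df["xbin"] = pd.cut(df["x"], x_edges, right=False)
df["ybin"] = pd.cut(df["y"], y_edges, right=False)

# Count and mean residual (bias) per cell
cells = df.groupby(["ybin", "xbin"], observed=False)["res"].agg(n="count", bias="mean")

n_min = 20                                                 # same role as n_min in the cell above
cells["bias"] = cells["bias"].where(cells["n"] >= n_min)   # blank out sparsely populated cells

n_grid = cells["n"].unstack("xbin")        # rows: latitude bins, columns: longitude bins
bias_grid = cells["bias"].unstack("xbin")
print(n_grid.shape)
print(bias_grid.round(2))
```

Masking cells with few points matters more for satellite tracks than for this uniform synthetic sample, because track coverage is uneven and a mean over a handful of points is noisy.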

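Add attributes (long name and units) so that the plots are labelled, then plot the number of points and the bias per grid cell.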

In [8]:

type(gs)

Out[8]:

modelskill.skill_grid.SkillGrid

In [9]:

gs.bias.data.attrs = dict(long_name="Bias of significant wave height, Hm0",units="m")
gs.n.data.attrs = dict(long_name="N of significant wave height",units="-")
fig, axes = plt.subplots(ncols=1, nrows=2, figsize = (8, 10))
gs.n.plot(ax=axes[0])
gs.bias.plot(ax=axes[1]);

Multiple bins - gridded skill for wave height¶

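Convert the comparer data to a dataframe (all_df), add a categorical column val_cat that classifies each point by the mean of the modelled and observed wave height, and assign that column back to the comparer data so it can be used in the by argument of gridded_skill.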

In [10]:

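# Comparer data as a dataframe
all_df = cmp.data.to_dataframe()
# Mean of modelled and observed Hm0 for each point
mean_val = all_df[['GWM','Observation']].mean(axis=1)
# right=False makes the bins left-closed, matching the [a, b) labels
all_df['val_cat'] = pd.cut(mean_val, [0,2,5,np.inf], right=False, labels=["Hm0[m]=[0, 2)","Hm0[m]=[2, 5)","Hm0[m]=[5, inf)"])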
all_df.head()

Out[10]:

	x	y	Observation	z	GWM	val_cat
time
2018-01-01 00:00:00	-33.706020	23.181158	2.611	NaN	2.292599	Hm0[m]=[2, 5)
2018-01-01 00:00:01	-33.720741	23.240074	2.608	NaN	2.292612	Hm0[m]=[2, 5)
2018-01-01 00:00:02	-33.735474	23.298990	2.518	NaN	2.292624	Hm0[m]=[2, 5)
2018-01-01 00:00:03	-33.750214	23.357904	2.729	NaN	2.292637	Hm0[m]=[2, 5)
2018-01-01 00:00:04	-33.764965	23.416819	2.593	NaN	2.292650	Hm0[m]=[2, 5)
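One detail of pd.cut is worth checking when labels are written by hand: by default the bins are closed on the right, whereas labels such as [0, 2) describe bins closed on the left, which corresponds to right=False. The small sketch below uses made-up values on and around the bin edges to show where the two settings differ.

```python
import numpy as np
import pandas as pd

edges = [0, 2, 5, np.inf]
labels = ["Hm0[m]=[0, 2)", "Hm0[m]=[2, 5)", "Hm0[m]=[5, inf)"]
hm0 = pd.Series([1.9, 2.0, 4.9, 5.0, 7.3])   # made-up values on and around the edges

# Default (right=True): bins are (0, 2], (2, 5], (5, inf]
right_closed = pd.cut(hm0, edges, labels=labels)
# right=False: bins are [0, 2), [2, 5), [5, inf), as the labels state
left_closed = pd.cut(hm0, edges, labels=labels, right=False)

print(pd.DataFrame({"Hm0": hm0, "right=True": right_closed, "right=False": left_closed}))
```

Only values exactly on an edge (2.0 and 5.0 here) change category, so for continuous wave heights the practical difference is small, but the labels and the bins then agree.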

In [11]:

cmp.data["val_cat"] = all_df["val_cat"]

In [12]:

cmp.data

In [13]:

gs = cmp.gridded_skill(by=["val_cat"], metrics=["bias"], bins=(np.arange(-180,180,5), np.arange(-90,90,5)), n_min=20)

In [14]:

gs.data['bias'].attrs = dict(long_name="Bias of significant wave height, Hm0", units="m")
gs.data['n'].attrs = dict(long_name="N of significant wave height", units="-")
gs.data['val_cat'].attrs = dict(long_name="Range of sign. wave height, Hm0", units="m")

In [15]:

gs.n.plot(figsize=(12,4));

In [16]:

gs.bias.plot(figsize=(12,4));

Map¶

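To draw the gridded results on a map projection, see the xarray plotting documentation on maps: http://xarray.pydata.org/en/stable/plotting.html#maps

This requires cartopy: https://scitools.org.uk/cartopy/docs/latest/installing.html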