**Vinzenz Gregor Eck**, Expert Analytics, Oslo
**Jacob Sturdy**, Department of Structural Engineering, NTNU

Date: Jul 13, 2018
# ipython magic
%matplotlib notebook
%load_ext autoreload
%autoreload 2
# plot configuration
import matplotlib
import matplotlib.pyplot as plt
plt.style.use("ggplot")
# import seaborn as sns # sets another style
matplotlib.rcParams['lines.linewidth'] = 3
fig_width, fig_height = (7.0,5.0)
matplotlib.rcParams['figure.figsize'] = (fig_width, fig_height)
# font = {'family' : 'sans-serif',
# 'weight' : 'normal',
# 'size' : 18.0}
# matplotlib.rc('font', **font) # pass in the font dict as kwargs
import chaospy as cp
import numpy as np
from numpy import linalg as LA
To conduct UQSA with the polynomial chaos (PC) method we use the python package chaospy. This package includes many features and methods to apply non-intrusive polynomial chaos to any model with few lines of code. Since chaospy allows for convenient definition of random variables, calculation of joint distributions, and generation of samples, we apply chaospy for the Monte Carlo method as well. In the following we will briefly describe the theory of polynomial chaos and give a practical step-by-step guide for the application of chaospy. For a more in-depth introduction to the polynomial chaos method see [xiu_numerical_2010; smith_uncertainty_2013].
The main concept of polynomial chaos is to approximate the stochastic response of a model with stochastic inputs by a polynomial expansion relating the response to the inputs. For simplicity, we consider a model function $f$ which takes random variables $\mathbf{Z}$ and non-random variables, such as spatial coordinates $x$ or time $t$, as input:

$$ y = f(x, t, \mathbf{Z}) $$

We then seek a polynomial expansion that approximates the model response:

$$ y \approx y_{N} = \sum_{i=0}^{P} v_i(x, t) \, \Phi_i(\mathbf{Z}), $$

where $v_i$ are the expansion coefficients, which depend only on the non-stochastic parameters, and $\Phi_i$ are orthogonal polynomials, which depend solely on the random input parameters $\mathbf{Z}$. Once the polynomial expansion is calculated, the statistics of the approximated model output can be calculated analytically.
Since polynomial chaos generally requires fewer model evaluations than Monte Carlo methods to estimate statistics, it is the preferable method for models with long computation times.
Polynomial chaos may be applied in a wide variety of situations. The requirement for convergence is simply that the function is square integrable with respect to the inner product associated with the orthogonal polynomials. In other words, if the function has a finite variance, polynomial chaos may be applied without worrying too much. The caveat is that convergence may not necessarily be fast: the smoother the function, the faster the convergence. Polynomial chaos can handle discontinuities; however, it may be advisable to reformulate the problem in these situations (see Feinberg, Eck and Langtangen 2016...).
As stated above, the polynomial chaos expansion consists of a sum of basis polynomials in the input parameters $\mathbf{Z}$. Using orthogonal polynomials as the basis improves the efficiency of the convergence and simplifies the use of the expansion for uncertainty quantification and sensitivity analysis. Orthogonality of functions is a general concept developed in functional analysis within an inner product space. Typically, the inner product of two functions is defined as a weighted integral of the product of the functions over a domain of interest.
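For polynomial chaos, the weight is the probability distribution of $\mathbf{Z}$, so the inner product of two functions $f$ and $g$ can be written as the expectation of their product:

$$ \langle f, g \rangle = \int_{\Omega} f(\mathbf{z}) \, g(\mathbf{z}) \, dF_{\mathbf{Z}}(\mathbf{z}) = \mathbb{E}\left( f(\mathbf{Z}) \, g(\mathbf{Z}) \right) $$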
The following equalities hold for the orthogonal basis polynomials used in polynomial chaos:

$$ \mathbb{E}\left( \Phi_i(\mathbf{Z}) \, \Phi_j(\mathbf{Z}) \right) = \langle \Phi_i, \Phi_j \rangle = h_j \, \delta_{ij}, $$

where $h_j$ is the normalisation factor of the polynomials used, $\delta_{ij}$ is the Kronecker delta, and $\Phi_i(\mathbf{Z})$ indicates the substitution of the random variable $\mathbf{Z}$ as the polynomial's variable. Note that $\Phi_0$ is a polynomial of degree zero, i.e. a constant, thus $\mathbb{E}(\Phi_0\Phi_j) = \Phi_0 \mathbb{E}(\Phi_j) = 0$ for $j \neq 0$, which implies that $\mathbb{E}(\Phi_j) = 0$ for $j \neq 0$.
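These orthogonality relations can be verified numerically with chaospy by computing expectations of products of basis polynomials. A minimal sketch, using a standard normal variable and a normalised basis:

# verify orthogonality of the basis polynomials numerically
n = cp.Normal(0, 1)
phi = cp.orth_ttr(3, n, normed=True)
print(cp.E(phi[1]*phi[2], n))  # off-diagonal term: approximately 0
print(cp.E(phi[2]*phi[2], n))  # diagonal term: approximately 1 for a normed basis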
Once the random inputs $\mathbf{Z}$ are properly defined with marginal distributions, orthogonal polynomials can be constructed. For most univariate distributions, suitable polynomial basis functions are known and listed in the Wiener-Askey scheme. A set of orthogonal polynomials can also be created from arbitrary basis polynomials with orthogonalization methods such as Cholesky decomposition, three terms recursion, and modified Gram-Schmidt.
There are two non-intrusive ways of approximating the polynomial chaos expansion coefficients: stochastic collocation (regression) and discrete projection (pseudo-spectral projection).
Supposing a polynomial expansion approximation $y_{N} = \sum_i v_i \Phi_i$, stochastic collocation specifies a set of collocation nodes, $\Theta_{P} = \left\{\mathbf{z}^{(s)}\right\}_{s=1}^{P}$, at which the approximation is required to match the deterministic model values, i.e. $y_{N}=y$. The task is thus to find suitable coefficients $v_i$ such that this condition is satisfied. Given such a set of collocation nodes $\left\{\mathbf{z}^{(s)}\right\}_{s=1}^{P}$, $y_{N} = \sum_i v_i \Phi_i$ can be formed into a linear system of equations for the coefficients $v_i$ at these nodes:

$$ \begin{pmatrix} \Phi_0(\mathbf{z}^{(1)}) & \cdots & \Phi_N(\mathbf{z}^{(1)}) \\ \vdots & & \vdots \\ \Phi_0(\mathbf{z}^{(P)}) & \cdots & \Phi_N(\mathbf{z}^{(P)}) \end{pmatrix} \begin{pmatrix} v_0 \\ \vdots \\ v_N \end{pmatrix} = \begin{pmatrix} y(\mathbf{z}^{(1)}) \\ \vdots \\ y(\mathbf{z}^{(P)}) \end{pmatrix} $$
Now we can use regression to obtain the relaxed condition that $y_{N}$ is "sufficiently close" to $y$ at $\left\{\mathbf{z}^{(s)}\right\}_{s=1}^{P}$. This is done by choosing a larger number of samples, so that the linear system is overdetermined, and then minimizing an appropriate error norm $\lVert y_{N} - y \rVert_{R}$ over $\left\{\mathbf{z}^{(s)}\right\}_{s=1}^{P}$. Ordinary least squares, ridge regression, and Tikhonov regularization are all regression methods that may be applied to this problem.
Discrete projection refers to the approximation of the coefficients of $y_{N} = \sum_{i=0}^{P} v_i \Phi_i$ by directly approximating the orthogonal projection coefficients

$$ v_i = \frac{1}{h_i} \, \mathbb{E}\left( y \, \Phi_i \right) = \frac{1}{h_i} \int_{\Omega} y \, \Phi_i \, dF_{\mathbf{Z}}, $$

using a quadrature scheme to calculate the integral $\int_{\Omega} y \, \Phi_i \, dF_{\mathbf{Z}}$ as a sum $\sum_{s} w_s \, y(\mathbf{z}^{(s)}) \, \Phi_i(\mathbf{z}^{(s)})$, where $\mathbf{z}^{(s)}$ are the quadrature nodes and $w_s$ the quadrature weights.
This results in an approximation $\tilde{v}_{i}$ of $v_{i}$, and the error of the final approximation $\tilde{y}_{N}$ may be split as

$$ \lVert y - \tilde{y}_{N} \rVert \leq \lVert y - y_{N} \rVert + \lVert y_{N} - \tilde{y}_{N} \rVert, $$

where the first term is called the truncation error and the second term the quadrature error. Thus one may consider the maximal accuracy attainable for a given polynomial order $P$, which is approached as the quadrature error is reduced to almost zero by increasing the number of quadrature nodes.
Once the expansion has been generated, it can be used directly to calculate statistics for the uncertainty and sensitivity analysis. The two most common measures for uncertainty quantification, expected value and variance, can be calculated by inserting the expansion into the definition of these measures.
The expected value is equal to the first expansion coefficient:

$$ \mathbb{E}(Y) \approx \mathbb{E}(y_{N}) = \sum_{i=0}^{P} v_i \, \mathbb{E}(\Phi_i) = v_0, $$

taking the usual convention $\Phi_0 = 1$.
The variance is the sum of squared expansion coefficients multiplied by the normalisation constants of the polynomials:

$$ \operatorname{Var}(Y) \approx \operatorname{Var}(y_{N}) = \sum_{i=1}^{P} h_i \, v_i^2. $$
(Note the orthogonality of individual terms implies their covariance is zero, thus the variance is simply the sum of the variances of the terms.)
The Sobol indices may be calculated quite simply from the expansion terms, due to the fact that the ANOVA decomposition is unique. The main-effect variance of a parameter $z_i$ is simply the variance of all terms depending only on $z_i$. Explicitly, let $\mathcal{A}_{i} = \{k \mid \Phi_{k}(\mathbf{z}) = \Phi_{k}(z_i)\}$, i.e. $\mathcal{A}_{i}$ is the set of all indices of basis functions depending only on $z_i$; then

$$ S_i = \frac{\sum_{k \in \mathcal{A}_{i}} h_k v_k^2}{\operatorname{Var}(y_{N})}, $$

and similarly one may define $\mathcal{A}_{ij}$ for pairwise combinations of inputs, and so on, to calculate sensitivities of all orders.
An introductory paper to the python package chaospy, including a comparison to other software packages, is presented in [feinberg_2015].
You can find an introduction, tutorials and the source code at the project's homepage: https://github.com/jonathf/chaospy.
The package can be installed via pip:
pip install chaospy
In the following we will use the import naming convention of the package creator to import the package in python:
import chaospy as cp
This makes it convenient to see whenever a method of the package is applied.
The package chaospy is docstring annotated, which means that every method provides a short help text with small examples. To show the method documentation, simply type a ? after the method name in an IPython console or notebook, as shown in the following two examples:
# show help for uniform distributions
cp.Uniform?
# show help for sample generation
cp.samplegen?
To conduct UQSA with polynomial chaos we need to follow these steps:

1. Definition of the marginal and joint distributions
2. Generation of the orthogonal polynomials
3. Linear regression
   1. Generation of samples
   2. Evaluation of the model for all samples
   3. Generation of the polynomial chaos expansion
4. Pseudo-spectral projection
   1. Generation of integration nodes and weights
   2. Evaluation of the model for all nodes
   3. Generation of the polynomial chaos expansion
5. Calculation of all statistics
Note that step 3, linear regression, and step 4, pseudo-spectral projection, are interchangeable: they are simply different methods for calculating the expansion coefficients. In both cases a set of points in the parameter space is generated, at which the model must then be evaluated (steps 3.1-3.2 and 4.1-4.2, respectively).
The analysis of each model starts with the definition of the marginal distributions for each random model input, i.e. describing it as a random variable. Univariate random variables can be defined with chaospy by calling the class constructor of a distribution type, e.g. cp.Normal(), with arguments describing the particular distribution, e.g. mean value and standard deviation for cp.Normal. The help function can be used to find out more about the required arguments, e.g. help(cp.Normal). The following example defines three random variables with uniform, normal and log-normal distributions:
# simple distributions
rv1 = cp.Uniform(0, 1)
rv2 = cp.Normal(0, 1)
rv3 = cp.Lognormal(0, 1, 0.2, 0.8)
print(rv1, rv2, rv3)
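Each distribution object also provides methods to work with it directly, e.g. for evaluating the probability density function, the cumulative distribution function, or drawing samples. A small sketch of these convenience methods:

# evaluate density and distribution functions, and draw samples
print(rv1.pdf(0.5))   # density of the uniform variable at 0.5
print(rv2.cdf(0.0))   # cumulative probability of the normal variable at 0.0
print(rv3.sample(5))  # five random samples from the log-normal variable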
After all random input variables have been defined as univariate random variables, a multivariate random variable and its joint distribution can be constructed with the following command:
# joint distributions
joint_distribution = cp.J(rv1, rv2, rv3)
print(joint_distribution)
It is also possible to construct independent identically distributed (iid) random variables from any univariate random variable:
# creating iid variables
X = cp.Normal()
Y = cp.Iid(X, 4)
print(Y)
All 64 distributions available in the chaospy package can be found in the following table:
Distributions implemented in chaospy | | | |
---|---|---|---|
Alpha | Birnbaum-Saunders | Laplace | Power log-normal |
Anglit | Fisher-Snedecor | Levy | Power normal |
Arcsinus | Fisk/log-logistic | Log-gamma | Raised cosine |
Beta | Folded Cauchy | Log-laplace | Rayleigh |
Bradford | Folded normal | Log-normal | Reciprocal |
Burr | Frechet | Log-uniform | Right-skewed Gumbel |
Cauchy | Gamma | Logistic | Student-T |
Chi | Gen. exponential | Lomax | Triangle |
Chi-square | Gen. extreme value | Maxwell | Truncated exponential |
Double Gamma | Gen. gamma | Mielke's beta-kappa | Truncated normal |
Double Weibull | Gen. half-logistic | Nakagami | Tukey-Lambda |
Epanechnikov | Gilbrat | Non-central chi-squared | Uniform |
Erlang | Truncated Gumbel | Non-central Student-T | Wald |
Exponential | Gumbel | Non-central F | Weibull |
Exponential power | Hypergeometric secant | Normal | Wigner |
Exponential Weibull | Kumaraswamy | Pareto (first kind) | Wrapped Cauchy |
The orthogonal polynomials can be generated with different methods; several methods are implemented in chaospy, the most common of which are listed in the table below. The most stable, and therefore most advised, method is the three terms recursion method.
Orthogonalization Method | |
---|---|
Cholesky decomposition | cp.orth\_chol |
Three terms recursion | cp.orth\_ttr |
Modified Gram-Schmidt | cp.orth\_gs |
Regarding the three terms recursion method: for the distributions Normal, Uniform, Gamma, Log-normal, Triangle, Beta, and stochastically independent combinations of those, the three terms recursion coefficients are known analytically. For all other distributions the coefficients are estimated numerically; the method is then also called the discretized Stieltjes method. We will look at all schemes in a small example below; try increasing the polynomial order and the instabilities of the methods become visible.
# example orthogonalization schemes
# a normal random variable
n = cp.Normal(0, 1)
x = np.linspace(0,1, 50)
# the polynomial order of the orthogonal polynomials
polynomial_order = 3
poly = cp.orth_chol(polynomial_order, n, normed=True)
print('Cholesky decomposition {}'.format(poly))
ax = plt.subplot(131)
ax.set_title('Cholesky decomposition')
_=plt.plot(x, poly(x).T)
_=plt.xticks([])
poly = cp.orth_ttr(polynomial_order, n, normed=True)
print('Discretized Stieltjes / Three terms recursion {}'.format(poly))
ax = plt.subplot(132)
ax.set_title('Discretized Stieltjes ')
_=plt.plot(x, poly(x).T)
poly = cp.orth_gs(polynomial_order, n, normed=True)
print('Modified Gram-Schmidt {}'.format(poly))
ax = plt.subplot(133)
ax.set_title('Modified Gram-Schmidt')
_=plt.plot(x, poly(x).T)
The linear regression method requires the following three steps:

1. Generation of samples
2. Evaluation of the model for all samples
3. Generation of the polynomial chaos expansion
In the following we will not consider the model evaluation.
Once a random variable or a joint random variable (also referred to as a distribution here) has been defined, the following method can be used to generate a set of samples:
# sampling in chaospy
u = cp.Uniform(0,1)
u.sample?
The method takes the argument size, which is the number of samples, and rule, which is the sampling scheme to apply. The following example shows the creation of two sets of samples for the sampling schemes (Pseudo-)Random and Hammersley.
# example sampling
u1 = cp.Uniform(0,1)
u2 = cp.Uniform(0,1)
joint_distribution = cp.J(u1, u2)
number_of_samples = 350
samples_random = joint_distribution.sample(size=number_of_samples, rule='R')
samples_hammersley = joint_distribution.sample(size=number_of_samples, rule='M')
fig1, ax1 = plt.subplots()
ax1.set_title('Random')
ax1.scatter(*samples_random)
ax1.set_xlabel("Uniform 1")
ax1.set_ylabel("Uniform 2")
ax1.axis('equal')
fig2, ax2 = plt.subplots()
ax2.set_title('Hammersley sampling')
ax2.scatter(*samples_hammersley)
ax2.set_xlabel("Uniform 1")
ax2.set_ylabel("Uniform 2")
ax2.axis('equal')
All sampling schemes implemented in chaospy are listed in the following table:
Key | Name | Nested |
---|---|---|
C | Chebyshev nodes | no |
NC | Nested Chebyshev | yes |
K | Korobov | no |
R | (Pseudo-)Random | no |
RG | Regular grid | no |
NG | Nested grid | yes |
L | Latin hypercube | no |
S | Sobol | yes |
H | Halton | yes |
M | Hammersley | yes |
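Any key from the table can be passed as the rule argument. For example, Sobol and Latin hypercube samples of the same joint distribution as above can be generated as follows:

# example further sampling schemes
samples_sobol = joint_distribution.sample(size=number_of_samples, rule='S')
samples_latin = joint_distribution.sample(size=number_of_samples, rule='L')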
It may be useful to export the samples from chaospy for use in another program. The most useful format for exporting the samples likely depends on the external program, but it is quite simple to save the samples as a CSV file with a delimiter of your choice:
# example save samples to file
# creates a csv file where each row corresponds to one sample and each column
# to one of the variables in the joint distribution
csv_file = "csv_samples.csv"
sep = '\t'
header = ["u1", "u2"]
header = sep.join(header)
# samples_random has shape (2, number_of_samples), so transpose to get one sample per row
np.savetxt(csv_file, samples_random.T, delimiter=sep, header=header)
Each row of the csv file now contains a single sample from the joint distribution with the columns corresponding to each component.
Now you may evaluate these samples with an external program and save the resulting data into a similarly formatted CSV
file. Again each row should correspond to a single sample value and each column to different components of the model output.
# example load samples from file
# loads a csv file where the samples or the model evaluations for each sample
# are saved with one sample per row; multiple components of output can be
# stored as separate columns
filepath = "external_evaluations.csv"
data = np.loadtxt(filepath)
After the model is evaluated for all samples, the polynomial chaos expansion can be generated with the following method:
# linear regression in chaospy
cp.fit_regression?
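For externally computed results, the loaded evaluations can be combined with the samples that generated them. A minimal sketch, assuming data (loaded above) contains one model evaluation per row in the same order as samples_random:

# example fit regression to externally computed evaluations
poly = cp.orth_ttr(3, joint_distribution)
gpce_external = cp.fit_regression(poly, samples_random, data)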
In the following we show a complete example of a polynomial chaos expansion using linear regression. The model applied is the very simple mathematical expression

$$ y = f(Z_1, Z_2) = Z_1 + Z_1 Z_2. $$

The random variables $Z_1, Z_2$ are defined as simple uniform random variables:

$$ Z_1, Z_2 \sim \mathcal{U}(0, 1) $$

The mean of this model should be $\frac{3}{4}$, the variance should be $\frac{31}{144}$, and the first-order sensitivities to $Z_1$ and $Z_2$ are respectively $\frac{27}{31}$ and $\frac{3}{31}$.
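These reference values follow directly from the independence of $Z_1$ and $Z_2$ (a short derivation, useful for checking the numerical results below):

$$ \mathbb{E}(y) = \mathbb{E}(Z_1)\big(1 + \mathbb{E}(Z_2)\big) = \tfrac{1}{2} \cdot \tfrac{3}{2} = \tfrac{3}{4}, \qquad \mathbb{E}(y^2) = \mathbb{E}(Z_1^2) \, \mathbb{E}\big((1+Z_2)^2\big) = \tfrac{1}{3} \cdot \tfrac{7}{3} = \tfrac{7}{9}, $$

$$ \operatorname{Var}(y) = \tfrac{7}{9} - \big(\tfrac{3}{4}\big)^2 = \tfrac{31}{144}. $$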
Here is the annotated example code with all steps required to generate a polynomial chaos expansion with linear regression:
# example linear regression
# 1. define marginal and joint distributions
u1 = cp.Uniform(0,1)
u2 = cp.Uniform(0,1)
joint_distribution = cp.J(u1, u2)
# 2. generate orthogonal polynomials
polynomial_order = 3
poly = cp.orth_ttr(polynomial_order, joint_distribution)
# 3.1 generate samples
# use twice as many samples as expansion terms so that the linear system is overdetermined
number_of_samples = 2 * cp.bertran.terms(polynomial_order, len(joint_distribution))
samples = joint_distribution.sample(size=number_of_samples, rule='R')
# 3.2 evaluate the simple model for all samples
model_evaluations = samples[0]+samples[1]*samples[0]
# 3.3 use regression to generate the polynomial chaos expansion
gpce_regression = cp.fit_regression(poly, samples, model_evaluations)
print("Success")
# quadrature in chaospy
cp.generate_quadrature?
We will look at the following arguments of the method: order is the order of the quadrature rule, domain is the distribution (or integration domain) for which the nodes and weights are constructed, and rule is the name or key of the quadrature rule to apply.
In the following example we look at quadrature nodes for the same uniform variables as in the sampling example, using optimal Gaussian quadrature and Clenshaw-Curtis quadrature.
# example quadrature
u1 = cp.Uniform(0,1)
u2 = cp.Uniform(0,1)
joint_distribution = cp.J(u1, u2)
order = 5
nodes_gaussian, weights_gaussian = cp.generate_quadrature(order=order, domain=joint_distribution, rule='G')
nodes_clenshaw, weights_clenshaw = cp.generate_quadrature(order=order, domain=joint_distribution, rule='C')
print('Number of nodes gaussian quadrature: {}'.format(len(nodes_gaussian[0])))
print('Number of nodes clenshaw-curtis quadrature: {}'.format(len(nodes_clenshaw[0])))
fig1, ax1 = plt.subplots()
ax1.scatter(*nodes_gaussian, marker='o', color='b')
ax1.scatter(*nodes_clenshaw, marker= 'x', color='r')
ax1.set_xlabel("Uniform 1")
ax1.set_ylabel("Uniform 2")
ax1.axis('equal')
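Since the nodes and weights are constructed with respect to the joint distribution, the weights should sum to approximately one (the integral of a constant function over the distribution), which gives a quick consistency check:

# the quadrature weights approximate integration against the distribution,
# so they should sum to approximately 1
print(weights_gaussian.sum())
print(weights_clenshaw.sum())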
All quadrature rules implemented in chaospy are listed in the following table:
Collection of quadrature rules | Name | Key |
---|---|---|
Optimal Gaussian quadrature | Gaussian | G |
Gauss-Legendre quadrature | Legendre | E |
Clenshaw-Curtis quadrature | Clenshaw | C |
Leja quadrature | Leja | J |
Hermite Genz-Keister 16 rule | Genz | Z |
Gauss-Patterson quadrature rule | Patterson | P |
In the following example we show sparse vs. normal quadrature nodes:
# example sparse grid quadrature
u1 = cp.Uniform(0,1)
u2 = cp.Uniform(0,1)
joint_distribution = cp.J(u1, u2)
order = 2
# sparse grids use nested rules whose number of 1D nodes grows exponentially,
# so in this low-dimensional example a small order already yields more points
# than the corresponding tensor grid
nodes_clenshaw, weights_clenshaw = cp.generate_quadrature(order=order, domain=joint_distribution, rule='C')
nodes_clenshaw_sparse, weights_clenshaw_sparse = cp.generate_quadrature(order=order, domain=joint_distribution, rule='C', sparse=True)
print('Number of nodes normal clenshaw-curtis quadrature: {}'.format(len(nodes_clenshaw[0])))
print('Number of nodes clenshaw-curtis quadrature with sparse grid : {}'.format(len(nodes_clenshaw_sparse[0])))
fig1, ax1 = plt.subplots()
ax1.scatter(*nodes_clenshaw, marker= 'x', color='r')
ax1.scatter(*nodes_clenshaw_sparse, marker= 'o', color='b')
ax1.set_xlabel("Uniform 1")
ax1.set_ylabel("Uniform 2")
ax1.axis('equal')
After the model is evaluated for all integration nodes, the polynomial chaos expansion can be generated with the following method:
# spectral projection in chaospy
cp.fit_quadrature?
In the following we show again a complete example of a polynomial chaos expansion, this time using the pseudo-spectral approach to calculate the expansion coefficients. The model is the same simple mathematical expression as before,

$$ y = f(Z_1, Z_2) = Z_1 + Z_1 Z_2, $$

and the random variables $Z_1, Z_2$ are again defined as simple uniform random variables:

$$ Z_1, Z_2 \sim \mathcal{U}(0, 1) $$
# example spectral projection
# 1. define marginal and joint distributions
u1 = cp.Uniform(0,1)
u2 = cp.Uniform(0,1)
joint_distribution = cp.J(u1, u2)
# 2. generate orthogonal polynomials
polynomial_order = 3
poly = cp.orth_ttr(polynomial_order, joint_distribution)
# 4.1 generate quadrature nodes and weights
order = 5
nodes, weights = cp.generate_quadrature(order=order, domain=joint_distribution, rule='G')
# 4.2 evaluate the simple model for all nodes
model_evaluations = nodes[0]+nodes[1]*nodes[0]
# 4.3 use quadrature to generate the polynomial chaos expansion
gpce_quadrature = cp.fit_quadrature(poly, nodes, weights, model_evaluations)
print("Success")
Once the polynomial chaos expansion has been created, either with the pseudo-spectral projection or with the regression method, the calculation of statistics is straightforward. The following listing gives an overview of the available methods; they all take the same input parameters, namely the polynomial expansion and the joint distribution (see also the example below). Note that one can also calculate uncertainty statistics on distributions alone.
- Expected value: cp.E
- Variance: cp.Var
- Standard deviation: cp.Std
- Kurtosis: cp.Kurt
- Skewness: cp.Skew
- Distribution of Y: cp.QoI_Dist
- Prediction intervals: cp.Perc, a method to calculate percentiles; an additional argument defining the percentiles needs to be passed

If multiple quantities of interest are available:

- Covariance matrix: cp.Cov
- Correlation matrix: cp.Corr
- Spearman correlation: cp.Spearman
- Auto-correlation function: cp.Acf
# example uq
exp_reg = cp.E(gpce_regression, joint_distribution)
exp_ps = cp.E(gpce_quadrature, joint_distribution)
std_reg = cp.Std(gpce_regression, joint_distribution)
std_ps = cp.Std(gpce_quadrature, joint_distribution)
prediction_interval_reg = cp.Perc(gpce_regression, [5, 95], joint_distribution)
prediction_interval_ps = cp.Perc(gpce_quadrature, [5, 95], joint_distribution)
print("Expected values Standard deviation 90 % Prediction intervals\n")
print(' E_reg | E_ps std_reg | std_ps pred_reg | pred_ps')
print(' {} | {} {:>6.3f} | {:>6.3f} {} | {}'.format(exp_reg,
exp_ps,
std_reg,
std_ps,
["{:.3f}".format(p) for p in prediction_interval_reg],
["{:.3f}".format(p) for p in prediction_interval_ps]))
The variance-based sensitivity indices can be calculated directly from the expansion. The chaospy package provides the following methods:
- First order indices: cp.Sens_m
- Second order indices: cp.Sens_m2
- Total indices: cp.Sens_t
Here is an example for the first and total indices for both expansions:
# example sens
sensFirst_reg = cp.Sens_m(gpce_regression, joint_distribution)
sensFirst_ps = cp.Sens_m(gpce_quadrature, joint_distribution)
sensT_reg = cp.Sens_t(gpce_regression, joint_distribution)
sensT_ps = cp.Sens_t(gpce_quadrature, joint_distribution)
print("First Order Indices Total Sensitivity Indices\n")
print(' S_reg | S_ps ST_reg | ST_ps \n')
for k, (s_reg, s_ps, st_reg, st_ps) in enumerate(zip(sensFirst_reg, sensFirst_ps, sensT_reg, sensT_ps)):
print('S_{} : {:>6.3f} | {:>6.3f} ST_{} : {:>6.3f} | {:>6.3f}'.format(k, s_reg, s_ps, k, st_reg, st_ps))
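Again, the computed first order indices can be compared against the analytical values stated earlier (a small verification sketch):

# compare with the analytical first order indices S_1 = 27/31 and S_2 = 3/31
print('analytical S_1: {:.3f}'.format(27.0/31.0))
print('analytical S_2: {:.3f}'.format(3.0/31.0))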