This package demonstrates how to use Stan to fit dynamic linear models of the form

$$
\begin{aligned}
x_{k+1} &= A(\theta) x_k + B(\theta) u_k + w_k, \quad w_k \sim N(0, Q(\theta)) \\
y_k &= H(\theta) x_k + v_k, \quad v_k \sim N(0, R(\theta)).
\end{aligned}
$$
That is, we fit the static parameters θ of a linear state space model, possibly including parameters of the model and observation noise covariances. In addition, we show how to efficiently sample the states given the parameters, inspired by the blog post at http://www.juhokokkala.fi/blog/posts/kalman-filter-style-recursion-to-marginalize-state-variables-to-speed-up-stan-inference/ (here we consider a more general case than in the blog post).
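As a quick illustrative sketch (not part of the package), data from such a linear state space model can be simulated with plain numpy; all matrices and dimensions below are made up for illustration:

```python
# Simulate x[k+1] = A x[k] + B u[k] + w[k], y[k] = H x[k+1] + v[k],
# with w ~ N(0, Q) and v ~ N(0, R). Values here are illustrative only.
import numpy as np

rng = np.random.default_rng(0)

T = 50                      # number of time steps
A = np.array([[0.9]])       # state transition matrix
B = np.array([[0.5]])       # forcing matrix
H = np.array([[1.0]])       # observation matrix
Q = np.array([[0.1]])       # model (process) noise covariance
R = np.array([[0.2]])       # observation noise covariance
u = np.ones((T, 1))         # known forcing input

x = np.zeros((T + 1, 1))    # states, x[0] is the initial state
y = np.zeros((T, 1))        # observations
for k in range(T):
    x[k + 1] = A @ x[k] + B @ u[k] + rng.multivariate_normal(np.zeros(1), Q)
    y[k] = H @ x[k + 1] + rng.multivariate_normal(np.zeros(1), R)
```

The simulated `y` (and `u`) would then be passed to Stan as data, while `x` plays the role of the latent states to be recovered.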
Note that there is also a function already available in Stan for DLM fitting, see https://mc-stan.org/docs/2_26/functions-reference/gaussian-dynamic-linear-models.html. The difference here is that we include the forcing term $B(\theta)u_k$ and also consider random sampling of the states given the parameters.
The code runs with CmdStanPy, which is a thin Python wrapper around CmdStan, the command line interface to Stan. In addition to CmdStanPy, you'll need some standard Python packages like numpy, pandas, and matplotlib. Note that you also need to install CmdStan itself, which can easily be done with `cmdstanpy.install_cmdstan()`.
For a linear state space system, the likelihood of the parameters can be calculated efficiently by integrating out the state variables with a Kalman filter recursion. By the chain rule of joint probability,

$$
p(y_{1:T} \mid \theta) = p(y_1 \mid \theta)\prod_{k=2}^{T} p(y_k \mid y_{1:k-1}, \theta).
$$
The individual predictive distributions can be calculated recursively in the linear case as follows (dropping the parameter dependency of the matrices for simplicity), starting from $x_0 \sim N(m_0, P_0)$:

$$
\begin{aligned}
\hat{m}_k &= A m_{k-1} + B u_{k-1} \\
\hat{P}_k &= A P_{k-1} A^T + Q \\
p(y_k \mid y_{1:k-1}, \theta) &= N(y_k \mid H \hat{m}_k, \; H \hat{P}_k H^T + R) \\
K_k &= \hat{P}_k H^T (H \hat{P}_k H^T + R)^{-1} \\
m_k &= \hat{m}_k + K_k (y_k - H \hat{m}_k) \\
P_k &= (I - K_k H) \hat{P}_k
\end{aligned}
$$
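The forward recursion above can be sketched in numpy as follows; this is an illustrative reference implementation, not the package's Stan code, and the function name and signature are made up:

```python
# Kalman filter forward pass: accumulates log p(y_{1:T} | theta) from the
# one-step predictive densities and stores the filtering means/covariances.
import numpy as np

def kalman_loglik(y, u, A, B, H, Q, R, m0, P0):
    m, P = m0, P0
    ms, Ps = [], []
    loglik = 0.0
    for k in range(len(y)):
        # prediction step
        m_pred = A @ m + B @ u[k]
        P_pred = A @ P @ A.T + Q
        # one-step predictive density of y[k], a Gaussian
        S = H @ P_pred @ H.T + R
        r = y[k] - H @ m_pred
        loglik += -0.5 * (len(r) * np.log(2 * np.pi)
                          + np.linalg.slogdet(S)[1]
                          + r @ np.linalg.solve(S, r))
        # update step
        K = P_pred @ H.T @ np.linalg.inv(S)
        m = m_pred + K @ r
        P = (np.eye(len(m)) - K @ H) @ P_pred
        ms.append(m)
        Ps.append(P)
    return loglik, ms, Ps
```

The stored `ms` and `Ps` are exactly what the backward sampling pass below needs.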
Now the final likelihood is the combination of the above densities, and we can run MCMC to sample the posterior for θ. Alongside the posterior sampling, we can also get samples of the states $x_{1:T}$ using the following backward recursion:
- sample $x_T$ from $p(x_T \mid y_{1:T}, \theta) = N(m_T, P_T)$, which is just the filtering distribution for the last state,
- sample $x_{T-1}$ from $p(x_{T-1} \mid x_T, y_{1:T-1}, \theta)$,
- sample $x_{T-2}$ from $p(x_{T-2} \mid x_{T-1}, y_{1:T-2}, \theta)$,
- ...
That is, we end up having to sample from densities like

$$
p(x_k \mid x_{k+1}, y_{1:T}, \theta) = p(x_k \mid x_{k+1}, y_{1:k}, \theta) \propto p(x_{k+1} \mid x_k, \theta) \, p(x_k \mid y_{1:k}, \theta),
$$

where the first equality follows from the Markovian model (conditional independence) and the second step is the Bayes formula. The components in the final expression are Gaussians:

$$
p(x_{k+1} \mid x_k, \theta) = N(x_{k+1} \mid A x_k + B u_k, Q), \qquad p(x_k \mid y_{1:k}, \theta) = N(x_k \mid m_k, P_k).
$$
We can think of the first one as the "likelihood" with $x_{k+1}$ as the "data" and the latter as the prior, which gives us the posterior $N(x_k \mid s_k, S_k)$, where

$$
\begin{aligned}
G_k &= P_k A^T (A P_k A^T + Q)^{-1} \\
s_k &= m_k + G_k (x_{k+1} - A m_k - B u_k) \\
S_k &= P_k - G_k A P_k.
\end{aligned}
$$
That is, if we store all the results from the Kalman filter forward pass, we can compute a random sample of the states given θ using the pre-calculated results.
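The backward sampling pass can be sketched as follows, assuming the filtering means `ms` and covariances `Ps` have been stored from a forward pass; the function name and indexing convention are illustrative, not the package's API:

```python
# Backward sampling of the states x_{1:T} given the filtering results.
import numpy as np

def backward_sample(ms, Ps, u, A, B, Q, rng):
    T = len(ms)
    xs = [None] * T
    # last state: sample from the filtering distribution N(m_T, P_T)
    xs[T - 1] = rng.multivariate_normal(ms[T - 1], Ps[T - 1])
    for k in range(T - 2, -1, -1):
        # combine the "likelihood" N(x_{k+1} | A x_k + B u_k, Q) with the
        # "prior" N(x_k | m_k, P_k); the gain form below only inverts
        # (A P A^T + Q), not the filtering covariance itself
        G = Ps[k] @ A.T @ np.linalg.inv(A @ Ps[k] @ A.T + Q)
        mean = ms[k] + G @ (xs[k + 1] - A @ ms[k] - B @ u[k])
        cov = Ps[k] - G @ A @ Ps[k]
        xs[k] = rng.multivariate_normal(mean, cov)
    return np.array(xs)
```

Drawing one such sample per MCMC draw of θ yields posterior samples of the full state trajectory.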
The general Stan code for sampling the parameters, and the states given the parameters, using the above trick is given in https://github.com/solbes/dlmstan/blob/main/dlm.stan. The code does not run on its own: the user needs to write the functions build_A, build_B etc. for the application in question, which can be done using the functions block in Stan, see the examples below. The user also needs to feed in data in a standard format, see again the examples.
Some comments about the code:
- The user writes the `functions` block for the problem at hand, which is then combined with the general code `dlm.stan`, see the examples.
- If matrix `B` is not present in the application, the user still needs to feed in a dummy zero matrix, see the first example.
- The code separates two types of parameters: normal "model parameters" and noise parameters that enter the covariance matrices `Q` and `R`. Normal priors are given for both, and the prior parameters can be given in the data, see the examples.
- Initial values for the states need to be given in the data, see the examples.
- In the DLM code, the matrix inversion lemma is used to avoid explicitly inverting the filtering covariance matrices.
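The matrix inversion lemma trick mentioned above can be checked numerically: the conditional covariance $(P^{-1} + A^T Q^{-1} A)^{-1}$ equals the gain form $P - P A^T (A P A^T + Q)^{-1} A P$, which avoids inverting the filtering covariance $P$ itself. A small sanity check with made-up matrices:

```python
# Numeric check of the matrix inversion (Woodbury) lemma used in dlm.stan.
import numpy as np

rng = np.random.default_rng(1)
n = 3
A = rng.standard_normal((n, n))
# make P and Q symmetric positive definite
P = np.eye(n) + 0.1 * rng.standard_normal((n, n)); P = P @ P.T
Q = np.eye(n) + 0.1 * rng.standard_normal((n, n)); Q = Q @ Q.T

direct = np.linalg.inv(np.linalg.inv(P) + A.T @ np.linalg.inv(Q) @ A)
lemma = P - P @ A.T @ np.linalg.inv(A @ P @ A.T + Q) @ A @ P

assert np.allclose(direct, lemma)
```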
Examples:

- Nile river, estimating 3 noise parameters: https://github.com/solbes/dlmstan/blob/main/examples/stan_dlm_nile.ipynb
- Beer cooling, 2 model parameters and 2 noise parameters: https://github.com/solbes/dlmstan/blob/main/examples/stan_dlm_beer.ipynb