FlexiChains.jl

Flexible Markov chains.

Quickstart

FlexiChain{T} represents a chain that stores data for parameters of type T. VNChain is a alias for FlexiChain{VarName}, and is the appropriate type for storing Turing.jl's outputs.

To obtain a VNChain from MCMC sampling, pass the chain_type argument to the sample function.

using Turing
using FlexiChains

@model function f()
    x ~ Normal()
    y ~ Poisson(3.0)
    z ~ MvNormal(zeros(2), I)
end
chain = sample(f(), NUTS(), MCMCThreads(), 1000, 3; chain_type=VNChain)

You can index into a VNChain using VarNames. Data is returned as a DimMatrix from the DimensionalData.jl package, which behaves exactly like an ordinary Matrix but additionally carries more information about its dimensions.

chain[@varname(x)]    # -> DimMatrix{Float64}
chain[@varname(y)]    # -> DimMatrix{Int}
chain[@varname(z)]    # -> DimMatrix{Vector{Float64}}; but see also DimensionalDistributions.jl
chain[@varname(z[1])] # -> DimMatrix{Float64}
chain[:logjoint]      # -> DimMatrix{Float64} NOTE: not `:lp`

Quick and dirty summary stats can be obtained with:

ss = summarystats(chain)         # mean, std, mcse, ess, rhat for all variables
ss[@varname(x), stat=At(:mean)]  # -> Float64 (the mean of x)

Or you can compute statistics directly with individual functions:

mean(chain)              # just the mean for all variables
mean(chain)[@varname(x)] # -> Float64
mean(chain; dims=:iter)  # take the mean over iterations only

Visualisation with Plots.jl and Makie.jl is supported:

using StatsPlots  # or CairoMakie
plot(chain)                    # trace + densities for all variables
plot(chain, [@varname(z)])     # trace + density for z[1] and z[2] only
plot(chain; pool_chains=true)  # combining samples from all chains

Finally, functions in Turing.jl which take chains as input, such as returned, predict, and logjoint should work out of the box with exactly the same behaviour as before.

The online documentation contains much more detail about FlexiChains and its interface. Please do check it out!

Why use FlexiChains?

Turing's default data type for Markov chains is MCMCChains.Chains.

This entire package essentially came about because MCMCChains.Chains is, in my opinion, a bad data structure.

Consider the following model:

@model function f()
    x ~ Poisson(1.0)
    y ~ MvNormal(zeros(2), I)
end

The way that MCMCChains and FlexiChains stores the outputs of this model is illustrated here:

MCMCChains vs FlexiChains data structure comparison

Specifically, MCMCChains represents data as a mapping of individual Symbols to arrays of Float64s. However, Turing.jl uses VarNames as keys in its models, and the values can be anything that is sampled from a distribution. In this case, x is an integer-valued parameter, and y is a vector-valued parameter.

This leads to several problems:

The conversion from VarName to Symbol is lossy. See here and here for examples of how this can bite users. It also forces Turing.jl to store extra information in the chain which specifies how to retrieve the original VarNames.
Array-valued parameters must be broken up in a chain. This makes it very annoying to reconstruct the full arrays. It also causes massive slowdowns for some operations, and is also responsible for some hacky code in AbstractPPL and DynamicPPL that has no reason to exist beyond the limitations of MCMCChains.
Inability to store generic information from MCMC sampling. For example, a model containing a line such as x := s::String (see here for the meaning of :=) will error.
Overly aggressive casting to avoid abstract types. If you have an integer-valued parameter, it will be cast to Float64 when sampling using MCMCChains. See this issue for details.

FlexiChains solves all of these by making the mapping of parameters to values much more flexible (hence the name). Both keys and values can, in general, be of any type. This makes for a less compact representation of the data, but means that information about the chain is preserved much more faithfully.

For more information, and some concrete examples of where FlexiChains does better, please see the 'Why FlexiChains' documentation page.

Name		Name	Last commit message	Last commit date
Latest commit History 344 Commits
.github/workflows		.github/workflows
docs		docs
ext		ext
src		src
test		test
.JuliaFormatter.toml		.JuliaFormatter.toml
.gitignore		.gitignore
HISTORY.md		HISTORY.md
LICENSE		LICENSE
Project.toml		Project.toml
README.md		README.md
flexichains.svg		flexichains.svg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FlexiChains.jl

Quickstart

Why use FlexiChains?

About

Uh oh!

Releases 10

Packages

Contributors 2

Uh oh!

Languages

License

penelopeysm/FlexiChains.jl

Folders and files

Latest commit

History

Repository files navigation

FlexiChains.jl

Quickstart

Why use FlexiChains?

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 10

Packages 0

Contributors 2

Uh oh!

Languages

Packages