Name	Name	Last commit message	Last commit date
Latest commit serban-nicusor-toptal release/v2.29.1 Mar 2, 2022 94cd230 · Mar 2, 2022 History 5,005 Commits
.github	.github	Add issue templates	Dec 3, 2021
docs	docs	Update getting started docs	Dec 21, 2021
scripts	scripts	[WIP] Move ci to flatiron institute (#1087 )	Feb 16, 2022
src	src	Merge pull request #1132 from stan-dev/improve-tilde-type-err	Feb 23, 2022
test	test	Merge pull request #1132 from stan-dev/improve-tilde-type-err	Feb 23, 2022
.gitattributes	.gitattributes	Add linguist gitattributes	Oct 20, 2021
.gitignore	.gitignore	update to master	Aug 11, 2021
.ocamlformat	.ocamlformat	Format with ocamlformat 0.19.0	Nov 11, 2021
Jenkinsfile	Jenkinsfile	Split compile tests into smaller chunks to improve CI times (#1123 )	Feb 21, 2022
Jenkinsfile-test-binaries	Jenkinsfile-test-binaries	Move tensorflow backend to separate repo	Nov 16, 2021
LICENSE.txt	LICENSE.txt	Dockerfile movement	Dec 6, 2018
Makefile	Makefile	Clean up makefile	Feb 2, 2022
README.md	README.md	Split compile tests into smaller chunks to improve CI times (#1123 )	Feb 21, 2022
RELEASE-NOTES.txt	RELEASE-NOTES.txt	release/v2.29.1	Mar 2, 2022
default.nix	default.nix	Reverted to redundant Nixpkgs fetch because workaround had bug	Oct 8, 2020
dune	dune	Update to OCaml 4.12.0	Nov 11, 2021
dune-project	dune-project	Add stancjs capability, fix dune pinning	Nov 29, 2021
shell.nix	shell.nix	Reverted to redundant Nixpkgs fetch because workaround had bug	Oct 8, 2020
stanc.opam	stanc.opam	Add stancjs capability, fix dune pinning	Nov 29, 2021

Name

Last commit message

Last commit date

serban-nicusor-toptal

release/v2.29.1

Mar 2, 2022

94cd230 · Mar 2, 2022

Dec 3, 2021

Update getting started docs

Dec 21, 2021

scripts

[WIP] Move ci to flatiron institute (#1087 )

Feb 16, 2022

src

Merge pull request #1132 from stan-dev/improve-tilde-type-err

Feb 23, 2022

test

Merge pull request #1132 from stan-dev/improve-tilde-type-err

Feb 23, 2022

.gitattributes

Add linguist gitattributes

Oct 20, 2021

.gitignore

update to master

Aug 11, 2021

.ocamlformat

Format with ocamlformat 0.19.0

Nov 11, 2021

Jenkinsfile

Split compile tests into smaller chunks to improve CI times (#1123 )

Feb 21, 2022

Jenkinsfile-test-binaries

Move tensorflow backend to separate repo

Nov 16, 2021

Dec 6, 2018

Feb 2, 2022

Split compile tests into smaller chunks to improve CI times (#1123 )

Feb 21, 2022

RELEASE-NOTES.txt

release/v2.29.1

Mar 2, 2022

default.nix

Reverted to redundant Nixpkgs fetch because workaround had bug

Oct 8, 2020

dune

Update to OCaml 4.12.0

Nov 11, 2021

dune-project

Add stancjs capability, fix dune pinning

Nov 29, 2021

shell.nix

Reverted to redundant Nixpkgs fetch because workaround had bug

Oct 8, 2020

stanc.opam

Add stancjs capability, fix dune pinning

Nov 29, 2021

A New Stan-to-C++ Compiler, stanc3

This repo contains a new compiler for Stan, stanc3, written in OCaml. Since version 2.26, this has been the default compiler for Stan. See this wiki for a list of minor differences between this compiler and the previous Stan compiler.

To read more about why we built this, see this introductory blog post. For some discussion as to how we chose OCaml, see this accidental flamewar. We're testing these models (listed under Test Results) on every pull request.

Documentation

Documentation for users of stanc3 is in the Stan Users' Guide here

The Stanc3 Developer documentation is available here: https://mc-stan.org/stanc3/stanc

Want to contribute? See Getting Started for setup instructions and some useful commands.

High-level concepts, invariants, and 30,000-ft view

Stanc3 has 3 main src packages: frontend, middle, and stan_math_backend.

The Middle contains the MIR and currently any types or functions used by the two ends. The entrypoint for the compiler is in src/stanc/stanc.ml which sequences the various components together.

Distinct stanc Phases

The phases of stanc are summarized in the following information flowchart and list.

Lex the Stan language into tokens.
Parse Stan language into AST that represents the syntax quite closely and aides in development of pretty-printers and linters. stanc --debug-ast to print this out.
Typecheck & add type information Typechecker.ml. stanc --debug-decorated-ast
Lower into Middle Intermediate Representation (AST -> MIR) stanc --debug-mir (or --debug-mir-pretty)
Analyze & optimize (MIR -> MIR)
Backend MIR transform (MIR -> MIR) Transform_Mir.ml stanc --debug-transformed-mir
Hand off to a backend to emit C++ (or LLVM IR, or Tensorflow, or interpret it!).

The two central data structures

src/frontend/Ast.ml defines the AST. The AST is intended to have a direct 1-1 mapping with the syntax, so there are things like parentheses being kept around. The pretty-printer in the frontend uses the AST and attempts to keep user syntax the same while just adjusting whitespace.

The AST uses a particular functional programming trick to add metadata to the AST (and its other tree types), sometimes called the "two-level types" pattern. Essentially, many of the tree variant types are parameterized by something that ends up being a placeholder not for just metadata but for the recursive type including metadata, sometimes called the fixed point. So instead of recursively referencing expression you would instead reference type parameter 'e, which will later be filled in with something like type expr_with_meta = metadata expression. The AST intends to keep very close to Stan-level semantics and syntax in every way. 2. src/middle/Mir.ml contains the MIR (Middle Intermediate Language - we're saving room at the bottom for later). src/frontend/Ast_to_Mir.ml performs the lowering and attempts to strip out as much Stan-specific semantics and syntax as possible, though this is still something of a work-in-progress.

The MIR uses the same two-level types pattern to add metadata, notably expression types and autodiff levels as well as locations on many things. The MIR is used as the output data type from the frontend and the input for dataflow analysis, optimization (which also outputs MIR), and code generation.

Design goals

Multiple phases, each with human-readable intermediate representations for easy debugging and optimization design.
Optimizing - takes advantage of info known at the Stan language level. Minimize information we must teach users for them to write performant code.
Holistic- bring as much of the code as possible into the MIR for whole-program optimization.
Research platform- enable a new class of optimizations based on probability theory.
Modular - architect & build in a way that makes it easy to outsource things like symbolic differentiation to external libraries and to use parts of the compiler as the basis for other tools built around the Stan language.
Simplicity first - When making a choice between correct simplicity and a perceived performance benefit, we want to make the choice for simplicity unless we can show significant (> 5%) benchmark improvements to compile times or run times. Premature optimization is the root of all evil.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub Sponsors

Repository files navigation

A New Stan-to-C++ Compiler, stanc3

Documentation

High-level concepts, invariants, and 30,000-ft view

Distinct stanc Phases

The two central data structures

Design goals

About

Releases 47

Sponsor this project

Packages

Contributors 29

Languages

License

stan-dev/stanc3

Folders and files

Latest commit

History

Repository files navigation

A New Stan-to-C++ Compiler, stanc3

Documentation

High-level concepts, invariants, and 30,000-ft view

Distinct stanc Phases

The two central data structures

Design goals

About

Resources

License

Stars

Watchers

Forks

Releases 47

Sponsor this project

Packages 0

Contributors 29

Languages

Packages