Issue/838 hmm #840

charlesm93 · 2024-12-06T21:35:28Z

Submission Checklist

Builds locally
New functions marked with <<{ since VERSION }>>
Declare copyright holder and open-source license: see below

Summary

Addresses issue #838 and updates user-doc on HMMs.

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): Charles Margossian, Simons Foundation

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC BY-ND 4.0 (https://creativecommons.org/licenses/by-nd/4.0/)

bob-carpenter · 2024-12-06T22:13:36Z

I took a look through this and while I'm in favor of minimal examples, this one's a bit too minimal. I would really like to see the model laid out in more detail than the conditional distributions. Here are some concrete suggestions:

Use z for latent discrete parameters. We already use x for covariates, so it's confusing to use x. I've used z elsewhere in the user manual for this in the latent discrete parameter chapter.
Define the complete data likelihood $p(y, z | \phi) = p(z \mid \phi) \cdot p(y \mid z, \phi)$ and then the marginalization $p(y \mid \phi)$ that we actually fit.
Mention that $\phi$ is actually several parameters.
I find the constrained version of the transition matrix where you insert zeros makes this first example too challenging. If you want to discuss that case, I'd suggest another section after this one where you talk about imposing structural zeros. Otherwise it makes the simple case seem too complicated. And then you can just define

array[3] simplex[3] gamma_arr;

matrix[3, 3] gamma;
for (n in 1:3) gamma[n] = gamma_arr[n];

For the doc, those mu and sigma are not just the measurement model---that's all the error terms.
Wherever you have repetition, use loops. It's less error prone and more clear that it's a homogeneous operation:

for (n in 1:N) {
  for (k in 1:3) {
    log_omega[k, n] = normal_lpdf(y[n] | mu[k], sigma);
  }
}

Given that you're tying the parameter sigma across outputs, you need to mention that. I'd recommend just keeping this simple with a vector of sigma values.
You can't just say "computes the relevant log marginal distribution"---you have to say what that is. I don't mean including the marginalization algorithm, I just mean as I wrote it above.
For more details, since -> For more details, see. Also, I wouldn't say "corresponding case study", I'd just say it's a case study on HMMs. And then you should put it in the bibtex file and cite it properly with reference to Ben (the author). And you should cite that you "borrowed" the example from Ben Bales's case study.

charlesm93 · 2025-01-15T10:46:12Z

I'm rewriting the code to make the transition matrix less constrained (per comment 4) and I wanted to check: we don't have a stochastic matrix type, right?

WardBrian · 2025-01-15T13:36:54Z

We have row and column but not double yet https://mc-stan.org/docs/reference-manual/types.html#stochastic-matrices

… issue/838-hmm

charlesm93 · 2025-03-14T18:27:14Z

@bob-carpenter I implemented your feedback.

Some questions:

What exactly is the difference between a measurement and an error model? I'm ok to use either term, and I know they have different conceptual implication, but I'm wondering if they have a formal definition.
I'm keeping sigma a scalar. I'm not sure it's simpler conceptually to pass it as a vector.

bob-carpenter · 2025-03-14T20:15:01Z

For (5), I think the idea's that there are three sources of error: measurement error, modeling error, and sampling error. For example, sampling error arises when you subsample a population and use that for estimation. You get modeling error if you use a linear regression for a relationship that's not linear or use normal errors when the errors are skewed, and so on. If you're weighing things with a scale and you know the scale's biased to the high side, you can correct that measurement error. You can explicitly add a measurement error model if you know your measurement model (e.g., gravitational lensing is part of the measurement error model; your work with Bruno et al. on deconvolving galactic dust is part of the measurement model for the CMB, etc.).

bob-carpenter

There's a ton of little things to fix, but nothing major.

bob-carpenter · 2025-03-14T20:16:15Z

src/functions-reference/functions_index.qmd

@@ -294,6 +294,21 @@ pagetitle: Alphabetical Index
 - <div class='index-container'>[distribution statement](unbounded_discrete_distributions.qmd#index-entry-0c7465aa1beceb6e7e303af36b60e2b847fc562a) <span class='detail'>(unbounded_discrete_distributions.html)</span></div>


+<a id='beta_neg_binomial_cdf' href='#beta_neg_binomial_cdf' class='anchored unlink'>**beta_neg_binomial_cdf**:</a>


Why is this PR touching negative binomial? Are you up to date with the main branch?

hmmm... that's odd. Let me check.

As far as I can tell, I am up to date with the main branch, so I'm not sure how this error occurred. I suppose I could simply delete the extra lines.

src/stan-users-guide/time-series.qmd

bob-carpenter · 2025-03-14T20:32:06Z

src/stan-users-guide/time-series.qmd

-The model for the supervised data does not change; the unsupervised
-data are handled with the following Stan implementation of the forward
-algorithm.
+The last function `hmm_marginal` takes in all the ingredients of the HMM,


Eliminate the comma---English doesn't use commas between conjunctions unless there are more than two. So it's just "A and B", but it's either "A, B, and C" (Oxford style) or "A, B and C" (defective American style).

Ok. Although I was under the impression they didn't use the Oxford comma in the UK. I remember noticing this during StanCon in Cambridge and I figured out the Cambridge folks were just taking a piss. But when I asked some of the locals, it turned out they were familiar with the Oxford comma.

src/stan-users-guide/time-series.qmd

charlesm93 · 2025-04-16T20:41:43Z

@bob-carpenter I completely missed your review. Sorry about that. Hopefully we'll merge this in soon.

bob-carpenter

Looks good.

src/stan-users-guide/time-series.qmd

charlesm93 added 4 commits December 5, 2024 18:00

update time series section on hmm.

35ed46c

Merge branch 'master' of https://github.com/stan-dev/docs

12ab091

Merge branch 'master' into issue/838-hmm

cc0e2b3

correct typos in hmm documents.

38c48dc

bob-carpenter and others added 3 commits January 21, 2025 10:41

updated docs for hmm

18682cb

implement PR feedback for HMM doc.

78e92de

Merge branch 'issue/838-hmm' of https://github.com/stan-dev/docs into…

dea7cb2

… issue/838-hmm

bob-carpenter requested changes Mar 14, 2025

View reviewed changes

This was linked to issues Mar 27, 2025

rewrite HMM section of User's Guide to use new functions #548

Closed

Update Stan users guide chapter on HMMs #838

Closed

implement minor feedback from Bob.

7584f2c

bob-carpenter approved these changes Aug 26, 2025

View reviewed changes

src/stan-users-guide/time-series.qmd Show resolved Hide resolved

src/stan-users-guide/time-series.qmd Show resolved Hide resolved

src/stan-users-guide/time-series.qmd Show resolved Hide resolved

bob-carpenter merged commit d4f1f37 into master Aug 26, 2025

bob-carpenter deleted the issue/838-hmm branch August 26, 2025 22:46

		@@ -294,6 +294,21 @@ pagetitle: Alphabetical Index
		- <div class='index-container'>[distribution statement](unbounded_discrete_distributions.qmd#index-entry-0c7465aa1beceb6e7e303af36b60e2b847fc562a) <span class='detail'>(unbounded_discrete_distributions.html)</span></div>


		<a id='beta_neg_binomial_cdf' href='#beta_neg_binomial_cdf' class='anchored unlink'>beta_neg_binomial_cdf:</a>

Uh oh!

Issue/838 hmm #840

Issue/838 hmm #840

Uh oh!

Conversation

charlesm93 commented Dec 6, 2024

Submission Checklist

Summary

Copyright and Licensing

Uh oh!

bob-carpenter commented Dec 6, 2024

Uh oh!

charlesm93 commented Jan 15, 2025

Uh oh!

WardBrian commented Jan 15, 2025

Uh oh!

charlesm93 commented Mar 14, 2025

Uh oh!

bob-carpenter commented Mar 14, 2025

Uh oh!

bob-carpenter left a comment

Choose a reason for hiding this comment

Uh oh!

bob-carpenter Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

charlesm93 Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

charlesm93 Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bob-carpenter Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

charlesm93 Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

charlesm93 commented Apr 16, 2025

Uh oh!

bob-carpenter left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!