Show one progress bar per chain when sampling #7634

jessegrabowski · 2025-01-06T14:31:06Z

Description

I really like what nutpie gives you while sampling, so I tried to make something using rich that copies it. Example:

test_lr_scheduler.-.Jupyter.Notebook.Mozilla.Firefox.2025-01-07.10-57-56.mp4

Features are:

One progress bar per chain
Sampling statistics per chain. I copied nutpie, but we can haggle over what these should be (or give the user more control)
Color change based on status. Blue when sampling, turns red after a divergence. Finished bar is either green (no divergences) or purple (with divergences).

Related Issue

Closes #
Related to #

Checklist

Checked that the pre-commit linting/style checks pass
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks)
If you are a pro: each commit corresponds to a relevant logical change

Type of change

📚 Documentation preview 📚: https://pymc--7634.org.readthedocs.build/en/7634/

ricardoV94

How does this look like when 1) you have another step sampler in the mix and 2) there's no nuts at all or 3) there are more than one NUTS step samplers?

ricardoV94 · 2025-01-07T13:42:04Z

pymc/util.py

@@ -1,4 +1,4 @@
-#   Copyright 2024 The PyMC Developers
+#   Copyright 2025 The PyMC Developers


According to Oriol this should just be 20xx-present

ricardoV94 · 2025-01-07T13:46:48Z

Doesn't need to be this PR but would be nice to show a relevant statistic for each sampler (or at least for when a single non NUTS sampler is being used).

Conversely not showing these columns when there's no NUTS, as it gives a false sense of everything is going great

jessegrabowski · 2025-01-07T14:12:34Z

test_lr_scheduler.-.Jupyter.Notebook.Mozilla.Firefox.2025-01-07.22-09-52.mp4

Here's a comparison between NUTS and non-nuts sampler.

Ideally we'd add a method to the step samplers themselves that would return the rich columns that sampler wants to use, then we just gather them and display. In that case you could even different sampler stats from different steps in the same run. Maybe it's worth doing. The actual code for this PR is pretty gnarly.

jessegrabowski · 2025-01-08T10:21:31Z

I moved the responsibility for setting up the progressbars and updating stats to the step samplers. This means each step method can choose what stats are to be shown on the progress bars, and we can also combine them. Example vid attached.

test_lr_scheduler.-.Jupyter.Notebook.Mozilla.Firefox.2025-01-08.18-16-37.mp4

This is a pretty big scope creep for this PR, so I'm not against reverting these changes and going with something more basic. If we like it though I can lean into it.

I will say it's broken right now because when you have e.g. multiple metropolis steps (one per variable) the only stats that get reported are the last one. It needs some logic on how to aggregate the stats across samplers with the same stats.

ricardoV94 · 2025-01-08T10:22:45Z

The step sampler specifics looks amazing 😍 Gonna give it a try today.

I'll test it but I assume things behave gracefully if the step samplers don't specify the display columns info?

jessegrabowski · 2025-01-08T10:24:29Z

No it will break. I need to put in a default for the base class. It just needs to return empty stuff.

ricardoV94 · 2025-01-08T10:27:41Z

I would still like to see the global runtime and /eta like we had before. Is that feasible or too ugly?

Re: repeated samplers, show the mean? Or maybe only display specialized info when a single step sampler is being used?

jessegrabowski · 2025-01-08T10:35:19Z

I added the base impl, so things will go gracefully if there's no implementation. This would only show the NUTS stats for example, because there's no implementation for BinaryGibbsMetropolis:

import pymc as pm

with pm.Model() as m:
    x = pm.Bernoulli('x', p=0.5)
    y = pm.Normal('y', mu=pm.math.switch(x, -3, 3), sigma=10, shape=(10,))
    
    idata = pm.sample(step=[pm.BinaryGibbsMetropolis(x), pm.NUTS(y)], tune=2000, draws=2000, chains=8, cores=8, compile_kwargs={'mode':'NUMBA'})

Re: global, yes we can keep it. But we can't have it as a single long bar that breaks the columns, because there's no colspan operator for rich tables (see Textualize/rich#164).

We could make a separate table though. It just won't be as pretty as nutpie.

I was thinking about the mean as well. If it only shows up when there's a single step sampler it would be pretty rare that anyone would use it, because the non-NUTS samplers pretty much always show up as one per variable.

We might also need some priority logic to decide what to show if too many stats get involved. You can see just NUTS + Metropolis already breaks the table. We could do a LOW/MEDIUM/HIGH priority for displaying stats, and only at max 5 ever get displayed?

ricardoV94 · 2025-01-09T20:13:42Z

Sequential sampling (cores=1) still has the old approach. It has one bar per chain but not the stats

ricardoV94 · 2025-01-09T20:14:12Z

Re: global, yes we can keep it. But we can't have it as a single long bar that breaks the columns, because there's no colspan operator for rich tables (see Textualize/rich#164).

What if we show as a column per chain then? elapsed/left?

jessegrabowski · 2025-01-12T11:45:55Z

New version with timing info:

test_lr_scheduler.-.Jupyter.Notebook.Mozilla.Firefox.2025-01-12.19-44-52.mp4

ricardoV94 · 2025-01-14T15:20:50Z

Sequential sampling (cores=1) still has the old approach. It has one bar per chain but not the stats

Did you address this?

jessegrabowski · 2025-01-14T16:08:39Z

Not yet, but it will be an easy fix.

ricardoV94 · 2025-01-14T16:10:32Z

Some failing tests as well. Otherwise I'm happy with the changes. I'll paste in the discord to see if anybody has big complaints

aloctavodia · 2025-01-15T06:18:37Z

This is looks really nice and modern and its very informative, but do we have an option for a single progress bar with less information.

jessegrabowski · 2025-01-15T11:23:09Z

We can do that painlessly yeah

twiecki · 2025-01-15T11:46:35Z

This looks great. I assume blue means tuning and red means post-tuning? If so, I wonder if red is the best color choice as it suggests something gone wrong. Maybe replace red with green? Or make tuning red and sampling blue?

ricardoV94 · 2025-01-15T12:03:22Z

This looks great. I assume blue means tuning and red means post-tuning? If so, I wonder if red is the best color choice as it suggests something gone wrong. Maybe replace red with green? Or make tuning red and sampling blue?

It turns red if there's any divergence

twiecki · 2025-01-15T15:46:50Z

I see, maybe then green post-tuning without divergences? Or maybe a non-colorblind color.

aloctavodia · 2025-01-15T16:05:49Z

If you want some colorblind-friendly palletes https://github.com/arviz-devs/arviz-plots/tree/main/src/arviz_plots/styles

fonnesbeck · 2025-01-15T16:16:31Z

I like it a lot.

I don't know if we need different colors for pre-/post-tuning.

Can we get red for any warning, not just divergences (so that it makes users read the warning)?

Definitely no green if we are using red. I like blue/red for clean/warning.

ricardoV94 · 2025-01-15T18:49:35Z

I think 2 colors is enough. It will be clear when you use compared to a gif. Otherwise a single color.

@fonnesbeck what sort of warnings are you thinking about? Are they emmited during sampling or only at the end?

fonnesbeck · 2025-01-15T19:26:53Z

Things like max tree depth, missing the target acceptance rate and r-hat. Those that are calculated at the end could flip the line to red upon completion.

Not a big deal, though. What's here is a big improvement!

jessegrabowski changed the title ~~Show one progress bars per chain when sampling~~ Show one progress bar per chain when sampling Jan 6, 2025

jessegrabowski requested review from fonnesbeck, tomicapretto and ricardoV94 January 6, 2025 14:32

ricardoV94 reviewed Jan 7, 2025

View reviewed changes

ricardoV94 added maintenance samplers labels Jan 7, 2025

jessegrabowski force-pushed the more-progress branch from 3db028d to 65af907 Compare January 8, 2025 11:11

jessegrabowski added 9 commits January 10, 2025 12:35

One progress bar per chain when samplings

693ebe5

Add guard against divide by zero when computing draws per second

eb8aefd

No more purple

0f0414d

Step samplers are responsible for setting up progress bars

dae4c6d

Fix typos

5c2c968

Add progressbar defaults to BlockedStep ABC

444e394

pre-commit

1ab0b1c

Only update NUTS divergence stats after tuning

2bf89ff

Add Elapsed and Remaining columns

667c78e

jessegrabowski force-pushed the more-progress branch from 5f5c648 to 667c78e Compare January 10, 2025 04:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Show one progress bar per chain when sampling #7634

Show one progress bar per chain when sampling #7634

jessegrabowski commented Jan 6, 2025 •

edited

Loading

ricardoV94 left a comment •

edited

Loading

ricardoV94 Jan 7, 2025

ricardoV94 commented Jan 7, 2025

jessegrabowski commented Jan 7, 2025

jessegrabowski commented Jan 8, 2025

ricardoV94 commented Jan 8, 2025

jessegrabowski commented Jan 8, 2025 •

edited

Loading

ricardoV94 commented Jan 8, 2025

jessegrabowski commented Jan 8, 2025 •

edited

Loading

ricardoV94 commented Jan 9, 2025

ricardoV94 commented Jan 9, 2025

jessegrabowski commented Jan 12, 2025

ricardoV94 commented Jan 14, 2025

jessegrabowski commented Jan 14, 2025

ricardoV94 commented Jan 14, 2025

aloctavodia commented Jan 15, 2025

jessegrabowski commented Jan 15, 2025

twiecki commented Jan 15, 2025

ricardoV94 commented Jan 15, 2025

twiecki commented Jan 15, 2025

aloctavodia commented Jan 15, 2025

fonnesbeck commented Jan 15, 2025 •

edited

Loading

ricardoV94 commented Jan 15, 2025

fonnesbeck commented Jan 15, 2025

		@@ -1,4 +1,4 @@
		# Copyright 2024 The PyMC Developers
		# Copyright 2025 The PyMC Developers

Show one progress bar per chain when sampling #7634

Are you sure you want to change the base?

Show one progress bar per chain when sampling #7634

Conversation

jessegrabowski commented Jan 6, 2025 • edited Loading

Description

Related Issue

Checklist

Type of change

ricardoV94 left a comment • edited Loading

Choose a reason for hiding this comment

ricardoV94 Jan 7, 2025

Choose a reason for hiding this comment

ricardoV94 commented Jan 7, 2025

jessegrabowski commented Jan 7, 2025

jessegrabowski commented Jan 8, 2025

ricardoV94 commented Jan 8, 2025

jessegrabowski commented Jan 8, 2025 • edited Loading

ricardoV94 commented Jan 8, 2025

jessegrabowski commented Jan 8, 2025 • edited Loading

ricardoV94 commented Jan 9, 2025

ricardoV94 commented Jan 9, 2025

jessegrabowski commented Jan 12, 2025

ricardoV94 commented Jan 14, 2025

jessegrabowski commented Jan 14, 2025

ricardoV94 commented Jan 14, 2025

aloctavodia commented Jan 15, 2025

jessegrabowski commented Jan 15, 2025

twiecki commented Jan 15, 2025

ricardoV94 commented Jan 15, 2025

twiecki commented Jan 15, 2025

aloctavodia commented Jan 15, 2025

fonnesbeck commented Jan 15, 2025 • edited Loading

ricardoV94 commented Jan 15, 2025

fonnesbeck commented Jan 15, 2025

jessegrabowski commented Jan 6, 2025 •

edited

Loading

ricardoV94 left a comment •

edited

Loading

jessegrabowski commented Jan 8, 2025 •

edited

Loading

jessegrabowski commented Jan 8, 2025 •

edited

Loading

fonnesbeck commented Jan 15, 2025 •

edited

Loading