Add custom group_ids support to Chronos2Pipeline by StatMixedML · Pull Request #429 · amazon-science/chronos-forecasting

StatMixedML · 2025-12-09T13:49:27Z

Summary

This PR adds support for custom group IDs in Chronos2Pipeline, enabling fine-grained control over which time series share information during prediction through cross-attention. Users can now specify meaningful groupings (e.g., by geography, sector, ...) to improve forecast accuracy while preventing information leakage between unrelated series.

Motivation

Currently, users can either:

Predict each series independently (default)
Share information across all series in a batch (cross_learning=True)

This PR adds a middle ground: selective information sharing where only series within the same group exchange information, while different groups remain independent.

Changes

Core API Changes

Added group_ids parameter to predict_df() and predict_quantiles()
Added helper functions in src/chronos/utils.py

Backward Compatibility

✅ Fully backward compatible

Default behavior unchanged when group_ids=None
Existing cross_learning parameter works as before
No changes to training code or dataset construction

Documentation

✅ Comprehensive docstrings for all new parameters
✅ Updated notebooks/chronos-2-quickstart.ipynb with examples
✅ Helper function documentation with examples

abdulfatir · 2025-12-12T10:56:56Z

@StatMixedML Thanks for the PR. We are currently working towards AutoGluon 1.5, so I will take a careful look after the release. Before that, one small feedback I have after a quick skim is that a 900 line PR sounds a bit too much to enable this capability. Could you please check if the size of the PR can be reduced?

shchur · 2025-12-12T11:09:42Z

src/chronos/chronos2/pipeline.py

            batch_future_covariates = batch["future_covariates"]
            batch_target_idx_ranges = batch["target_idx_ranges"]

-            if cross_learning:


Could the user instead just run something like the following code?

prediction_per_group = [] for _, group_df in df.groupby("group_id"): prediction_per_group.append(pipeline.predict(group_df.drop(columns=["group_id"], cross_learning=True, ...)) predictions = pd.concat(prediction_per_group)

@shchur I suppose your suggestion is an elegant way of doing it :-) It works the same way the PR suggests

StatMixedML · 2025-12-12T12:44:02Z

@StatMixedML Thanks for the PR. We are currently working towards AutoGluon 1.5, so I will take a careful look after the release. Before that, one small feedback I have after a quick skim is that a 900 line PR sounds a bit too much to enable this capability. Could you please check if the size of the PR can be reduced?

Most of the additions come from the notebook examples and the unit tests, so the actual changes in pipeline.py are rather minor. Given @shchur' suggestion, we can also go with the loop approach.

abdulfatir · 2025-12-12T14:38:33Z

@StatMixedML Thanks! In that case, maybe users can just go with the idea that @shchur suggested. That said, it would be cool to cover non-trivial grouping in the "advanced" section of the tutorials. Do you happen to have good (preferably not synthetic) examples of such grouping helping accuracy?

StatMixedML · 2025-12-12T15:33:52Z

@StatMixedML Thanks! In that case, maybe users can just go with the idea that @shchur suggested.

Shall I then close the PR?

That said, it would be cool to cover non-trivial grouping in the "advanced" section of the tutorials. Do you happen to have good (preferably not synthetic) examples of such grouping helping accuracy?

I can keep the PR open and add some examples to the notebook using publicly available data? Sth. like M5 or some monthly seasonal data with geographic grouping?

StatMixedML added 2 commits December 9, 2025 14:40

Add custom group_ids support to Chronos2Pipeline

d5b5282

Merge branch 'main' into custom_group_ids

0e621fa

shchur reviewed Dec 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add custom group_ids support to Chronos2Pipeline#429

Add custom group_ids support to Chronos2Pipeline#429
StatMixedML wants to merge 2 commits intoamazon-science:mainfrom
StatMixedML:custom_group_ids

StatMixedML commented Dec 9, 2025

Uh oh!

abdulfatir commented Dec 12, 2025

Uh oh!

shchur Dec 12, 2025

Uh oh!

StatMixedML Dec 12, 2025

Uh oh!

StatMixedML commented Dec 12, 2025

Uh oh!

abdulfatir commented Dec 12, 2025

Uh oh!

StatMixedML commented Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

StatMixedML commented Dec 9, 2025

Summary

Motivation

Changes

Core API Changes

Backward Compatibility

Documentation

Uh oh!

abdulfatir commented Dec 12, 2025

Uh oh!

shchur Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

StatMixedML Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

StatMixedML commented Dec 12, 2025

Uh oh!

abdulfatir commented Dec 12, 2025

Uh oh!

StatMixedML commented Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants