Chronos-2 fine-tuning does not respect the GPU index #457

Regardless of which GPU a `Chronos2Pipeline` is loaded on, fine-tuning always runs on GPU 0. This happens because of [this hack](https://github.com/amazon-science/chronos-forecasting/blob/1f099eb265a4b423529929321929d4258dc031d8/src/chronos/chronos2/pipeline.py#L325), which we use to disable data parallelism.


To reproduce:

```py
import numpy as np

from chronos import Chronos2Pipeline


def generate_data(num_items: int = 10_000):
    # Synthetic training set: `num_items` univariate series of Gaussian noise, 2048 steps each.
    rng = np.random.default_rng(seed=42)
    train_data = [{"target": rng.normal(size=2048)} for _ in range(num_items)]

    return train_data


def main():
    train_data = generate_data()
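    # Load the pipeline on a GPU other than 0.
    pipeline = Chronos2Pipeline.from_pretrained("amazon/chronos-2", device_map="cuda:5")
    print(pipeline.model.device)  # cuda:5
    # Expected: fine-tuning stays on cuda:5. Observed: the fine-tuned model is on cuda:0.
    ft_pipeline = pipeline.fit(train_data, context_length=512, prediction_length=64, num_steps=10)
    print(ft_pipeline.model.device)  # cuda:0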


if __name__ == "__main__":
    main()
```
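
Until this is fixed, a possible workaround is to hide every GPU except the intended one from the process, so that the device the fine-tuning code sees as `cuda:0` is the physical GPU you want. The sketch below is untested against this issue and relies on the standard `CUDA_VISIBLE_DEVICES` environment variable, which must be set before CUDA is initialized. The helper names `pin_visible_gpu` and `fine_tune_on_gpu` are hypothetical and not part of `chronos`.

```py
import os


def pin_visible_gpu(physical_index: int) -> str:
    """Expose only one physical GPU to this process.

    Hypothetical helper, not part of chronos. It must be called before
    CUDA is initialized (safest: before importing torch or chronos).
    Returns the device string under which that GPU is then visible.
    """
    if physical_index < 0:
        raise ValueError("physical_index must be non-negative")
    os.environ["CUDA_VISIBLE_DEVICES"] = str(physical_index)
    # With a single visible device, it is always enumerated as cuda:0.
    return "cuda:0"


def fine_tune_on_gpu(physical_index: int, train_data):
    """Hypothetical wrapper: fine-tune with only one GPU visible."""
    device = pin_visible_gpu(physical_index)

    # Imported after the environment variable is set, on purpose.
    from chronos import Chronos2Pipeline

    pipeline = Chronos2Pipeline.from_pretrained("amazon/chronos-2", device_map=device)
    # Same fit() arguments as in the reproduction script above.
    return pipeline.fit(train_data, context_length=512, prediction_length=64, num_steps=10)
```

With this approach, `ft_pipeline.model.device` would still print `cuda:0`, but that index would refer to the only visible device, i.e. physical GPU 5 when `fine_tune_on_gpu(5, train_data)` is called. Setting `CUDA_VISIBLE_DEVICES=5` in the shell before launching the script should have the same effect.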
