Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions units/en/unit1/4.md
Original file line number Diff line number Diff line change
Expand Up @@ -1327,7 +1327,7 @@ trl sft --config sft_config.yaml
**If models fail to load:**
- Check your internet connection
- Try using `device_map="cpu"` for CPU loading
- Use a smaller model like `HuggingFaceTB/SmolLM3-1.7B` for testing
- Use a smaller model like `HuggingFaceTB/SmolLM-1.7B` for testing
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why not HuggingFaceTB/SmolLM2-1.7B?


**If training fails:**
- Make sure your dataset is properly formatted
Expand All @@ -1352,4 +1352,4 @@ These skills form the foundation for building sophisticated instruction-tuned mo
- [SmolLM3 Model Card](https://huggingface.co/HuggingFaceTB/SmolLM3-3B) - Model details
- [SmolTalk2 Dataset](https://huggingface.co/datasets/HuggingFaceTB/smoltalk2) - Training data
- [Hugging Face Hub](https://huggingface.co/models) - Share your models
- [Discord Community](https://discord.gg/UrrTSsSyjb) - Get help and discuss
- [Discord Community](https://discord.gg/UrrTSsSyjb) - Get help and discuss