Supervised finetuning Gemma-9B on Google Colab #9

chiffonng · 2024-12-08T21:17:35Z

Closes #6 (supervised finetuning of Gemma-9B on Google Colab). The dataset and fine-tuned model are pushed to a Hugging Face collection.

This PR has the following main components:

- Adapted the Unsloth notebook for finetuning Gemma-2-9B
- Incorporated wandb to track finetuning
- Brute-forced values for the LoRA config, referring to Determined AI's experiments and Stephen Diehl's tutorial
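
The brute-force search over LoRA settings can be sketched as a plain grid enumeration. This is a minimal sketch, not the notebook's actual code: the candidate values and the `lora_grid` helper are hypothetical, and only the key names `r`, `lora_alpha`, and `lora_dropout` follow the usual LoRA hyperparameter naming.

```python
from itertools import product

# Hypothetical candidate values; the PR does not list the ones actually tried.
RANKS = [8, 16, 32]
ALPHAS = [16, 32]
DROPOUTS = [0.0, 0.05]


def lora_grid(ranks, alphas, dropouts):
    """Return every combination of the candidate LoRA hyperparameters."""
    return [
        {"r": r, "lora_alpha": alpha, "lora_dropout": dropout}
        for r, alpha, dropout in product(ranks, alphas, dropouts)
    ]


configs = lora_grid(RANKS, ALPHAS, DROPOUTS)
for config in configs:
    # Each dict would parameterize the adapter setup for one training run.
    print(config)
```

On a single T4 a full grid is expensive, so in practice the lists would probably be kept short and each run compared in the experiment tracker.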

Auxiliary:

- Much of the time went into setting up and deploying a Google Cloud Deep Learning VM instance to use an NVIDIA T4 for training
- Some time also went into connecting VSCode to that virtual machine and configuring git + SSH
- Set up wandb for use with the Hugging Face Trainer
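
Hooking wandb into the Hugging Face Trainer mostly comes down to a project name and the `report_to` argument. The sketch below is an assumption about how that could look, not the code in this PR: the helper `wandb_trainer_settings`, the project and run names, and the logging interval are all hypothetical.

```python
import os


def wandb_trainer_settings(project, run_name):
    """Set the wandb project and return logging kwargs for TrainingArguments."""
    # WANDB_PROJECT is the environment variable the wandb integration reads.
    os.environ["WANDB_PROJECT"] = project
    return {
        "report_to": "wandb",  # send Trainer logs to Weights & Biases
        "run_name": run_name,  # name shown in the wandb dashboard
        "logging_steps": 10,   # hypothetical logging interval
    }


# Hypothetical project and run names, for illustration only.
settings = wandb_trainer_settings("mnemonic-sft", "gemma-2-9b-lora")
print(settings)
```

The returned dict would be unpacked into `transformers.TrainingArguments` alongside the other training options. An API key still has to be provided separately, for example with `wandb login`.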

What I tried but failed:

- Used ClearML to track experiments, but it crashed training with Unsloth; see "Training crashes with ClearML" (unslothai/unsloth#1365)


chiffonng and others added 26 commits October 10, 2024 18:26

Add mnemonic examples to README

40efb3c

Refactor .gitignore and data_processing.py

e54f7b7

- Refactor .gitignore to exclude the /temp directory and ignore all .parquet and .csv files.
- Format mnemonics more consistently.
- Drop mnemonics with two words or fewer.
- Add more error handling for paths.
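
The rule of dropping mnemonics with two words or fewer could look like the following. This is a minimal sketch under assumptions: the function name `drop_short_mnemonics`, the `min_words` parameter, and the sample rows are hypothetical, and words are counted by splitting on whitespace, which may differ from what data_processing.py does.

```python
def drop_short_mnemonics(mnemonics, min_words=3):
    """Keep only mnemonics with at least `min_words` whitespace-separated words."""
    return [m for m in mnemonics if len(m.split()) >= min_words]


# Hypothetical sample rows.
samples = [
    "too short",
    "abate: a bait makes the fish's fear lessen",
    "single",
    "   padded   two  ",
]
kept = drop_short_mnemonics(samples)
print(kept)
```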

Categorize small number of mnemonics using OpenAI's API.

bc43357

Separate prompts storage to YAML files

7629b21

Refactor prompt config and mnemonic processing module

4d07824

Merge remote-tracking branch 'origin/main' into chiffonng/cap-11-gradio-chatbot-interface

34d0675

chore: Set up Gradio for the project

f37e49e

Add more linting rules, including bugbear checks

f62ad26

Refactor modules and add huggingface libraries

740beca

Refactor constants and enhance utility functions for better error handling and path validation

01bebf4

Sort imports and add more constants

e6dad93

Interact with Hugging Face dataset hub

22c8b8f

- Convert dataset into datasets.Dataset format
- Train test split the data
- Push data to HF hub
- Load data from HF hub
- Remove redundant functions
- Swap strings with constants

Add wandb as dependency and fix optional installations

c2fd805

Use type aliases for improved readability and maintainability

91345da

Initial Colab version

2afae11

Add initial notebook

756d5a1

Update mnemonic category mappings

e91c5f7

Minor fixes to data module

e7d29aa

Add new setup and packages for finetuning

747bb96

* Update setup script and environment variables
* Add ClearML and unsloth as dependencies in pyproject.toml
* Resolve other packages

Update CI workflow and dependencies

5373fbf

Fix requirements.txt

962f193

Fix environment setup on Linux VM

75c5396

* Downgrade Python version to 3.10
* Use conda as package manager for better integration of CUDA
* Add clearml for tracking experiments

Supervised finetuning with wandb tracking on Google Colab

d0eb4ff

Resolve some dependencies

8c70c5a

* Update .gitignore
* Modify .env.template to use wandb instead of clearml
* Add error handling with explicit enums
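
One way error handling with explicit enums might look is shown below. This is only an illustrative sketch: the commit does not say which values are enumerated, so the `Tracker` enum, its members, and `parse_tracker` are hypothetical.

```python
from enum import Enum


class Tracker(str, Enum):
    """Hypothetical set of supported experiment trackers."""

    WANDB = "wandb"
    CLEARML = "clearml"


def parse_tracker(value):
    """Convert a raw config string to a Tracker, failing with a clear message."""
    try:
        return Tracker(value.strip().lower())
    except ValueError:
        allowed = ", ".join(t.value for t in Tracker)
        raise ValueError(
            f"Unknown tracker {value!r}; expected one of: {allowed}"
        ) from None


print(parse_tracker(" WandB "))
```

Restricting inputs to enum members means a typo in a config file fails at load time with the list of valid options, rather than later inside training.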

Merge main into chiffonng/sft, favoring current branch changes

4718a65

Move notebook to notebooks/ folder

21fe800

chiffonng marked this pull request as ready for review December 11, 2024 23:07

chiffonng added 2 commits December 11, 2024 15:25

Fix CI

dbecdd7

Fix CI by adding back requirements.txt

01c098e

chiffonng merged commit bee206b into main Dec 11, 2024
1 check passed

chiffonng linked an issue Dec 11, 2024 that may be closed by this pull request

[CAP-9] Supervised fine-tuning #6

Closed

chiffonng changed the title ~~Supervised finetuning (Gemma-9B with generic instructions)~~ Supervised finetuning Gemma-9B on Google Colab Dec 11, 2024

chiffonng added the feature New feature label Dec 12, 2024

chiffonng deleted the chiffonng/sft branch February 21, 2025 23:25
