This repository stores the main brain of my project:
- Finetuning Llama models
- Running inference on finetuned models.
Required dependencies such as datasets and models can be found below:
Datasets:
- MCV @ CMP3753-Project-MCV
- ABCD @ CMP3753-Project-ABCD
Models:
- Llama 3.1 8B Instruct @ Hugging Face
- Llama 3.1 8B @ Hugging Face
See SETUP.md for detailed setup instructions.
A nice and easy one-liner for each dataset after you've gone through all the effort of setup :)
(venv) python finetune_ABCD.py
(venv) python finetune_MCV.py
(venv) indicates the virtual environment has been activated; see SETUP.md#virtual-environment for a refresher on how to do that.
To run archived finetuning scripts, copy them out of the archived_finetune_scripts directory into the root directory, then run them the same way:
(venv) python old_finetune.py
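If you would rather script the copy step than do it by hand, here is a minimal sketch. It assumes the layout described above (an archived_finetune_scripts directory sitting in the project root). The helper name restore_archived_script and its parameters are hypothetical and are not part of this repository.

```python
import shutil
from pathlib import Path


def restore_archived_script(script_name, project_root=".",
                            archive_dir="archived_finetune_scripts"):
    """Copy one archived script into the project root and return its new path.

    Hypothetical helper: the directory name comes from this README,
    everything else is an assumption made for illustration.
    """
    root = Path(project_root)
    source = root / archive_dir / script_name
    if not source.is_file():
        raise FileNotFoundError(f"No archived script at {source}")

    destination = root / script_name
    # Refuse to clobber a script that already lives in the root directory.
    if destination.exists():
        raise FileExistsError(f"{destination} already exists; not overwriting")

    shutil.copy2(source, destination)  # copy2 also preserves timestamps
    return destination
```

Calling restore_archived_script("old_finetune.py") from the project root would then leave a copy ready to run with the command above.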
Another one-liner for running the latest finetuned model (Note: the finetuning MUST be completed first)
(venv) python inference.py
To change the model used, edit line 5 of inference.py:
MODEL_PATH = "./finetuned/67_results_abcd"
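Since finetuning has to finish before inference can run, a quick sanity check on MODEL_PATH can save a confusing load error. The sketch below is not taken from inference.py; check_model_path is a hypothetical helper, and it only assumes that a completed finetune leaves a non-empty directory at the configured path.

```python
from pathlib import Path

# Same value as shown above; change it to point at another finetuned model.
MODEL_PATH = "./finetuned/67_results_abcd"


def check_model_path(model_path):
    """Return model_path as a Path if it is a non-empty directory.

    Hypothetical helper: it does not validate the model files themselves,
    only that the finetuning output directory exists and has content.
    """
    path = Path(model_path)
    if not path.is_dir():
        raise FileNotFoundError(
            f"{path} not found; has the finetuning run completed?"
        )
    if not any(path.iterdir()):
        raise FileNotFoundError(f"{path} is empty; finetuning may have failed")
    return path
```

Calling check_model_path(MODEL_PATH) before the model is loaded turns a missing or half-written output directory into an explicit error message.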
Note: this inference.py is specific to the latest finetune.py found in the root of the project. Different finetune scripts use slightly different parameters, and in earlier versions especially (0-20) the inference script had to be changed to run each finetuned model.
If you are desperate to run these old models, see archived_inference_scripts, which may contain the parameters needed to load certain older models.
-- NO SUPPORT FOR OLD MODELS --
CMP3753 Project - Action Tokens in LLM's
Copyright (C) 2025 Jack Honour
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU Affero General Public License as published
by the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License
along with this program. If not, see <https://www.gnu.org/licenses/>.