
[Train] Training Pipeline #1214

Open · wants to merge 10 commits into main
Conversation

horheynm
Collaborator

@horheynm horheynm commented Feb 28, 2025

Order of reviews:
#1206
#1207
#1209
#1212
#1214 <-- Here

SUMMARY:

  • Refactor the training pipeline
  • Remove initialize and finalize from the session functions
  • Add training information to entrypoints/readme.md covering the different types of training that can be carried out with llm-compressor
  • Decouple training from text_generation.py::main. The new logic lives in llmcompressor/entrypoints/train.py, which follows the flow: pre-process, run the training logic, then post-process
  • Delete outdated info in transformers/finetune/readme.md
  • Update session_mixin.py to call session().initialize or session().finalize directly
  • Deprecate train.py in text_generation.py, raising a deprecation message if used
  • Update tests to use llmcompressor's train, not llmcompressor.transformers' train
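The pre-process / train / post-process flow described above can be sketched roughly as follows. This is an illustrative outline only: the function names (`pre_process`, `run_training`, `post_process`) and the dict-based state are hypothetical stand-ins, not the actual llm-compressor API.

```python
# Hypothetical sketch of a decoupled train entrypoint.
# Names and the dict-based state are illustrative, not llm-compressor's API.

def pre_process(args):
    """Prepare model, tokenizer, and dataset before training."""
    return {"model": f"loaded:{args['model']}", "dataset": args["dataset"]}

def run_training(state):
    """Carry out the training loop on the prepared state."""
    state["trained"] = True
    return state

def post_process(state):
    """Save artifacts and clean up after training."""
    state["saved"] = True
    return state

def train(args):
    """Single entrypoint: pre-process, run training, post-process."""
    state = pre_process(args)
    state = run_training(state)
    return post_process(state)
```

The benefit of this shape is that each stage can be reused or tested in isolation, instead of living inside one monolithic `main`.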

TEST PLAN:

  • Pass tests

Signed-off-by: George Ohashi <[email protected]>

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

dsikka pushed a commit that referenced this pull request Mar 3, 2025
Order of reviews:
#1206
#1207 <-- Here
#1209 
#1212
#1214 

SUMMARY:
* Decouple arg parser to be used for both oneshot and train

TEST PLAN:
* Pass tests
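A parser shared between oneshot and train, as this commit describes, is commonly built with `argparse` parent parsers. The sketch below is hypothetical: the flag names are illustrative, not llm-compressor's actual arguments.

```python
# Illustrative sketch of an arg parser shared by oneshot and train.
# Flag names are hypothetical, not llm-compressor's real CLI.
import argparse

def build_shared_parser() -> argparse.ArgumentParser:
    """Arguments common to both oneshot and train entrypoints."""
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("--model", required=True)
    parser.add_argument("--dataset", default=None)
    return parser

def build_train_parser() -> argparse.ArgumentParser:
    """Train-specific arguments layered on top of the shared ones."""
    parser = argparse.ArgumentParser(parents=[build_shared_parser()])
    parser.add_argument("--num-train-epochs", type=int, default=1)
    return parser
```

With `parents=[...]` and `add_help=False` on the shared parser, both entrypoints accept the common flags without duplicating their definitions.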
dsikka added a commit that referenced this pull request Mar 5, 2025
Order of reviews:
#1206  <-- Here
#1207
#1209 
#1212
#1214 

SUMMARY:
Rename data_args to dataset_args

TEST PLAN:
Pass tests
Find `data_args` using `grep`

---------

Signed-off-by: George Ohashi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
dsikka pushed a commit that referenced this pull request Mar 5, 2025
Order of reviews:
#1206
#1207
#1209 <-- Here
#1212
#1214 

SUMMARY:
* Move dataset logic out of transformers module
`src/llmcompressor/transformers/finetune/data/data_helpers.py`, add it
to `src/llmcompressor/datasets/utils.py`


TEST PLAN:
Pass tests
dsikka pushed a commit that referenced this pull request Mar 6, 2025
…ot (#1212)

Order of reviews:
#1206
#1207
#1209
#1212  <-- Here
#1214

SUMMARY:
* Move the preprocessing and postprocessing logic out of
`src/llmcompressor/transformers/finetune/text_generation.py` and into
`src/llmcompressor/entrypoints/utils.py`

TEST PLAN:
Pass tests
@horheynm horheynm added the ready When a PR is ready for review label Mar 6, 2025
@horheynm horheynm changed the title [Train] Main refac [Train] Training Pipeline Mar 6, 2025
@rahul-tuli
Collaborator

LGTM pending tests!

brian-dellabetta pushed a commit that referenced this pull request Mar 10, 2025
…ot (#1212)

Order of reviews:
#1206
#1207
#1209
#1212  <-- Here
#1214

SUMMARY:
* Move the preprocessing and postprocessing logic out of
`src/llmcompressor/transformers/finetune/text_generation.py` and into
`src/llmcompressor/entrypoints/utils.py`

TEST PLAN:
Pass tests

Signed-off-by: Brian Dellabetta <[email protected]>
Collaborator

@dsikka dsikka left a comment


Please write a more descriptive PR description, summarizing changes and test steps.
The current description isn't very helpful.

@@ -58,79 +55,6 @@ def reset_session():
session._lifecycle.reset()


def initialize(
Collaborator


Why are we removing initialize? This seems out of scope of these changes and we still plan on supporting initialize into the future

Collaborator Author


Sorry, why do we need to keep it? This function is just a pointer to active_session().initialize.
For this PR's scope we're trimming unnecessary pathways in train, and this function doesn't do anything on its own.
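The relationship being debated here can be sketched minimally: a module-level `initialize()` that only forwards to `active_session().initialize`. The class and internals below are simplified stand-ins for illustration, not LLM Compressor's actual session implementation.

```python
# Minimal sketch of a module-level wrapper over a session method.
# _Session and its internals are simplified stand-ins for illustration.

class _Session:
    def __init__(self):
        self.initialized = False

    def initialize(self, **kwargs):
        self.initialized = True
        return kwargs

_ACTIVE = _Session()

def active_session() -> _Session:
    """Return the process-wide active session."""
    return _ACTIVE

def initialize(**kwargs):
    """Module-level convenience wrapper; adds no behavior of its own."""
    return active_session().initialize(**kwargs)
```

The trade-off under discussion: the wrapper adds no logic, but it is a stable public name that callers may depend on, which is why removing it is contested.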

Collaborator


What is the purpose of your PR, and how does it relate to removing this function?

Anything can be deemed "unnecessary", but in this case initialize() is core functionality of LLM Compressor and a user-level API we're very likely to maintain.
