
[Callbacks] Remove pre_initialize_structure #1160

Merged
merged 29 commits into main on Feb 26, 2025

Conversation

kylesayrs
Collaborator

@kylesayrs kylesayrs commented Feb 17, 2025

Purpose

  • Remove pre_initialize_structure to simplify codebase
  • Fix recipe appending when appending a recipe to a model that already has a recipe
  • Remove misleading logging messages
2025-02-17T17:48:38.477750-0500 | _check_create_state | INFO - State created for compression lifecycle
2025-02-17T17:48:38.478670-0500 | pre_initialize_structure | INFO - Compression lifecycle structure pre-initialized for 0 modifiers
2025-02-17T17:48:38.478836-0500 | pre_initialize_structure | INFO - Compression lifecycle structure pre-initialized for 0 modifiers

Prerequisites

Follow-ups

  • Remove double initialization

Changes

The preinitialization step used to fulfill a few purposes:

  • Construct the lifecycle state
    • This is now done by the dataclass directly (see the sketch after this list)
- state: Optional[State] = None
+ state: Optional[State] = field(default_factory=State)
  • Populate state with model and recipe
    • This is now done (and has always been done) by initialize
    • Some functions such as Trainer.init_model attempt to access the model through the session before initialize is called. In these cases, we can pass the model directly
trainer = Trainer(
-     model_init=get_session_model,
+     model_init=lambda: model,
  • Prepend recipes to the recipe.yaml if the model has already been compressed once
    • Move this logic from preinitialization to the save_pretrained function
    • Consolidate all save pathways to use the same wrapped method
def save_pretrained_wrapper(...):
    update_and_save_recipe(model.name_or_path, save_directory)
  • Provide a way for modifiers to influence the model after they have already been applied

    • This can still be enacted via recipe validation, but it likely no longer has a use case and shouldn't be done automatically; at most, LLM Compressor should warn if the recipe configuration is invalid or requires modification
  • Create quantization modifier on GPTQ

    • This is now done within the on_initialize function
    • In the future, this should be done by a high-level recipe validation step
def on_initialize(...):
-     self.on_initialize_structure(state, **kwargs)
+     self._maybe_build_quant_modifier(state.model)
  • Remove EventType.order() method which is unused

  • Extend the Recipe.simplify_recipe class method to support strings
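
To make the first change above concrete, here is a minimal sketch of the dataclass-default pattern; the class and field names below are simplified stand-ins for the real State and lifecycle classes:

from dataclasses import dataclass, field
from typing import Optional

@dataclass
class State:
    # Holds the model, recipe, and other data shared across the lifecycle
    model: Optional[object] = None
    recipe: Optional[object] = None

@dataclass
class CompressionLifecycle:
    # Previously `state: Optional[State] = None`, which required a separate
    # pre-initialization step to construct it. With a default factory, every
    # lifecycle starts with a usable (empty) State, and initialize() only
    # has to populate it with the model and recipe.
    state: State = field(default_factory=State)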

Lifecycle

  1. create_session() (doesn't do much and can be hidden behind initialize)
  2. initialize(model=..., recipe=...)
    1. Maybe start modifiers
  3. LifecycleCallback.event(...)
    1. Maybe start/end modifiers
  4. finalize()
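
For reference, a hedged sketch of what this lifecycle looks like to a caller. The function names come from the list above, but the import path and the callback helpers are assumptions rather than confirmed llm-compressor API:

# Sketch only: the import path and callback helper names are assumptions
from llmcompressor.core import callbacks, create_session, finalize, initialize

def run_compression(model, recipe, dataloader):
    create_session()                        # 1. thin setup; could be hidden behind initialize
    initialize(model=model, recipe=recipe)  # 2. populates State, maybe starts modifiers

    for batch in dataloader:                # 3. events may start/end modifiers
        callbacks.batch_start(batch_data=batch)
        # ... forward / backward / optimizer step ...
        callbacks.batch_end()

    finalize()                              # 4. ends any remaining modifiers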

Regression Evaluation

Main

vllm (pretrained=/home/kyle/llm-compressor/Meta-Llama-3-8B-Instruct-W4A16-G128,dtype=bfloat16,add_bos_token=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
|  Tasks   |Version|Filter|n-shot|Metric|   |Value |   |Stderr|
|----------|------:|------|-----:|------|---|-----:|---|-----:|
|winogrande|      1|none  |     5|acc   |↑  |0.7482|±  |0.0122|

This branch

vllm (pretrained=/home/kyle/llm-compressor/Meta-Llama-3-8B-Instruct-W4A16-G128,dtype=bfloat16,add_bos_token=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
|  Tasks   |Version|Filter|n-shot|Metric|   |Value |   |Stderr|
|----------|------:|------|-----:|------|---|-----:|---|-----:|
|winogrande|      1|none  |     5|acc   |↑  |0.7482|±  |0.0122|


👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

@kylesayrs kylesayrs marked this pull request as draft February 17, 2025 19:29
@kylesayrs kylesayrs added the ready label Feb 17, 2025
@kylesayrs kylesayrs marked this pull request as ready for review February 17, 2025 20:47
@kylesayrs kylesayrs self-assigned this Feb 17, 2025
@kylesayrs kylesayrs marked this pull request as draft February 17, 2025 23:28
@kylesayrs kylesayrs removed the ready label Feb 18, 2025
@kylesayrs kylesayrs added the ready label Feb 18, 2025
@kylesayrs kylesayrs marked this pull request as ready for review February 18, 2025 05:09
@kylesayrs kylesayrs changed the base branch from main to kylesayrs/consolidate-saving February 18, 2025 21:18
horheynm previously approved these changes Feb 20, 2025
Collaborator

@horheynm horheynm left a comment


Good job XD

dsikka added a commit that referenced this pull request Feb 25, 2025
## Purpose ##
* Simplify all methods of saving into one point, namely the wrapped
`save_pretrained` function
* Precursor to #1160
* Needed for having a single point for saving on top of existing recipes

## Background ## 
All the things that need to be done during saving:
1. Save the model weights, potentially compressed
2. Save the processor
3. Update the recipe checkpoint
4. Copy any necessary python files from the model cache
5. Only save on the main process

After these changes, (1, 2, 3, 4) will be done within the
`save_pretrained` function, and (5) will be the responsibility of the
caller. (3) will be implemented by #1160 so as not to conflict with
existing logic in pre_init

All of the places where a model is saved are:
* If an output dir is specified, at the end of the main function
* Between stages of the stage runner
* Between epochs of the HF Trainer
* By the user after oneshot/training completes

After these changes, all of these will be replaced by a single
`save_checkpoint` function which calls `save_pretrained` to do all the
necessary things.

## Changes ##
* Remove `save_model_and_recipe`
  * Saving recipes is now done by `save_pretrained` function
* Implement `save_checkpoint`
  * Single entrypoint for saving a model and its processor
  * Performs actions (1, 2, 4)
* Replace all locations where a model is saved with `save_checkpoint`
  * All applicable callers now handle (5) by only saving on the main process
* Remove support for `modify_fsdp_model_save_pretrained` and
`unwrap_and_export_model`, to be added back in a future release

---------

Signed-off-by: Kyle Sayers <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
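
The `save_checkpoint` entrypoint described in the commit above might look roughly like the following sketch (illustrative only, not the actual implementation):

def save_checkpoint(save_path, model, processor):
    # (1, 2) save the weights (potentially compressed) and the processor through
    # the wrapped save_pretrained, which also (3) updates the recipe checkpoint
    # and (4) copies any necessary python files from the model cache
    model.save_pretrained(save_path)
    processor.save_pretrained(save_path)
    # (5) saving only on the main process remains the caller's responsibility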
Base automatically changed from kylesayrs/consolidate-saving to main February 25, 2025 15:46
@dsikka dsikka dismissed horheynm’s stale review February 25, 2025 15:46

The base branch was changed.

Collaborator

@brian-dellabetta brian-dellabetta left a comment


looking good, but i'm going off the assumption that we don't actually need any of this deleted code

@kylesayrs
Collaborator Author

kylesayrs commented Feb 25, 2025

@brian-dellabetta The PR description lists all of the functionality that was needed by preinitialize and where that functionality now lives

Collaborator

@dsikka dsikka left a comment


great work!

@dsikka dsikka merged commit a88b72b into main Feb 26, 2025
7 checks passed
@dsikka dsikka deleted the kylesayrs/remove-preinitialize-structure branch February 26, 2025 20:37
kylesayrs added a commit that referenced this pull request Mar 11, 2025
## Purpose ##
* Fix staged 2of4 example

## Background ##
* When #1160 landed, it introduced a bug in the recipe container which
meant that the recipe was not recompiled after `append`ing. This caused
sgpt to initialize twice and gptq to never initialize, leading to a
sparsity-only quantization config
* At some point, a change was introduced which causes previous stages
to be reconstructed after recipe recompilation. This means that
without resetting the session in between stages, previous stages will
initialize twice.
* In order to avoid this issue, this PR introduces `session.reset()` in
between stages
* This change has the consequence of creating `recipe.yaml` files which
do not have the full recipe history. However, I believe this is
acceptable for the time being, as the stage runner and this workflow
will be removed in the next release.

---------

Signed-off-by: Kyle Sayers <[email protected]>
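
The fix described in the commit above boils down to calling `session.reset()` between stages. A minimal sketch under stated assumptions; `get_session()`, `recipe.stages`, and the import path are illustrative placeholders:

from llmcompressor.core import finalize, initialize

def run_stages(model, recipe, get_session):
    for stage in recipe.stages:
        initialize(model=model, recipe=stage)
        # ... run oneshot / training for this stage ...
        finalize()
        # reset between stages so that earlier stages are not re-initialized
        # when the recipe is recompiled by a later stage
        get_session().reset()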
Labels
ready When a PR is ready for review
4 participants