
Update finetune and oneshot tests #114

Merged
merged 6 commits into from
Sep 5, 2024

Conversation

Collaborator

@dsikka dsikka commented Aug 26, 2024

SUMMARY:

  • Update the finetune tests so that they can run on a nightly cadence. This includes defining a simple config file, used by the tests, that specifies the cadence.
  • Mark the finetune tests that are not integration tests as unit tests.
  • oneshot does not appear to work with `device: "auto"`; update the tests to use `device: "cuda:0"` instead.
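The cadence config can be as simple as a small file each test reads before deciding whether to run. A minimal sketch of that idea (the `CADENCE` environment variable, the JSON format, and the `requires_cadence` helper are illustrative assumptions, not the exact mechanism in this PR):

```python
import json
import os
import unittest

# Illustrative config contents; the PR describes "a simple config file
# that defines the cadence" without pinning down its exact format.
CONFIG_TEXT = '{"cadence": "nightly"}'


def requires_cadence(config_text: str):
    """Skip the decorated test unless its configured cadence matches the
    CADENCE environment variable set for this CI run."""
    configured = json.loads(config_text).get("cadence", "commit")
    current = os.environ.get("CADENCE", "commit")
    return unittest.skipUnless(
        configured == current,
        f"test runs on '{configured}' cadence, not '{current}'",
    )
```

A nightly job would then export `CADENCE=nightly`, and commit-time CI would leave it unset, so nightly-cadence tests are skipped on every push but picked up by the scheduled run.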

@dsikka dsikka requested a review from Satrat August 26, 2024 20:52
Collaborator

@kylesayrs kylesayrs left a comment


Nice

Comment on lines +117 to 120
```yaml
- name: "🔬 Running transformers tests"
  if: always() && steps.install.outcome == 'success'
  run: |
    pytest tests/llmcompressor/transformers/compression -v
```
Contributor


What about the tests under obcq/gptq/oneshot/sparsification?

Collaborator Author


They're all running (lines 117 onward); this change just separates them into distinct steps.
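The split might look like the following (a hedged sketch: only the compression path appears in the quoted diff; the other suite path is assumed from the directories named in the review comment):

```yaml
- name: "🔬 Running transformers compression tests"
  if: always() && steps.install.outcome == 'success'
  run: |
    pytest tests/llmcompressor/transformers/compression -v
- name: "🔬 Running oneshot tests"
  if: always() && steps.install.outcome == 'success'
  run: |
    pytest tests/llmcompressor/transformers/oneshot -v
```

With `if: always() && steps.install.outcome == 'success'`, each suite runs and reports its result even when an earlier step fails, as long as the install step succeeded.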

```diff
-device: "auto"
+device: "cuda:0"
```
Contributor


What was the error you were getting with `auto`? I think we want at least one test confirming that `auto` is functional.

Collaborator Author


"Unspecified CUDA launch errors"; this was on an A100.
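For context, `device: "auto"` defers placement to the framework (which may shard the model across visible GPUs), while `cuda:0` pins everything to a single, deterministic device. A minimal sketch of that distinction (the `resolve_device` helper and its fallback behavior are assumptions for illustration, not code from this repo):

```python
def resolve_device(device: str, cuda_available: bool) -> str:
    """Map a configured device string to a concrete placement target.

    "auto" leaves placement up to the framework and may shard across
    GPUs; pinning "cuda:0" makes the tests deterministic, which matters
    here because "auto" produced unspecified CUDA launch errors.
    """
    if device == "auto":
        # Fall back to a single explicit device instead of sharding.
        return "cuda:0" if cuda_available else "cpu"
    return device
```

Keeping one test on `auto` (as suggested above) would still exercise the framework-managed path, while the rest of the suite uses the explicit device.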

@dsikka dsikka requested a review from Satrat August 28, 2024 19:32
@dsikka dsikka merged commit 2135c4c into main Sep 5, 2024
6 of 7 checks passed
@dsikka dsikka deleted the swap_auto branch September 5, 2024 02:33
markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024
…ue with the W4A8 representation to have dynamic token for activations (vllm-project#114)