WIP: GHA test with model revisions #499

ckadner · 2025-10-02T09:29:35Z

Description

Cache models with revision for GitHub Action test runs.

TODOs:

rebase this PR onto main (currently based on @prashantgupta24 branch)
--> [CI] Enable model revisions in GHA test #523

load model with revision in unit tests (not revision: main)
https://github.com/vllm-project/vllm-spyre/actions/runs/18189810233/job/51782179337?pr=499#step:14:258

> ☑️ Download HF models
Downloading 'ibm-ai-platform/micro-g3.3-8b-instruct-1b' with revision '2714578f54cfb744ece40df9326ee0b47e879e03' ...


> ❌ Runt tests
...
head_call_error = OfflineModeIsEnabled("Cannot access file since 'local_files_only=True' as been set. (repo_id: ibm-ai-platform/micro-g3.3-8b-instruct-1b, repo_type: model, revision: main, filename: config.json)")

Related Issues

#497

Signed-off-by: Prashant Gupta <[email protected]>

Signed-off-by: Christian Kadner <[email protected]>

joerunde · 2025-10-06T22:19:17Z

.github/workflows/test.yml


-          download_tinygranite() {
-            python -c "from transformers import pipeline, AutoTokenizer; pipeline('text-generation', model='$1'); tokenizer=AutoTokenizer.from_pretrained('$1')"
+          download_granite_or_llama() {


I think all of this can be simplified to a single function like:

from transformers import pipeline from sentence_transformers import SentenceTransformer try: pipeline(model='$1', revision='$2') except RuntimeError: SentenceTransformer('$1', revision='$2')

I'd be fine sticking that in a script that we keep around here in the .github/workflows folder.

python3 ./just_download_the_dang_model.py --model foo --revision bar

That should get rid of a lot of the complexity here. Otherwise these changes all look great to me- putting the revisions in the cache key and splitting out the multi-model cases to have individual cache entries is exactly what we want. Sure hf_model_2 is a little wonky but I can live with it.

Signed-off-by: Christian Kadner <[email protected]>

ckadner · 2025-10-13T18:04:00Z

Need to rebase this PR onto main (currently it's based on top of Prashant's branch)

Signed-off-by: Christian Kadner <[email protected]>

ckadner · 2025-10-13T20:24:39Z

Closing this. Changes from this PR are rebased onto main in Pr #523

prashantgupta24 and others added 7 commits October 1, 2025 11:41

⏪ revert back to 3-layer micro model

87b339d

Signed-off-by: Prashant Gupta <[email protected]>

🐛 download the right revision

7174818

Signed-off-by: Prashant Gupta <[email protected]>

🎨 fmt

5aeeb88

Signed-off-by: Prashant Gupta <[email protected]>

🐛 download the right model

38acec8

Signed-off-by: Prashant Gupta <[email protected]>

🚧 flip condition so that we download the right model

49c5077

Signed-off-by: Prashant Gupta <[email protected]>

run test on all branches

61391b5

Signed-off-by: Christian Kadner <[email protected]>

disentangle model caches, add model revisions

7b1eae0

Signed-off-by: Christian Kadner <[email protected]>

ckadner requested review from joerunde, prashantgupta24, rafvasq and sducouedic as code owners October 2, 2025 09:29

ckadner mentioned this pull request Oct 2, 2025

⏪ revert back to 3-layer micro model #497

Closed

ckadner marked this pull request as draft October 2, 2025 09:34

ckadner added 2 commits October 2, 2025 02:45

test with revisions

bd88026

Signed-off-by: Christian Kadner <[email protected]>

shellcheck

db65341

Signed-off-by: Christian Kadner <[email protected]>

vllm-project deleted a comment from github-actions bot Oct 2, 2025

ckadner mentioned this pull request Oct 3, 2025

[Tests] Model revision parameter not used consistently #503

Closed

joerunde reviewed Oct 6, 2025

View reviewed changes

ckadner added 2 commits October 8, 2025 23:32

move model download logic into a Python script

8a52aa7

Signed-off-by: Christian Kadner <[email protected]>

revert revision for tiny granite

7e717c2

Signed-off-by: Christian Kadner <[email protected]>

test without cache

fc22751

Signed-off-by: Christian Kadner <[email protected]>

ckadner mentioned this pull request Oct 13, 2025

[CI] Enable model revisions in GHA test #523

Merged

ckadner closed this Oct 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: GHA test with model revisions #499

WIP: GHA test with model revisions #499

Uh oh!

ckadner commented Oct 2, 2025 •

edited

Loading

Uh oh!

joerunde Oct 6, 2025

Uh oh!

ckadner Oct 13, 2025

Uh oh!

ckadner commented Oct 13, 2025

Uh oh!

ckadner commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WIP: GHA test with model revisions #499

WIP: GHA test with model revisions #499

Uh oh!

Conversation

ckadner commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

TODOs:

Related Issues

Uh oh!

joerunde Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

ckadner Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

ckadner commented Oct 13, 2025

Uh oh!

ckadner commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ckadner commented Oct 2, 2025 •

edited

Loading