Conversation

@ckadner (Collaborator) commented Oct 2, 2025

Description

Draft PR #497 has all tests passing because it uses both tiny Granite models:

  • the 4-layer model from the cache
  • the 3-layer model downloaded with a specific revision

This PR checks the test results when bypassing the cache (i.e., not using the 4-layer model).

Related Issues

#497

Signed-off-by: Christian Kadner <[email protected]>
@vllm-project vllm-project deleted a comment from github-actions bot Oct 2, 2025
- name: "Restore HF models cache"
  id: cache_restore
  # skip the cache restore for the "micro" model so it is downloaded fresh
  if: steps.changed-src-files.outputs.any_changed == 'true' && !contains(env.model_key, 'micro')

Okay I think I see what you're trying to do here

ckadner and others added 3 commits October 2, 2025 15:17
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 (Collaborator)

Going to push commits to this PR to test the model.revision bug fixes!

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@ckadner (Collaborator, Author) commented Oct 3, 2025

> Going to push commits to this PR to test the model.revision bug fixes!

Cool! All tests green 💯

@prashantgupta24 (Collaborator)

So my plan at this time is to open a new PR with the latest commits from this PR, which fix the model revision used in the tests. Once we get that in, we can discuss whether we even want to revert to the 3-layer model or not.
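For context, the revision fix being discussed typically amounts to pinning an exact Hugging Face Hub revision when loading the model in tests, so the intended tiny model is fetched rather than whatever happens to be cached. A minimal sketch of that pattern (the helper name, repo id, and revision below are illustrative assumptions, not taken from the PR):

```python
from typing import Optional


def pretrained_kwargs(repo_id: str, revision: Optional[str] = None) -> dict:
    """Build kwargs for a `from_pretrained(...)` call, pinning an exact Hub
    revision (tag, branch, or commit SHA) when one is given, so tests load
    the intended tiny model instead of an unpinned default."""
    kwargs = {"pretrained_model_name_or_path": repo_id}
    if revision is not None:
        kwargs["revision"] = revision
    return kwargs


# Hypothetical usage (model id and revision are placeholders):
# AutoModelForCausalLM.from_pretrained(
#     **pretrained_kwargs("org/tiny-granite-model", revision="main"))
```

Passing `revision` through to `from_pretrained` is the standard transformers mechanism for pinning a model snapshot; without it, the loader falls back to the default branch and may silently reuse a cached, different-layer-count model.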

@ckadner ckadner closed this Oct 15, 2025