[WIP][AQUA] GPU Shape Recommendation #1221

elizjo · 2025-07-07T20:41:19Z

Wrote an additional POST API and aqua command for recommending GPU shapes for a particular model

ads aqua recommend shapes --model_ocid 'ocid1.datasciencemodel.oc1.<ocid>' ""

Returns

{
  "model_ocid" : "ocid1.datasciencemodel.oc1.<ocid>"
}

Returns

{                                                                                                                                                                                                                                            
    "display_name": "Almawave/Velvet-14B",
    "recommendations": [
        {
            "shape_details": {
                "available": false,
                "core_count": null,
                "memory_in_gbs": null,
                "name": "BM.GPU.MI300X.8",
                "shape_series": "GPU",
                "gpu_specs": {
                    "gpu_memory_in_gbs": 1536,
                    "gpu_count": 8,
                    "gpu_type": "MI300X",
                    "quantization": [
                        "fp8",
                        "gguf"
                    ],
                    "ranking": {
                        "cost": 90,
                        "performance": 90
                    }
                }
            },
            "configurations": [
                {
                    "model_details": {
                        "model_size_gb": 28.16,
                        "kv_cache_size_gb": 26.84,
                        "total_model_gb": 55.0
                    },
                    "deployment_params": {
                        "quantization": "bfloat16",
                        "max_model_len": 131072,
                        "params": ""
                    },
                    "recommendation": "No override PARAMS needed. \n\nModel fits well within the allowed compute shape (55.0GB used / 1536.0GB allowed)."
                }
            ]
        }
    ],
     "troubleshoot": ""
}

Status: business logic works, API works, unit tests finished, rich diff CLI table finished

github-actions · 2025-07-07T21:10:04Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-07-08T23:16:28Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-07-25T18:00:09Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-07-29T19:33:45Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-07-30T18:54:42Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-07-31T18:50:31Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2025-08-01T17:50:01Z

📌 Cov diff with main:

📌 Overall coverage:

mrDzurb · 2025-08-01T17:24:07Z

ads/aqua/resources/gpu_shapes_index.json

    },
    "VM.GPU.A10.1": {
      "gpu_count": 1,
      "gpu_memory_in_gbs": 24,
-      "gpu_type": "A10"
+      "gpu_type": "A10",


Let's add FP8 for the A10 shapes as well.

mrDzurb · 2025-08-01T17:31:29Z

ads/aqua/common/utils.py

@@ -1287,6 +1287,7 @@ def load_gpu_shapes_index(

    # Merge: remote shapes override local
    local_shapes = local_data.get("shapes", {})
+    remote_data = {}


Why do we need this?

mrDzurb · 2025-08-01T17:37:41Z

ads/aqua/extension/__init__.py

@@ -13,6 +13,7 @@
 from ads.aqua.extension.evaluation_handler import __handlers__ as __eval_handlers__
 from ads.aqua.extension.finetune_handler import __handlers__ as __finetune_handlers__
 from ads.aqua.extension.model_handler import __handlers__ as __model_handlers__
+from ads.aqua.extension.recommend_handler import __handlers__ as __gpu_handlers__


Maybe we can name it as __shape_handler?

mrDzurb · 2025-08-01T18:31:03Z

ads/aqua/shaperecommend/llm_config.py

+        Detects quantization bit-width as a string (e.g., '4bit', '8bit') from Hugging Face config dict.
+        """
+        if raw.get("load_in_8bit"):
+            return "8bit"


It would be better to move this into constants.

mrDzurb · 2025-08-01T18:31:27Z

ads/aqua/shaperecommend/llm_config.py

+        If model is un-quantized, uses the weight size.
+        If model is pre-quantized, uses the quantization level.
+        """
+        key = (self.quantization or self.weight_dtype or "float32").lower()


Let's move "float32" to constants

mrDzurb · 2025-08-01T18:32:04Z

ads/aqua/shaperecommend/llm_config.py

+        """
+        vals = []
+        curr = min_len
+        max_seq_len = 16384 if not self.max_seq_len else self.max_seq_len


Let's move the numbers like 16384 to constants and add some description there

github-actions · 2025-08-02T00:17:37Z

📌 Cov diff with main:

📌 Overall coverage:

elizjo requested review from darenr, mayoor, mrDzurb, VipulMascarenhas, qiuosier and ahosler as code owners July 7, 2025 20:41

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Jul 7, 2025

mrDzurb changed the title ~~GPU Shape Recommendation~~ [AQUA] GPU Shape Recommendation Jul 7, 2025

mrDzurb changed the title ~~[AQUA] GPU Shape Recommendation~~ [WIP][AQUA] GPU Shape Recommendation Jul 15, 2025

elizjo added 4 commits July 25, 2025 10:27

inital code for GPU Shape Recommendator

18a92c4

modifications to handler

bd026e7

init implementation for gpu recommendations

4461af7

fixed docstrings and unused imports

26e08a2

elizjo force-pushed the ODSC-74228/GPU-Shape-Recommendation branch from 2f54f8b to 26e08a2 Compare July 25, 2025 17:29

added unit tests

7ce57c8

added rich diff table

a17b035

fixed unit tests

e94b6f1

Merge branch 'main' into ODSC-74228/GPU-Shape-Recommendation

fbfdb91

mrDzurb reviewed Aug 1, 2025

View reviewed changes

addressed comments, fixed rich diff table

ba605ee

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP][AQUA] GPU Shape Recommendation #1221

[WIP][AQUA] GPU Shape Recommendation #1221

elizjo commented Jul 7, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 7, 2025

Uh oh!

github-actions bot commented Jul 8, 2025

Uh oh!

github-actions bot commented Jul 25, 2025

Uh oh!

github-actions bot commented Jul 29, 2025

Uh oh!

github-actions bot commented Jul 30, 2025

Uh oh!

github-actions bot commented Jul 31, 2025

Uh oh!

github-actions bot commented Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Uh oh!

github-actions bot commented Aug 2, 2025

Uh oh!

Uh oh!

[WIP][AQUA] GPU Shape Recommendation #1221

Are you sure you want to change the base?

[WIP][AQUA] GPU Shape Recommendation #1221

Conversation

elizjo commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 7, 2025

Uh oh!

github-actions bot commented Jul 8, 2025

Uh oh!

github-actions bot commented Jul 25, 2025

Uh oh!

github-actions bot commented Jul 29, 2025

Uh oh!

github-actions bot commented Jul 30, 2025

Uh oh!

github-actions bot commented Jul 31, 2025

Uh oh!

github-actions bot commented Aug 1, 2025

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

mrDzurb Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 2, 2025

Uh oh!

Uh oh!

elizjo commented Jul 7, 2025 •

edited

Loading