add aws support to ai-inference in pkb #6201
base: master
Conversation
add aws support to ai-inference in pkb with temporary link to wg-serving custom branch
Resolved review threads on outdated diffs:
- perfkitbenchmarker/resources/kubernetes/wg_serving_inference_server.py (two threads)
- perfkitbenchmarker/data/container/kubernetes_ai_inference/serving_catalog_cli.yaml.j2
…repo and branch to flags, removed legacy list()
```python
flags.DEFINE_string(
    'wg_serving_repo_branch',
    'main',
```
Nice. I was wondering if you'd have to leave off values altogether with something like a Jinja `if`, but this totally makes sense as defaults.
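A minimal sketch of the two approaches this comment weighs: always rendering with flag defaults versus guarding template keys with a Jinja `if`. The template strings and registry value below are illustrative, not the actual `serving_catalog_cli.yaml.j2` contents.

```python
# Sketch only: flag defaults vs. a Jinja `if` guard.
import jinja2

# With flag defaults, the caller always passes a value, so the template
# can reference the variable unconditionally.
plain = jinja2.Template('image: {{ image_repo }}/server:latest')
print(plain.render(image_repo='registry.example.com'))  # hypothetical repo

# The alternative considered here: omit the value and guard the whole
# line inside the template instead.
guarded = jinja2.Template(
    '{% if image_repo %}image: {{ image_repo }}/server:latest{% endif %}'
)
print(guarded.render())  # image_repo unset -> renders an empty string
```

Relying on defaults keeps the template flat; the `if` guard only earns its keep when a key must be omitted entirely rather than merely defaulted.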
```python
self.cluster.ApplyManifest(
    'container/kubernetes_ai_inference/serving_catalog_cli.yaml.j2',
    image_repo=FLAG_IMAGE_REPO.value,
    wg_serving_repo_url=FLAGS.wg_serving_repo_url,
```
OK, flag location is always tricky though. Let's put the flags in this file (wg_serving_inference_server) and use flag holders, e.g. `REPO_URL = flags.DEFINE_string(...)` and then read `REPO_URL.value`.
Changed.
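A minimal sketch of the flag-holder pattern suggested above, using absl-py, whose `DEFINE_string` returns a `FlagHolder` so the module reads values through the holder instead of the global `FLAGS` registry. Flag names match the ones visible in this diff; the default URL, help strings, and the `_CloneArgs` helper are assumptions for illustration, not the actual PKB code.

```python
from absl import flags

_REPO_URL = flags.DEFINE_string(
    'wg_serving_repo_url',
    'https://github.com/kubernetes-sigs/wg-serving',  # assumed default
    'Git URL of the wg-serving repo to clone.',
)
_REPO_BRANCH = flags.DEFINE_string(
    'wg_serving_repo_branch',
    'main',
    'Branch of the wg-serving repo to check out.',
)


def _CloneArgs() -> list[str]:  # hypothetical helper for illustration
  """Builds git-clone arguments from the flag holders."""
  return ['--branch', _REPO_BRANCH.value, _REPO_URL.value]
```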
/gcbrun
…o-ai-inference-pkb
PiperOrigin-RevId: 836297609
```python
def _GetInferenceServerManifest(self) -> str:
  """Generates and retrieves the inference server manifest content."""
  # Ensure GPU capacity exists before scheduling GPU workloads.
```
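The comment in this hunk mentions ensuring GPU capacity before scheduling GPU workloads. As a hedged illustration only, not PKB's actual check, here is one way to verify that from outside the cluster, assuming `kubectl` is configured for the target cluster and GPUs are exposed via the standard `nvidia.com/gpu` resource:

```python
import json
import subprocess


def _ClusterHasGpuCapacity() -> bool:
  """Returns True if any node advertises allocatable nvidia.com/gpu."""
  out = subprocess.run(
      ['kubectl', 'get', 'nodes', '-o', 'json'],
      check=True, capture_output=True, text=True,
  ).stdout
  nodes = json.loads(out)['items']
  return any(
      int(node['status']['allocatable'].get('nvidia.com/gpu', '0')) > 0
      for node in nodes
  )
```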
On my end this looks merged in:
https://screenshot.googleplex.com/AREUuCaL76yAdQg
Ooh, OK, actually perhaps this has made it in but been superseded by follow-up PRs? https://screenshot.googleplex.com/BNVuvNLwNKckBF5 is the current state of wg_serving_inference_server.py, and it does have a cloud == 'AWS' section.
I suppose we simply close this PR?
add aws support to ai-inference in pkb with temporary link to wg-serving custom branch