Examples should come with health and readiness checks #685

Jeffwan · 2025-02-16T19:02:52Z

🚀 Feature Description and Motivation

Currently, the pod becomes ready immediately, but the application still takes a long time to load, so requests sent to the model server during that window fail. We used to have such settings, but we recently removed them for simplicity.

Use Case

For stable deployments.

Proposed Solution

No response

Jeffwan · 2025-02-18T00:30:17Z

Please focus on the samples folder.

vivek-orbi · 2025-02-27T08:06:06Z

I'm willing to take this up.

Based on the samples, here's my understanding of the solution for your requirement:

Problem: Pod becomes ready immediately while the application/model is still loading, causing failed requests.
Proposed Solution: Implement health and readiness probes with appropriate delays:

livenessProbe:
  httpGet:
    path: /health
    port: 8000
  initialDelaySeconds: 120
  periodSeconds: 5
  timeoutSeconds: 1
  failureThreshold: 3

readinessProbe:
  httpGet:
    path: /health
    port: 8000
  initialDelaySeconds: 120
  periodSeconds: 5
  timeoutSeconds: 1
  failureThreshold: 5

Key Settings:

120 seconds initial delay to account for model loading time
Same /health endpoint for both probes
Different failure thresholds (3 for liveness, 5 for readiness)

Is my understanding correct that:

Your main issue is premature traffic routing before the model is fully loaded?
The 120-second initial delay would be sufficient for your model loading time?
You're using a setup similar to the samples (vLLM or similar serving framework)?

Please let me know if any of these assumptions need adjustment for your specific use case.

jolfr · 2025-02-28T23:05:21Z

The Quickstart Model Sample already includes checks, but they are too tight for the current model download. 120 seconds is not enough. Going to log an issue and will link it here.
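
One way to loosen the checks without guessing a single fixed delay is a Kubernetes startupProbe, which holds off the liveness and readiness probes until it succeeds. The fragment below is only a sketch: the /health path and port 8000 come from the proposal earlier in this thread, while the 10-second period and the threshold of 60 are illustrative assumptions, not values taken from the samples, and would need tuning to the actual model download time.

```yaml
# Sketch under stated assumptions; periodSeconds and failureThreshold
# on the startup probe are illustrative, not values from the samples.
startupProbe:
  httpGet:
    path: /health          # endpoint taken from the proposal in this thread
    port: 8000
  periodSeconds: 10
  failureThreshold: 60     # up to 60 * 10s = 600s for download and model load

# Liveness and readiness only begin once the startup probe has succeeded,
# so they no longer need a large initialDelaySeconds.
livenessProbe:
  httpGet:
    path: /health
    port: 8000
  periodSeconds: 5
  timeoutSeconds: 1
  failureThreshold: 3

readinessProbe:
  httpGet:
    path: /health
    port: 8000
  periodSeconds: 5
  timeoutSeconds: 1
  failureThreshold: 5
```

With this shape, the startup budget is failureThreshold times periodSeconds, so a slow download delays readiness instead of triggering a liveness restart, and a fast start is not held back by a fixed initial delay.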

jolfr · 2025-02-28T23:28:03Z

See #772

Jeffwan mentioned this issue Feb 24, 2025

v0.3.0 roadmap #698

Open

41 tasks

Jeffwan added this to the v0.3.0 milestone Feb 26, 2025
