How to reproduce
1. Start a long-running task: `cf run-task dorifi -c "sleep 10000"`
2. Wait ~30 seconds; the job for the task fails.
3. The task is shown as failed:
    ❯ cf tasks dorifi
    Getting tasks for app dorifi in org org / space space as cf-admin...

    id                  name                                   state    start time                      command
    20250120143013390   3d6aea4f-d8c0-4545-b2e4-c2c3fa20d91c   FAILED   Mon, 20 Jan 2025 14:30:13 UTC   sleep 10000
Dev notes
Is that expected behaviour?
The job fails with the following condition:
    conditions:
    - lastProbeTime: "2025-01-20T14:30:52Z"
      lastTransitionTime: "2025-01-20T14:30:52Z"
      message: Job has reached the specified backoff limit
      reason: BackoffLimitExceeded
      status: "True"
      type: Failed
The following k8s event is recorded for the task's pod:
ExceededGracePeriod
    Name:             3d6aea4f-d8c0-4545-b2e4-c2c3fa20d91c-hc5ml.181c6d069ffa849a
    Namespace:        db3185c1-4d36-4391-b663-c1e96d39f84e
    Labels:           <none>
    Annotations:      <none>
    API Version:      v1
    Count:            1
    Event Time:       <nil>
    First Timestamp:  2025-01-20T14:30:31Z
    Involved Object:
      API Version:       v1
      Kind:              Pod
      Name:              3d6aea4f-d8c0-4545-b2e4-c2c3fa20d91c-hc5ml
      Namespace:         db3185c1-4d36-4391-b663-c1e96d39f84e
      Resource Version:  2892
      UID:               60af529d-30bd-46cb-ba8c-b2f17fa14368
    Kind:                Event
    Last Timestamp:      2025-01-20T14:30:31Z
    Message:             Container runtime did not kill the pod within specified grace period.
    Metadata:
      Creation Timestamp:  2025-01-20T14:30:31Z
      Resource Version:    2996
      UID:                 7f0aeb1e-0bb5-4b98-b0a9-c39fc57b9762
    Reason:                ExceededGracePeriod
    Reporting Component:   kubelet
    Reporting Instance:    e2e-control-plane
    Source:
      Component:  kubelet
      Host:       e2e-control-plane
    Type:         Warning
    Events:       <none>
We suspect the pod is being killed because the default `terminationGracePeriodSeconds` of 30 seconds applies to the job's pod. Note that we do not set that field in the job runner.
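For reference, here is a minimal sketch of where `terminationGracePeriodSeconds` sits in a Job manifest, in case someone wants to test the hypothesis by setting it explicitly. The job name, container name, image, `backoffLimit: 0`, and the 60-second value are all hypothetical and are not taken from what the job runner actually generates. Keep in mind that the grace period only bounds how long a pod gets to shut down once termination has already started, so changing it is an experiment rather than a confirmed fix.

```yaml
# Hypothetical Job manifest; names, image and values are illustrative only
# and do not reflect the manifest produced by the job runner.
apiVersion: batch/v1
kind: Job
metadata:
  name: example-task                      # hypothetical name
spec:
  backoffLimit: 0                         # assumption: no retries
  template:
    spec:
      restartPolicy: Never
      # Pod-level field; Kubernetes defaults it to 30 seconds when unset.
      terminationGracePeriodSeconds: 60   # hypothetical value
      containers:
      - name: workload                    # hypothetical container name
        image: busybox                    # hypothetical image
        command: ["sleep", "10000"]
```

If the task still fails at roughly the same point with a larger value, the grace period is probably not what triggers the termination.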
Note that in order to keep the job around, the following helm values are adjusted in deploy-on-kind: