You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I installed dcgm-export with helm chart in k8s.
But I can't get DCGM_FI_PROF_GR_ENGINE_ACTIVE metric.
The log is as below.
time="2025-01-14T05:25:28Z" level=info msg="Starting dcgm-exporter"
time="2025-01-14T05:25:28Z" level=info msg="DCGM successfully initialized!"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting DCP metrics: This request is serviced by a module of DCGM that is not currently loaded"
time="2025-01-14T05:25:28Z" level=info msg="Falling back to metric file '/etc/dcgm-exporter/dcp-metrics-included.csv'"
time="2025-01-14T05:25:28Z" level=warning msg="Skipping line 3 ('DCGM_FI_PROF_GR_ENGINE_ACTIVE'): metric not enabled"
time="2025-01-14T05:25:28Z" level=info msg="Initializing system entities of type: GPU"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting NvSwitch metrics; no fields to watch for device type: 3"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting NvLink metrics; no fields to watch for device type: 6"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting CPU metrics; no fields to watch for device type: 7"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting CPU Core metrics; no fields to watch for device type: 8"
time="2025-01-14T05:25:28Z" level=info msg="Kubernetes metrics collection enabled!"
time="2025-01-14T05:25:28Z" level=info msg="Pipeline starting"
time="2025-01-14T05:25:28Z" level=info msg="Starting webserver"
time="2025-01-14T05:25:28Z" level=info msg="Listening on" address="[::]:9400"
time="2025-01-14T05:25:28Z" level=info msg="TLS is disabled." address="[::]:9400" http2=false
My envrionment is:
GPU : A100
MIG : enabled
Driver : 515.105.01
Host : VM(XCP-ng), Passthrough (not vgpu)
dcgm-exporter version : 3.6.1
Any have an idea?
The text was updated successfully, but these errors were encountered:
Ask your question
Hi,
I installed dcgm-export with helm chart in k8s.
But I can't get DCGM_FI_PROF_GR_ENGINE_ACTIVE metric.
The log is as below.
My envrionment is:
Any have an idea?
The text was updated successfully, but these errors were encountered: