Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skipping line 3 ('DCGM_FI_PROF_GR_ENGINE_ACTIVE'): metric not enabled #439

Open
SeungyeopShin opened this issue Jan 14, 2025 · 0 comments
Open
Labels
question Further information is requested

Comments

@SeungyeopShin
Copy link

SeungyeopShin commented Jan 14, 2025

Ask your question

Hi,
I installed dcgm-export with helm chart in k8s.
But I can't get DCGM_FI_PROF_GR_ENGINE_ACTIVE metric.

The log is as below.

time="2025-01-14T05:25:28Z" level=info msg="Starting dcgm-exporter"
time="2025-01-14T05:25:28Z" level=info msg="DCGM successfully initialized!"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting DCP metrics: This request is serviced by a module of DCGM that is not currently loaded"
time="2025-01-14T05:25:28Z" level=info msg="Falling back to metric file '/etc/dcgm-exporter/dcp-metrics-included.csv'"
time="2025-01-14T05:25:28Z" level=warning msg="Skipping line 3 ('DCGM_FI_PROF_GR_ENGINE_ACTIVE'): metric not enabled"
time="2025-01-14T05:25:28Z" level=info msg="Initializing system entities of type: GPU"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting NvSwitch metrics; no fields to watch for device type: 3"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting NvLink metrics; no fields to watch for device type: 6"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting CPU metrics; no fields to watch for device type: 7"
time="2025-01-14T05:25:28Z" level=info msg="Not collecting CPU Core metrics; no fields to watch for device type: 8"
time="2025-01-14T05:25:28Z" level=info msg="Kubernetes metrics collection enabled!"
time="2025-01-14T05:25:28Z" level=info msg="Pipeline starting"
time="2025-01-14T05:25:28Z" level=info msg="Starting webserver"
time="2025-01-14T05:25:28Z" level=info msg="Listening on" address="[::]:9400"
time="2025-01-14T05:25:28Z" level=info msg="TLS is disabled." address="[::]:9400" http2=false

My envrionment is:

  • GPU : A100
  • MIG : enabled
  • Driver : 515.105.01
  • Host : VM(XCP-ng), Passthrough (not vgpu)
  • dcgm-exporter version : 3.6.1

Any have an idea?

@SeungyeopShin SeungyeopShin added the question Further information is requested label Jan 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant