Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add SYS_ADMIN capability requirement for docker run #427

Merged
merged 1 commit into from
Dec 2, 2024

Conversation

kei-yamazaki
Copy link
Contributor

When starting without --cap-add SYS_ADMIN, an error message is displayed prompting to add the SYS_ADMIN capability. Therefore, I updated the instructions in the README to align with this requirement.

$ docker run --gpus all --rm -p 9400:9400 nvcr.io/nvidia/k8s/dcgm-exporter:3.3.9-3.6.1-ubuntu22.04
Warning #2: dcgm-exporter doesn't have sufficient privileges to expose profiling metrics. To get profiling metrics with dcgm-exporter, use --cap-add SYS_ADMIN
2024/11/30 07:41:50 maxprocs: Leaving GOMAXPROCS=9: CPU quota undefined
time="2024-11-30T07:41:50Z" level=info msg="Starting dcgm-exporter"
time="2024-11-30T07:41:51Z" level=info msg="DCGM successfully initialized!"
time="2024-11-30T07:41:51Z" level=info msg="Collecting DCP Metrics"
time="2024-11-30T07:41:51Z" level=info msg="Falling back to metric file '/etc/dcgm-exporter/default-counters.csv'"
time="2024-11-30T07:41:51Z" level=info msg="Initializing system entities of type: GPU"
time="2024-11-30T07:41:51Z" level=info msg="Not collecting NvSwitch metrics; no fields to watch for device type: 3"
time="2024-11-30T07:41:51Z" level=info msg="Not collecting NvLink metrics; no fields to watch for device type: 6"
time="2024-11-30T07:41:51Z" level=info msg="Not collecting CPU metrics; no fields to watch for device type: 7"
time="2024-11-30T07:41:51Z" level=info msg="Not collecting CPU Core metrics; no fields to watch for device type: 8"
time="2024-11-30T07:41:51Z" level=fatal msg="Failed to watch metrics: Error watching fields: Host engine is running as non-root"

Add --cap-add SYS_ADMIN flag to docker run command in quickstart section,
as it is required for DCGM-Exporter to function properly.
@glowkey glowkey merged commit 900d465 into NVIDIA:main Dec 2, 2024
1 check passed
@kei-yamazaki kei-yamazaki deleted the cap-add-sys-admin branch December 3, 2024 01:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants