Update DCGM-Exporter documentation
Add split containers use case, dcgm-exporter options and helm options

Signed-off-by: Douglas Wightman <[email protected]>
glowkey committed Oct 13, 2021
1 parent 872dc27 commit 16c8c99
Showing 17 changed files with 708 additions and 577 deletions.
2 changes: 0 additions & 2 deletions contents.rst
@@ -49,7 +49,6 @@ using NVIDIA GPUs with Kubernetes.
openshift/cluster-entitlement.rst
openshift/install-nfd.rst
openshift/install-gpu-ocp.rst
openshift/mig-ocp.rst
openshift/clean-up.rst
openshift/troubleshooting-gpu-ocp.rst

@@ -60,7 +59,6 @@ using NVIDIA GPUs with Kubernetes.
kubernetes/install-k8s.rst
kubernetes/mig-k8s.rst
kubernetes/anthos-guide.rst
kubernetes/dcgme2e

.. toctree::
:maxdepth: 2
716 changes: 707 additions & 9 deletions gpu-telemetry/dcgm-exporter.rst

Large diffs are not rendered by default.

Binary file added gpu-telemetry/graphics/dcgm-exporter_embedded.png
Binary file added gpu-telemetry/graphics/dcgm_and_dcgm-exporter.png
565 changes: 0 additions & 565 deletions kubernetes/dcgme2e.rst

This file was deleted.

2 changes: 1 addition & 1 deletion kubernetes/install-k8s.rst
@@ -934,5 +934,5 @@ And check the logs of the ``gpu-operator-test`` pod:
GPU Telemetry
^^^^^^^^^^^^^^

-Refer to the `DCGM-Exporter <https://docs.nvidia.com/datacenter/cloud-native/kubernetes/dcgme2e.html#gpu-telemetry>`_ documentation
+Refer to the `DCGM-Exporter <https://docs.nvidia.com/datacenter/cloud-native/gpu-telemetry/dcgm-exporter.html#integrating-gpu-telemetry-into-kubernetes.html>`_ documentation
to get started with integrating GPU metrics into a Prometheus monitoring system.
