Fix #884: test for `nvidia-smi` instead of `nvcc` #918

mhucka · 2025-10-29T01:09:16Z

It seems that checking for the program `nvidia-smi` is the more reliable way to determine if a system has a usable NVIDIA GPU and the necessary drivers installed for running CUDA applications. Checking for `nvcc` may only tell you if the system can compile CUDA code, not necessarily if it can run it.

This PR also fixes a small portability issue: `command -v` is preferable to `which` for testing whether a program is available.
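A minimal sketch of the portable check, assuming a POSIX shell. The helper name `have_program` is hypothetical and not part of this PR, and `sh` stands in for the program being tested so the snippet runs anywhere:

```shell
# Hypothetical helper: succeeds if the shell can resolve the named program.
# `command -v` is specified by POSIX; `which` is a separate, non-standard tool.
have_program() {
    command -v "$1" > /dev/null 2>&1
}

if have_program sh; then
    echo "sh is available"
fi
```

Only the exit status is used here, so discarding the output of `command -v` is harmless in this form.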


sergeisakov · 2025-10-29T09:56:25Z

Makefile

+# an NVIDIA GPU. Checking for nvcc may only tell you if the system can compile
+# CUDA code, not necessarily if it can run it.)
+
+ifneq (,$(shell command -v nvidia-smi > /dev/null 2>&1))


There might be two issues here.

You discard both the standard output and the error output, so `$(shell ...)` always expands to an empty string and `ifneq (,$(shell command -v nvidia-smi > /dev/null 2>&1))` is always false.

It seems checking for `nvidia-smi` can also be unreliable. One can install `nvidia-utils` to get `nvidia-smi` (even on a machine that does not have an NVIDIA GPU) without installing CUDA, etc. But we need `nvcc` to compile the code.

> You discard both the standard output and the error output. So `ifneq (,$(shell command -v nvidia-smi > /dev/null 2>&1))` is always false.

Ack, you're right of course. That should have been only `2> /dev/null`. And of course, I only bothered to test it on a system that didn't have `nvidia-smi`, so the resulting behavior was what I expected. Anyway, thank you very much for catching that.
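A small shell sketch of the difference, using `sh` as a stand-in for `nvidia-smi` so that it runs on any machine. The variable names and the deliberately missing program name are made up for illustration:

```shell
# Both streams discarded: the captured text is always empty, which is why
# a Makefile test of the form `ifneq (,$(shell ...))` can never succeed.
both=$(command -v sh > /dev/null 2>&1)

# Only stderr discarded: the resolved path is captured when the program exists.
only_err=$(command -v sh 2> /dev/null)

# A program assumed not to exist yields empty output (and a non-zero status).
missing=$(command -v no-such-program-xyz-12345 2> /dev/null) || true

printf 'both=[%s] only_err=[%s] missing=[%s]\n' "$both" "$only_err" "$missing"
```

With only stderr redirected, the captured string is non-empty exactly when the program is found, which is what the `ifneq (,...)` comparison needs.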

> It seems checking for `nvidia-smi` can also be unreliable. One can install `nvidia-utils` to get `nvidia-smi` (even on a machine that does not have an NVIDIA GPU) without installing CUDA, etc. But we need `nvcc` to compile the code.

Darn it. I guess one would need to test for both `nvcc` and `nvidia-smi`, then grep the output of running `nvidia-smi` for an indication of a GPU. This is getting complicated :-(.
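A rough sketch of what such a combined check might look like, not something this PR implements. The function name `cuda_usable` is hypothetical, and the sketch assumes that `nvidia-smi -L` prints one line per device beginning with `GPU <index>:`:

```shell
# Hypothetical helper: succeeds only if the compiler, the driver utility,
# and at least one listed GPU are all present.
cuda_usable() {
    # The toolkit compiler is needed to build the CUDA code.
    command -v nvcc > /dev/null 2>&1 || return 1
    # The driver utility must be installed as well.
    command -v nvidia-smi > /dev/null 2>&1 || return 1
    # Assumption: `nvidia-smi -L` lists devices as lines like "GPU 0: ...".
    nvidia-smi -L 2> /dev/null | grep -q '^GPU [0-9]'
}

if cuda_usable; then
    echo "CUDA looks usable"
else
    echo "CUDA not detected"
fi
```

In a Makefile, the exit status could be turned into text for `ifneq`, for example by echoing a marker on success. Whether the extra complexity is worth it is the open question above.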

I'm going to close this PR and issue #884 and try to reproduce whatever led me to think it was an issue in the first place.

mhucka · 2025-10-29T15:12:21Z

This PR is flawed and does not even address the problem fully. Closing.

mhucka added 2 commits October 29, 2025 00:58

Use command -v instead of which

6f2bc01

github-actions bot added the size: S 10< lines changed <50 label Oct 29, 2025

mhucka mentioned this pull request Oct 29, 2025

The Makefile should test for nvidia-smi, not nvcc, to guess if CUDA available #884

Closed

Merge branch 'main' into mh-fix-884

c521e52

mhucka marked this pull request as ready for review October 29, 2025 02:06

mhucka requested review from fdmalone and sergeisakov October 29, 2025 02:07

sergeisakov requested changes Oct 29, 2025


mhucka closed this Oct 29, 2025
