Add CUDA 13.0 Tests for CuFile I/O Operations #1060

chloechia4 · 2025-10-01T19:04:38Z

Description

This PR adds tests for the cuFile low-level bindings introduced in CUDA 13.0, together with the generated low-level bindings themselves.

CUDA 13.0 CuFile Operations

test_set_stats_level
test_stats_start
test_stats_stop
test_stats_reset
test_get_stats_l1
test_get_stats_l2
test_get_stats_l3
test_get_bar_size_in_kb
test_set_parameter_posix_pool_slab_array
test_set_get_parameter_size_t

Note: The original test_batch_io_large_operations() stopped passing after the switch from CUDA 12.9 to 13.0. The cause was that it submitted all operations (reads and writes) together in one batch, so the file reads could run before the writes. The reads then returned 0 bytes because no write I/O had occurred yet, and the test failed trivially. I split the test into two phases: the writes complete first in one batch handle, and the reads are then submitted in a second batch handle. The new test also passes with CUDA 12.9.

All tests passing across CUDA versions
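The two-phase ordering described in the note can be sketched without cuFile at all. The example below is only an illustration under stated assumptions: it uses plain POSIX `os.pwrite`/`os.pread` on a temporary file as a stand-in for the batch handles, and the names `write_phase`, `read_phase`, and `run_two_phase` are hypothetical, not taken from test_cufile.py.

```python
import os
import tempfile

BLOCK = 4096   # bytes per operation (illustrative value)
NUM_OPS = 4    # number of write/read pairs (illustrative value)


def write_phase(fd, blocks):
    """Phase 1: issue every write and check each one before moving on."""
    for i, data in enumerate(blocks):
        written = os.pwrite(fd, data, i * BLOCK)
        assert written == len(data), f"Write {i} was short: {written} bytes"


def read_phase(fd, count):
    """Phase 2: only started once every write has completed."""
    return [os.pread(fd, BLOCK, i * BLOCK) for i in range(count)]


def run_two_phase():
    # Each block has a distinct fill byte so mismatches are easy to spot.
    blocks = [bytes([i + 1]) * BLOCK for i in range(NUM_OPS)]
    with tempfile.TemporaryFile() as f:
        fd = f.fileno()
        # A read issued before any write sees an empty file and returns
        # 0 bytes, which is the failure mode described in the note.
        early = os.pread(fd, BLOCK, 0)
        write_phase(fd, blocks)
        results = read_phase(fd, NUM_OPS)
    return early, blocks, results


if __name__ == "__main__":
    early, blocks, results = run_two_phase()
    print(len(early), results == blocks)
```

In the real test the two phases would each use their own batch handle; the point here is only that the reads are not submitted until every write has been confirmed.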

copy-pr-bot · 2025-10-01T19:04:42Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

leofang · 2025-10-01T19:59:59Z

Thanks, Chloe! Pinging you internally...

cuda_bindings/cuda/bindings/_internal/cycufile.pxd

cuda_bindings/cuda/bindings/_internal/cycufile.pyx

leofang · 2025-10-14T18:42:31Z

cuda_bindings/cuda/bindings/cycufile.pxd

@chloechia4 any reason cuFileDriverClose_v2 is removed? I see this symbol still exists in the cuFile header. For cuda-bindings, the Cython layer (cyxxxxx.{pxd,pyx}) is considered a stable public API.

@chloechia4: I left a comment on why I think this is happening in your cybind MR. Addressing that, putting the changes here, and removing those extra files @leofang mentioned should hopefully do it. I haven't tried testing (I'm on WSL and it looks like cuFile doesn't work there).

Addressed this

cuda_bindings/tests/test_cufile.py

sourabgupta3 · 2025-10-14T19:21:38Z

cuda_bindings/tests/test_cufile.py

+            assert io_events[i].status == cufile.Status.COMPLETE, f"Write {i} failed with status {io_events[i].status}"

+        # Force file sync
+        os.fsync(fd)


This isn't needed.

sourabgupta3 · 2025-10-14T19:24:21Z

cuda_bindings/tests/test_cufile.py

+
+        # Verify that statistics data was written to the buffer
+        # Convert buffer to bytes and check that it's not all zeros
+        buffer_bytes = bytes(stats_buffer)


Rather than just checking buffer_bytes, can you verify by looking at the actual fields of the data structure (CUfileStatsLevel1_t)?

Good point. I added field checks in get_stats_l1/get_stats_l2/get_stats_l3. It appears that the Python bindings don't expose the CUfileStatsLevel*_t structures as ctypes classes that we can use directly, so I added equivalent Python classes.
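As a rough sketch of the field-level check discussed here, a raw stats buffer can be mapped onto a `ctypes.Structure` and inspected field by field. The layout below is an assumption made up for illustration: `ExampleOpsCounter` and `ExampleStats` are hypothetical and do not reproduce the real CUfileStatsLevel1_t definition.

```python
import ctypes


class ExampleOpsCounter(ctypes.Structure):
    # Hypothetical layout, NOT the real cuFile structure.
    _fields_ = [("ok", ctypes.c_uint64), ("err", ctypes.c_uint64)]


class ExampleStats(ctypes.Structure):
    # Hypothetical layout, NOT the real CUfileStatsLevel1_t.
    _fields_ = [("read_ops", ExampleOpsCounter), ("write_ops", ExampleOpsCounter)]


def make_example_buffer():
    """Build raw bytes the way a filled-in stats buffer might look."""
    stats = ExampleStats(ExampleOpsCounter(3, 0), ExampleOpsCounter(5, 1))
    return bytes(stats)


def parse_stats(raw):
    """Interpret the leading bytes of `raw` as an ExampleStats structure."""
    # from_buffer_copy raises ValueError if `raw` is smaller than the struct.
    return ExampleStats.from_buffer_copy(raw)


def verify_stats(raw):
    """Check individual fields rather than only 'buffer is not all zeros'."""
    stats = parse_stats(raw)
    total_ok = stats.read_ops.ok + stats.write_ops.ok
    total_err = stats.read_ops.err + stats.write_ops.err
    assert total_ok > 0, "expected at least one successful operation"
    return {"ok": total_ok, "err": total_err}


if __name__ == "__main__":
    print(verify_stats(make_example_buffer()))
```

Compared with a non-zero byte check, this catches a buffer that was written with the wrong layout or with fields in unexpected positions.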

sourabgupta3 · 2025-10-14T19:25:42Z

cuda_bindings/cuda/bindings/cufile.pyx

    check_status(__status__)

+
+cpdef get_parameter_min_max_value(int param, intptr_t min_value, intptr_t max_value):


Add tests for this API as well.
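A test for this API might follow the out-parameter pattern implied by the signature above, where `min_value` and `max_value` are addresses the callee writes to. The sketch below is hedged accordingly: `fake_get_parameter_min_max_value` is a hypothetical stand-in with the same argument shape, the parameter id and range are invented, and treating the targets as `size_t` is an assumption.

```python
import ctypes


def fake_get_parameter_min_max_value(param, min_value, max_value):
    """Hypothetical stand-in for the binding: writes a range to two addresses."""
    ranges = {0: (1, 64)}  # made-up parameter id and limits
    lo, hi = ranges[param]
    # Assumption: the pointed-to values are size_t.
    ctypes.c_size_t.from_address(min_value).value = lo
    ctypes.c_size_t.from_address(max_value).value = hi


def query_min_max(get_min_max, param):
    """Allocate the two outputs, pass their addresses, and read them back."""
    min_val = ctypes.c_size_t(0)
    max_val = ctypes.c_size_t(0)
    get_min_max(param, ctypes.addressof(min_val), ctypes.addressof(max_val))
    return min_val.value, max_val.value


def check_min_max(get_min_max, param):
    """The kind of assertion a test could make on the returned range."""
    lo, hi = query_min_max(get_min_max, param)
    assert lo <= hi, f"min {lo} exceeds max {hi} for parameter {param}"
    return lo, hi


if __name__ == "__main__":
    print(check_min_max(fake_get_parameter_min_max_value, 0))
```

Passing the getter in as an argument keeps the helper usable with the real binding once the parameter enum and pointer types are confirmed.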

mdboom · 2025-10-15T15:14:20Z

/ok to test

copy-pr-bot · 2025-10-15T15:14:23Z

/ok to test

@mdboom, there was an error processing your request: E1

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/

chloechia4 added 6 commits October 1, 2025 17:39

Add 13.0 Tests

d2e782e

Add first set of generated cybind bindings

251e888

Add _internal cybind generated bindings

3ea65c8

Remove overriding tests

343c4c7

Add previously deleted test

d29340a

Simplify test_batch_io_large_operations

ef73d2e

leofang requested review from cpcloud and mdboom October 1, 2025 23:03

leofang assigned chloechia4 Oct 1, 2025

leofang added the P0 (High priority - Must do!), feature (New feature or request), and cuda.bindings (Everything related to the cuda.bindings module) labels Oct 1, 2025

fmt

ffe71b5

leofang added this to the cuda-python parking lot milestone Oct 1, 2025

add _internal bindings

01a4b71

chloechia4 force-pushed the main branch from be92020 to 01a4b71 October 13, 2025 21:11

chloechia4 added 4 commits October 13, 2025 21:12

add main bindings

1193c05

fmt

202a206

test fmt

e2a7e4b

bindings

dad236b

leofang requested changes Oct 14, 2025


leofang reviewed Oct 14, 2025


sourabgupta3 reviewed Oct 14, 2025


remove files

cdee97d

chloechia4 force-pushed the main branch from 1d2ac75 to cdee97d October 14, 2025 23:47

remove unnecessary write check

902406b

chloechia4 added 7 commits October 15, 2025 18:09

add field checks for stats

2beaf11

add test for get_parameter_min_max and add pytest marker

9883354

reference exposed classes for stats tests

60728da

add OpsCounter and GPUStats

8ff0764

add new driver pxd and pyx files

73ece8b

add right bindings

98b2c95

Merge branch 'main' into main

a45ea07


4 participants
