Releases: NVIDIA/cuda-python
CUDA Python 12.9.2
CUDA Python 13.0.0
cuda.core v0.3.2
cuda.core
v0.3.2 release announcement
Release note
All functionalities are currently hosted under the cuda.core.experimental
namespace. Once the features become stable they will be moved out of experimental
.
Documentation
Sample codes
What's Changed
- Fix CI build-time parallelism + support the same env var in
cuda.core
by @leofang in #743 - [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in #746
SPDX-FileCopyrightText
cleanup by @rwgk in #753- Construct instance of appropriate class in Event._init and Stream._init by @shwina in #752
- Fix typos and grammar in documentation by @cluster2600 in #774
- Update
cuda.bindings
to 13.0.0 by @leofang in #792 - getting started example is complete by @richardhboyd in #797
- Make
Device.set_current()
faster by @leofang in #781 - Bump cuda.core to 0.3.2 by @leofang in #817
New Contributors
- @cluster2600 made their first contribution in #774
- @richardhboyd made their first contribution in #797
Full Changelog: cuda-core-v0.3.1...cuda-core-v0.3.2
CUDA Python 12.9.1
cuda-pathfinder v1.1.0
- CTK 13.0.0 compatibility
- Bug fix: load
libnvJitLink.so.12
from conda, not/usr/local/cuda
PR #767
cuda-pathfinder v1.0.0
First release of cuda-pathfinder
as a stand-alone module.
cuda.pathfinder
replaces cuda.bindings.path_finder
, which was released with cuda-bindings
12.9.0
and is now deprecated.
Note that cuda-pathfinder
is a noarch package and has no dependencies (other than a Python 3.9+ interpreter).
Please see cuda/pathfinder/README for more information.
cuda.core v0.3.1
cuda.core
v0.3.1 release announcement
Release note
All functionalities are currently hosted under the cuda.core.experimental
namespace. Once the features become stable they will be moved out of experimental
.
Documentation
Sample codes
What's Changed
- Bump github/codeql-action from 3.28.19 to 3.29.0 by @dependabot in #710
- Fix Windows build CI by @leofang in #713
- Bump pypa/cibuildwheel from 2.23.3 to 3.0.0 by @dependabot in #711
- Ensure correct handling of buffers allocated with
LegacyPinnedMemoryResource.allocate
as kernel parameters by @shwina in #717 - Fix nvbugpro 5348750 by @oleksandr-pavlyk in #725
- Add a "Getting Started" page to the documentation by @shwina in #720
- Bump korthout/backport-action from 3.2.0 to 3.2.1 by @dependabot in #738
- Bump github/codeql-action from 3.29.0 to 3.29.2 by @dependabot in #737
- cuda_core/tests/test_event.py::test_timing_success WSL compatibility by @rwgk in #740
- Restore option to run testing without cupy installed. by @rwgk in #741
- Cythonize away some perf hot spots by @leofang in #709
- cuda_core forward compatibility changes. by @rwgk in #722
- Update docs for v0.3.1 release by @leofang in #695
Full Changelog: cuda-core-v0.3.0...cuda-core-v0.3.1
cuda.core v0.3.0
cuda.core
v0.3.0 release announcement
Release note
All functionalities are currently hosted under the cuda.core.experimental
namespace. Once the features become stable they will be moved out of experimental
.
Documentation
Sample codes
What's Changed
- cuda.core:
CUResult
,cudaError
explanations by @rwgk in #503 - DOC: Add admonition to docstrings for cuda.core handle properties by @carterbox in #573
- NEW: Make event timing error messages more specific and actionable by @carterbox in #559
- Change
cuda.core
license to Apache-2.0 & make contributing guides clear by @leofang in #583 - Add lint instructions by @msaroufim in #581
- fix indexing bug in saxpy.py by @msaroufim in #582
- Consolidate shared info between README.md and DESCRIPTION.rst by @vzhurba01 in #590
- PyTorch example by @msaroufim in #579
- Address remaining OSRB requests + document known installation issues by @leofang in #626
- Initial version of pre-commit "Check SPDX-License-Identifier" by @rwgk in #625
- Implement Kernel.num_arguments, and Kernel.arguments_info by @oleksandr-pavlyk in #612
- Move dependencies from requirements.txt to an optional packaging extra by @kkraus14 in #638
- Always build and run Cython tests + other CI improvements by @leofang in #640
- Implement device and context properties for Event by @NaderAlAwar in #618
- Enable serialization/deserialization of
ObjectCode
instances by @brandon-b-miller in #660 - Add more
ObjectCode
constructors by @brandon-b-miller in #652 - Unify Common CI Code for Windows and Linux by @cryos in #645
- MNT: Bump DLPack header to 1.1 by @leofang in #667
- Add tests to cover scalar handling in
launch()
+ Fix fp16 bug by @leofang in #669 - Feature/occupancy by @oleksandr-pavlyk in #648
- Repair Windows wheels by @leofang in #673
- Migrate to
windows-2022
for Windows CI builds by @cryos in #672 - Bugfix/multiple ptxas options values by @oleksandr-pavlyk in #678
- Clean up cffi resources in file by @oleksandr-pavlyk in #679
- Support cooperative launch by @leofang in #676
- Allow
ObjectCode
to have a name by @leofang in #682 - Make compute-sanitizer not report API errors as errors by @leofang in #687
- Update the notes on the CCCL and nvmath-python projects by @leofang in #688
- Switch to use CUDA driver APIs in
Device
constructor by @leofang in #460 - Bump github/codeql-action from 3.28.18 to 3.28.19 by @dependabot in #700
- Bump conda-incubator/setup-miniconda from 3.1.1 to 3.2.0 by @dependabot in #701
- Add phase 1 of CUDA Graphs support by @vzhurba01 in #455
- Make a few memory management objects public + Miscellaneous doc updates by @leofang in #693
- Bump
cuda.core
to v0.3.0 by @leofang in #703
New Contributors
- @gmarkall made their first contribution in #522
- @kkraus14 made their first contribution in #549
- @pre-commit-ci made their first contribution in #552
- @msaroufim made their first contribution in #581
- @cryos made their first contribution in #555
- @dependabot made their first contribution in #700
Full Changelog: cuda-core-v0.2.0...cuda-core-v0.3.0