Repositories list
588 repositories
- C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
- CUDA Core Compute Libraries
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- BioNeMo Framework: For building and adapting AI models in drug discovery at scale
- A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit floating point (FP8) precision on Hopper, Ada, and Blackwell GPUs, providing better performance with lower memory utilization in both training and inference.
- A Python framework for accelerated simulation, data generation and spatial computing.
- Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art physics-ML methods
- Documentation repository for NVIDIA Cloud Native Technologies
- NVIDIA Federated Learning Application Runtime Environment
- AIStore: scalable storage for AI applications
- Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
- NeMo Retriever extraction is a scalable, performance-oriented microservice for extracting document content and metadata. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative AI applications.