NVIDIA Corporation
- 27k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Pinned Loading
Repositories
- Model-Optimizer Public
A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
NVIDIA/Model-Optimizer’s past year of commit activity - NeMo-Retriever Public
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
NVIDIA/NeMo-Retriever’s past year of commit activity - TensorRT-LLM Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
NVIDIA/TensorRT-LLM’s past year of commit activity - k8s-nim-operator Public
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
NVIDIA/k8s-nim-operator’s past year of commit activity - gpu-driver-container Public
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
NVIDIA/gpu-driver-container’s past year of commit activity - NemoClaw Public
Run agents like Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference
NVIDIA/NemoClaw’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…