Skip to content
Change the repository type filter

All

    Repositories list

    • vllm-omni

      Public
      A high-throughput and memory efficient inference and serving engine for Omni-modality models
      Python
      Apache License 2.0
      526000Updated Mar 13, 2026Mar 13, 2026
    • aiter

      Public
      AI Tensor Engine for ROCm
      Python
      MIT License
      236003Updated Mar 13, 2026Mar 13, 2026
    • litellm

      Public
      Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Re…
      Python
      Other
      6.4k001Updated Mar 13, 2026Mar 13, 2026
    • vllm

      Public
      vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      14k9417Updated Mar 12, 2026Mar 12, 2026
    • Python
      Apache License 2.0
      0000Updated Mar 10, 2026Mar 10, 2026
    • JamAIBase

      Public
      The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work tog…
      Python
      Apache License 2.0
      391.1k11Updated Mar 6, 2026Mar 6, 2026
    • skypilot

      Public
      SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
      Python
      Apache License 2.0
      988000Updated Mar 5, 2026Mar 5, 2026
    • nanobot

      Public
      "🐈 nanobot: The Ultra-Lightweight OpenClaw"
      Python
      MIT License
      5.5k000Updated Mar 3, 2026Mar 3, 2026
    • vllm-rocm-wheel

      Public
      Python
      Apache License 2.0
      0005Updated Feb 24, 2026Feb 24, 2026
    • vllmtests

      Public
      This is a repository containing the tools for testing vLLM correctness and perf regression
      Python
      Apache License 2.0
      2200Updated Jan 15, 2026Jan 15, 2026
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      14k100Updated Jan 9, 2026Jan 9, 2026
    • Typescript Documentation of JamAISDK
      HTML
      0000Updated Jan 8, 2026Jan 8, 2026
    • lmms-eval

      Public
      One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
      Python
      Other
      539000Updated Jan 6, 2026Jan 6, 2026
    • High-performance safetensors model loader
      Python
      Apache License 2.0
      21002Updated Dec 29, 2025Dec 29, 2025
    • vllm-wheel

      Public
      Python
      Apache License 2.0
      0005Updated Dec 8, 2025Dec 8, 2025
    • Collect the scripts and results of all reasoning experiments.
      Python
      Apache License 2.0
      1000Updated Dec 7, 2025Dec 7, 2025
    • recipes

      Public
      Common recipes to run vLLM
      Jupyter Notebook
      Apache License 2.0
      167000Updated Nov 25, 2025Nov 25, 2025
    • vllm-rocm

      Public
      Python
      Apache License 2.0
      0200Updated Nov 21, 2025Nov 21, 2025
    • HTML
      73000Updated Oct 3, 2025Oct 3, 2025
    • Python
      14000Updated Sep 25, 2025Sep 25, 2025
    • LMCache

      Public
      ROCm support of Ultra-Fast and Cheaper Long-Context LLM Inference
      Python
      Apache License 2.0
      1k000Updated Jul 15, 2025Jul 15, 2025
    • roxl

      Public
      NVIDIA Inference Xfer Library (NIXL)
      C++
      Apache License 2.0
      262000Updated Jun 6, 2025Jun 6, 2025
    • This is a repository to monitor the fast changing ROCm/aiter repository to alert user that AITER function of interests e.g. in vLLM, in SGLang has been updated …
      Python
      Apache License 2.0
      00390Updated Apr 27, 2025Apr 27, 2025
    • vLLM Workshop Content
      Apache License 2.0
      0200Updated Apr 3, 2025Apr 3, 2025
    • Jupyter Notebook
      5000Updated Mar 20, 2025Mar 20, 2025
    • Python
      Apache License 2.0
      1000Updated Feb 24, 2025Feb 24, 2025
    • The driver for LMCache core to run in vLLM
      Python
      Apache License 2.0
      33000Updated Jan 24, 2025Jan 24, 2025
    • Python
      8000Updated Jan 23, 2025Jan 23, 2025
    • Python
      Apache License 2.0
      373000Updated Jan 22, 2025Jan 22, 2025
    • kvpress

      Public
      LLM KV cache compression made easy
      Python
      Apache License 2.0
      121100Updated Jan 21, 2025Jan 21, 2025