Change the repository type filter
All
Repositories list
26 repositories
vllm-spyre
Publicvllm-gaudi
Public- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- Community maintained hardware plugin for vLLM on Ascend
- Intelligent Mixture-of-Models Router for Efficient LLM Inference
recipes
PublicFlashMLA
Publicproduction-stack
Publicvllm-xpu-kernels
Publicvllm-neuron
PublicDeepGEMM
Publicvllm-openvino
Publicrfcs
Publicvllm-project.github.io-static
Public archivemedia-kit
Publicdashboard
Public