GPUStack
GPU cluster manager for optimized AI model deployment
Pinned Loading
Repositories
Showing 10 of 14 repositories
- runner Public
Collection of Dockerfiles to build images for various inference services across different accelerated backends.
gpustack/runner’s past year of commit activity - gpustack-ui Public
gpustack/gpustack-ui’s past year of commit activity - gpustack Public
Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model.
gpustack/gpustack’s past year of commit activity - gguf-parser-go Public
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
gpustack/gguf-parser-go’s past year of commit activity - gpustack.github.io Public
gpustack/gpustack.github.io’s past year of commit activity - gpustack-higress-plugin Public
gpustack/gpustack-higress-plugin’s past year of commit activity - vox-box Public
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
gpustack/vox-box’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…