This repo builds binary versions of llama.cpp for Ubuntu with CUDA and Vulkan support.
New releases are automatically built for the latest release version of llama.cpp.
Currently supported CUDA build configurations:
| CPU arch | OS | CUDA | Nvidia Compute arch |
|---|---|---|---|
| amd64 | Ubuntu 24.04 | 12.9 | 86, 89 |
| arm64 | Ubuntu 22.04 | 12.9 | 87 |
Compute architectures 86 and 89 are those used by consumer video cards.
Compute architecture 87 is used by Jetson Orin and Jetson AGX.
Currently supported Vulkan build configurations:
| CPU arch | OS | Vulkan |
|---|---|---|
| arm64 | Ubuntu 22.04 | 1.4.328.1 |
The prebuilt Vulkan SDK for ARM64 used for our builds comes from https://github.com/jakoch/vulkan-sdk-arm
Thank you!