Skip to content

Enhance CUDA flash attention kernel selection for DKQ=512 with low gq…#6

Open
Ooooze wants to merge 1 commit into
feature/turboquant-kv-cachefrom
fix/cuda-mma-dkq512-fallback
Open

Enhance CUDA flash attention kernel selection for DKQ=512 with low gq…#6
Ooooze wants to merge 1 commit into
feature/turboquant-kv-cachefrom
fix/cuda-mma-dkq512-fallback

Commits

Commits on May 8, 2026