
Conversation

@AdityaKane2001 AdityaKane2001 commented Oct 30, 2025

TL;DR: Ported the Flash Attention CUTLASS 3.x kernels to NATTEN as-is, with wrappers to fit them into NATTEN. They are exposed through the flash-fmha backend.

Summary of changes:

  • C++:

    • Added actual kernel files under csrc/include/natten/cuda/flash_fmha/flash_kernel.
    • Added a Torch C++ interface at csrc/src/flash_fmha.cu, which calls into csrc/.../flash_fmha/flash_fmha_{forward/backward}.cuh.
    • Added a utility file csrc/.../flash_kernel/param_utils.h for param conversion.
  • Python:

    • Exposed the C++ function through the flash-fmha backend.
    • Added an autograd function and configs for the same (a sketch of this wrapping pattern is shown after this list).
    • Where possible, the code is arranged to make it easier to implement a flash-fna backend later.
    • Added autogen scripts and tests for flash-fmha.
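
To make the Python-side wiring concrete, here is a minimal sketch of how the exposed C++ ops could be wrapped in an autograd function. The op names (`flash_fmha_forward`, `flash_fmha_backward`), their signatures, and the saved tensors are assumptions for illustration and do not necessarily match the actual bindings in this PR.

```python
# Hypothetical sketch: wrapping compiled flash-fmha ops in an autograd
# function. Op names, signatures, and saved tensors are assumptions, not the
# actual NATTEN bindings introduced by this PR.
import torch
from torch.autograd import Function


class FlashFMHAFunction(Function):
    @staticmethod
    def forward(ctx, query, key, value, scale):
        # Assumed binding: returns the attention output and the logsumexp
        # tensor needed by the backward pass.
        output, logsumexp = torch.ops.natten.flash_fmha_forward(
            query, key, value, scale
        )
        ctx.save_for_backward(query, key, value, output, logsumexp)
        ctx.scale = scale
        return output

    @staticmethod
    def backward(ctx, grad_output):
        query, key, value, output, logsumexp = ctx.saved_tensors
        # Assumed binding: returns gradients w.r.t. query, key, and value.
        d_query, d_key, d_value = torch.ops.natten.flash_fmha_backward(
            grad_output, query, key, value, output, logsumexp, ctx.scale
        )
        return d_query, d_key, d_value, None
```

A backend selector in the Python frontend could then dispatch to `FlashFMHAFunction.apply` whenever the flash-fmha backend is requested.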

Current rough edges:

  1. The Python frontend may have some code-style inconsistencies.
  2. Some template parameters for the flash backward kernel are currently hard-coded in flash_fmha_backward.cuh rather than handled by autogen; adding them to the autogen scripts would make those scripts overly complex (see the illustrative sketch after this list).
  3. Although correctness is covered by tests, no particular refactoring of the ported Flash Attention kernel code was done.
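
To illustrate point 2, below is a rough sketch of the kind of instantiation generator an autogen script tends to be; the template name, parameter axes, and output format are assumptions for illustration only, not the actual NATTEN autogen code.

```python
# Hypothetical sketch of an autogen-style instantiation generator. Template
# name, parameter axes, and output format are illustrative only and do not
# reflect the actual NATTEN autogen scripts.
from itertools import product

DTYPES = ["cutlass::half_t", "cutlass::bfloat16_t"]
HEAD_DIMS = [32, 64, 128]


def emit_forward_instantiations() -> str:
    lines = []
    for dtype, head_dim in product(DTYPES, HEAD_DIMS):
        # One explicit template instantiation per (dtype, head_dim) pair.
        lines.append(
            f"template void run_flash_fmha_forward<{dtype}, {head_dim}>"
            "(Params& params, cudaStream_t stream);"
        )
    return "\n".join(lines)


if __name__ == "__main__":
    print(emit_forward_instantiations())
```

The backward template takes additional parameters on top of axes like these, so enumerating them in the generator would multiply the emitted combinations and the script's complexity; keeping them fixed in flash_fmha_backward.cuh avoids that for now.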
