You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed in kernel_traits that in FA3, Q and K are kept fixed in memory, while V and O can reuse the same space. However, isn't Q the only tensor that must remain fixed? (Since our block keeps moving to the right, Q must stay fixed, while K and V are continuously updated.)
Why not allow KV and O to share memory space (using a union)? Is it because O occupies very little space, making such a modification unnecessary?
The text was updated successfully, but these errors were encountered:
I noticed in kernel_traits that in FA3, Q and K are kept fixed in memory, while V and O can reuse the same space. However, isn't Q the only tensor that must remain fixed? (Since our block keeps moving to the right, Q must stay fixed, while K and V are continuously updated.)
Why not allow KV and O to share memory space (using a union)? Is it because O occupies very little space, making such a modification unnecessary?
The text was updated successfully, but these errors were encountered: