Commit f849dcd
fix integer overflow problem (pytorch#5051)
Summary:
Pull Request resolved: pytorch#5051
X-link: https://github.com/facebookresearch/FBGEMM/pull/2055
When the token size is very large, the integer multiplication overflows, which leads to a CUDA memory access error.
Reviewed By: royren622
Differential Revision: D85330676
fbshipit-source-id: 51a5f61ed52f43358b61f4e689b33b95d051f987
1 parent 3c4ae35 commit f849dcd
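The changed lines are not reproduced on this page, so the following is only a minimal sketch of the failure mode the summary describes, not the actual patch. It assumes the offending expression is an index computation of the form `token * stride` held in a 32-bit integer, and that the remedy is to widen one operand to 64 bits before multiplying. The names `offset64`, `offset32_wrapped`, and `fits_in_int32` are hypothetical and do not come from FBGEMM.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper: element offset of row `token` in a row-major buffer
// with `stride` elements per row. One operand is widened BEFORE the
// multiplication, so the product is computed in 64-bit arithmetic.
constexpr std::int64_t offset64(std::int32_t token, std::int32_t stride) {
  return static_cast<std::int64_t>(token) * stride;
}

// Hypothetical helper: the same product kept in 32 bits. Unsigned types are
// used so that the wraparound (modulo 2^32) is well defined; with signed
// 32-bit operands the overflow would be undefined behavior.
constexpr std::uint32_t offset32_wrapped(std::uint32_t token,
                                         std::uint32_t stride) {
  return static_cast<std::uint32_t>(token * stride);
}

// True when a 64-bit offset could have been stored in a signed 32-bit index.
constexpr bool fits_in_int32(std::int64_t offset) {
  return offset >= std::numeric_limits<std::int32_t>::min() &&
         offset <= std::numeric_limits<std::int32_t>::max();
}
```

With 5,000,000 tokens and a stride of 1,024 elements, `offset64` yields 5,120,000,000, which does not fit in a signed 32-bit integer, while the 32-bit product wraps to an unrelated value. Using such a wrapped value as a pointer offset is the kind of error that can surface as an illegal memory access inside a kernel.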
1 file changed: 3 additions and 3 deletions
fbgemm_gpu/experimental/gen_ai/src/attention/cuda/cutlass_blackwell_fmha/device
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 296 | 296 | modified |
| 323 | 323 | modified |
| 325 | 325 | modified |