Skip to content

Commit 95230e2

Browse files
kausvmeta-codesync[bot]
authored andcommitted
Support larger lookup in permute (#5086)
Summary: Pull Request resolved: #5086 X-link: https://github.com/facebookresearch/FBGEMM/pull/2094 For lengths per shard exceeding 2^31, we avoid overflow resulting in undefined behavior. Reviewed By: spcyppt Differential Revision: D86209662 fbshipit-source-id: 6d51290f3436629571677091c42b76b6f98e5790
1 parent cfe8683 commit 95230e2

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ template <
2020
typename indices_t,
2121
typename weights_t>
2222
__global__ __launch_bounds__(kMaxThreads) void permute_2D_data_kernel(
23-
int32_t len,
23+
int64_t len,
2424
int32_t T,
2525
int32_t B,
2626
const indices_t* __restrict__ indices,

0 commit comments

Comments
 (0)