[Common] Reduced padding kernel compilation time #2827
Oleg-Goncharov wants to merge 1 commit into NVIDIA:main
Signed-off-by: Oleg Goncharov <ogoncharov@nvidia.com>
Greptile Summary
This PR reduces the compilation time of padding.cu.
Confidence Score: 5/5
Safe to merge: a minimal two-line change with no correctness risk and a large compile-time benefit.
No files require special attention.
Reviews (1): Last reviewed commit: "Reduced padding kernel compilation time"
Description
This PR reduces the compilation time of padding.cu from approximately 600 seconds to 15 seconds by lowering the outer-loop unroll factor.
Changes
Lowered the unroll factor of the outer-loop #pragma unroll directive from 8 to 4.
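For readers unfamiliar with the directive, the following is a minimal sketch of what a partially unrolled outer loop looks like. It is not the code from padding.cu: the kernel `pad_rows_sketch`, the host helper `pad_rows_on_device`, and the constants `kRowsPerThread` and `kColsPerThread` are hypothetical names chosen for illustration. The general idea is that unrolling an outer loop replicates the (already unrolled) inner loop body once per unrolled iteration, so a smaller factor gives the compiler less code to generate and optimize.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical sizes, assumed for this sketch only.
constexpr int kRowsPerThread = 8;  // trip count of the outer loop
constexpr int kColsPerThread = 2;  // trip count of the inner loop

// Hypothetical kernel: copies a rows x cols matrix into a padded_rows x cols
// buffer and zero-fills the padding rows. Not the kernel from padding.cu.
__global__ void pad_rows_sketch(const float* in, float* out,
                                int rows, int padded_rows, int cols) {
  const int col_base = (blockIdx.x * blockDim.x + threadIdx.x) * kColsPerThread;
  const int row_base = blockIdx.y * kRowsPerThread;

  // Partial unroll of the outer loop: the body is replicated 4 times
  // instead of 8, which shrinks the code the compiler has to process.
#pragma unroll 4
  for (int i = 0; i < kRowsPerThread; ++i) {
    const int row = row_base + i;
    // The inner loop is small, so a full unroll is requested.
#pragma unroll
    for (int j = 0; j < kColsPerThread; ++j) {
      const int col = col_base + j;
      if (row < padded_rows && col < cols) {
        out[row * cols + col] = (row < rows) ? in[row * cols + col] : 0.0f;
      }
    }
  }
}

// Host helper (hypothetical): runs the sketch kernel and returns the padded
// matrix. Error checking is omitted to keep the example short.
std::vector<float> pad_rows_on_device(const std::vector<float>& host_in,
                                      int rows, int padded_rows, int cols) {
  const size_t in_bytes = host_in.size() * sizeof(float);
  const size_t out_count = static_cast<size_t>(padded_rows) * cols;
  const size_t out_bytes = out_count * sizeof(float);

  float* d_in = nullptr;
  float* d_out = nullptr;
  cudaMalloc(&d_in, in_bytes);
  cudaMalloc(&d_out, out_bytes);
  cudaMemcpy(d_in, host_in.data(), in_bytes, cudaMemcpyHostToDevice);

  const int threads = 128;
  const int cols_per_block = threads * kColsPerThread;
  dim3 block(threads);
  dim3 grid((cols + cols_per_block - 1) / cols_per_block,
            (padded_rows + kRowsPerThread - 1) / kRowsPerThread);
  pad_rows_sketch<<<grid, block>>>(d_in, d_out, rows, padded_rows, cols);

  std::vector<float> host_out(out_count);
  cudaMemcpy(host_out.data(), d_out, out_bytes, cudaMemcpyDeviceToHost);
  cudaFree(d_in);
  cudaFree(d_out);
  return host_out;
}
```

Changing the factor in `#pragma unroll 4` affects only how the compiler expands the loop, not what the loop computes, which is consistent with the review's assessment that the change carries no correctness risk. The compile-time figures quoted in the description apply to padding.cu and should not be expected from this sketch.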