Cache swizzled tensor for tuning #1686

Open
Serge45 wants to merge 5 commits into develop from feature/tensilelite-cache-swizzle

Conversation

Serge45 (Collaborator) commented Feb 20, 2025

  • Cache the swizzled tensor according to its datatype and size, to avoid repeating the host-side swizzle.
  • An LRUCache was introduced to balance memory usage and performance (a minimal sketch of such a cache follows this list).
  • The re-layout is skipped if validation is disabled.
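
For reference, here is a minimal sketch of the kind of LRU cache described above, keyed by datatype and size. The class name, members, and template parameters are illustrative assumptions and not code from this PR.

```cpp
#include <cstddef>
#include <functional>
#include <list>
#include <unordered_map>
#include <utility>

// Hypothetical LRU cache keyed by an arbitrary Key (e.g. datatype plus element
// count) and holding an arbitrary Value (e.g. a host buffer in the swizzled
// layout). Purely illustrative; not the implementation in this PR.
template <typename Key, typename Value, typename Hash = std::hash<Key>>
class LRUCache
{
public:
    explicit LRUCache(std::size_t capacity) : m_capacity(capacity) {}

    // Returns a pointer to the cached value and marks it most-recently-used,
    // or nullptr on a miss.
    Value* get(Key const& key)
    {
        auto it = m_index.find(key);
        if(it == m_index.end())
            return nullptr;
        m_entries.splice(m_entries.begin(), m_entries, it->second); // move to front
        return &it->second->second;
    }

    // Inserts (or overwrites) an entry and evicts the least-recently-used
    // entry if the capacity is exceeded.
    void put(Key const& key, Value value)
    {
        if(auto it = m_index.find(key); it != m_index.end())
        {
            it->second->second = std::move(value);
            m_entries.splice(m_entries.begin(), m_entries, it->second);
            return;
        }
        m_entries.emplace_front(key, std::move(value));
        m_index[key] = m_entries.begin();
        if(m_entries.size() > m_capacity)
        {
            m_index.erase(m_entries.back().first);
            m_entries.pop_back();
        }
    }

private:
    using Entry = std::pair<Key, Value>;

    std::size_t                                                        m_capacity;
    std::list<Entry>                                                   m_entries; // front = most recent
    std::unordered_map<Key, typename std::list<Entry>::iterator, Hash> m_index;
};
```

The std::list plus std::unordered_map combination keeps both lookups and recency updates O(1), which is the usual trade-off for an LRU cache: a bounded amount of host memory in exchange for skipping repeated swizzles of identical tensors.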

jichangjichang previously approved these changes Feb 21, 2025
// TODO: Support more swizzling types, such as 32x32x8; currently we only have 16x16x8.
if(needSwizzle)

//if no validation, skip the swizzle
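
The 16x16x8 in the TODO above presumably refers to the block shape targeted by the host-side re-layout. As a purely illustrative aid (not the actual layout used by hipBLASLt), the sketch below pads a row-major M x K matrix up to block multiples and then permutes it into contiguous blocks; the function name and the MI_M/MI_K/PACK_K parameters are assumptions.

```cpp
#include <cstddef>
#include <vector>

// Illustrative pad-and-permute re-layout of a row-major M x K matrix into
// contiguous MI_M x (MI_K * PACK_K) blocks. The block traversal order and the
// names (swizzleHost, miM, miK, packK) are assumptions for illustration; the
// real 16x16x8 swizzle may order elements differently.
template <typename T>
std::vector<T> swizzleHost(std::vector<T> const& src,
                           std::size_t           m,
                           std::size_t           k,
                           std::size_t           miM   = 16,
                           std::size_t           miK   = 16,
                           std::size_t           packK = 8)
{
    std::size_t const blockK  = miK * packK;
    std::size_t const mPadded = (m + miM - 1) / miM * miM;          // pad M to a multiple of miM
    std::size_t const kPadded = (k + blockK - 1) / blockK * blockK; // pad K to a multiple of miK * packK

    std::vector<T> dst(mPadded * kPadded, T(0)); // padded region stays zero-filled

    std::size_t out = 0;
    for(std::size_t mb = 0; mb < mPadded; mb += miM)        // block row
        for(std::size_t kb = 0; kb < kPadded; kb += blockK) // block column
            for(std::size_t i = 0; i < miM; ++i)            // row inside the block
                for(std::size_t j = 0; j < blockK; ++j)     // column inside the block
                {
                    std::size_t const row = mb + i;
                    std::size_t const col = kb + j;
                    dst[out++] = (row < m && col < k) ? src[row * k + col] : T(0);
                }
    return dst;
}
```

In this illustration the padding and the permutation happen in the same pass, which is why skipping the re-layout when validation is disabled raises the question below about whether the padding can be skipped as well.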
Contributor

I understand that we don't need to permute the tensor when there is no validation, but is it correct to skip the padding?

Collaborator Author

After running some experiments with swizzled and non-swizzled tensors on the STA problem, I observed a performance disparity. The latest commit uses an LRUCache to manage the cached tensors, balancing memory usage against runtime performance, and it now always performs the swizzle for STA/STB problems.
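
Building on the two sketches above, here is a rough illustration of how a lookup keyed by datatype and size could wrap the host-side swizzle during tuning. SwizzleKey, SwizzleKeyHash, and getOrSwizzle are hypothetical names and not the PR's API.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Requires the LRUCache and swizzleHost sketches shown earlier in this thread.

// Hypothetical cache key: a datatype tag plus the tensor extents. The PR only
// describes the key as "datatype and size".
struct SwizzleKey
{
    std::string dataType; // e.g. "f16", "bf16"
    std::size_t m;
    std::size_t k;

    bool operator==(SwizzleKey const& o) const
    {
        return dataType == o.dataType && m == o.m && k == o.k;
    }
};

struct SwizzleKeyHash
{
    std::size_t operator()(SwizzleKey const& key) const
    {
        std::size_t h = std::hash<std::string>{}(key.dataType);
        h ^= std::hash<std::size_t>{}(key.m) + 0x9e3779b9 + (h << 6) + (h >> 2);
        h ^= std::hash<std::size_t>{}(key.k) + 0x9e3779b9 + (h << 6) + (h >> 2);
        return h;
    }
};

// Swizzle on a cache miss, reuse the cached buffer on a hit.
// Illustrative only; assumes a non-zero cache capacity.
template <typename T>
std::vector<T> const& getOrSwizzle(LRUCache<SwizzleKey, std::vector<T>, SwizzleKeyHash>& cache,
                                   SwizzleKey const&                                     key,
                                   std::vector<T> const&                                 src)
{
    if(auto* hit = cache.get(key))
        return *hit;                                    // reuse the previously swizzled tensor
    cache.put(key, swizzleHost(src, key.m, key.k));     // pay the host-side swizzle cost once
    return *cache.get(key);
}
```

On a hit the previously swizzled buffer is reused; on a miss the swizzle cost is paid once and the result stays cached until the LRU policy evicts it.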

@Serge45 force-pushed the feature/tensilelite-cache-swizzle branch from 0120521 to 4948354 on February 24, 2025 at 09:37
@geotseng-amd self-requested a review on February 26, 2025 at 04:18