Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers. #4128

greggman · 2025-01-02T22:28:35Z

Modified so this test doesn't use storage buffers by having it return values from a fragment shader as rgba32uint

jrprice · 2025-01-07T17:35:31Z

src/webgpu/shader/execution/expression/call/builtin/fwidth.ts

-  pass.setPipeline(pipeline);
-  pass.setBindGroup(0, group);
-  for (let quad = 0; quad < cases.length / vectorWidth; quad++) {
+  for (let quad = 0; quad / vectorWidth; quad++) {


This loop condition seems wrong? Looks like the loop never executes because 0 / ... is 0 and that gets coerced to false...

ha! that's embarrassing 😅

greggman · 2025-01-08T21:17:39Z

Here's a fixed version. Sorry about the previous bug. I originally tried to make it work a quad at a time and I got it working but it was taking 10 seconds per case on my M1 mac allocating 1 buffer per quad to copy the texture and then mapping that 1 buffer etc.. I did some optimizations and got it down to 5 seconds but ran into another issue with that path. Eventually I realized I could do a bunch of quads at once and this is the result. I could optimize more as it's not always using all of the quads now. It uses all 256 quad (2x2 texel areas in a 512x2 texture) for f32 but less for each vectorized version (only 64 for vec4). But, it's fast enough as is, maybe 10% slower than before.

jrprice

LGTM

Modified so this test doesn't use storage buffers by having it return values from a fragment shader as rgba32uint

greggman requested a review from jrprice January 2, 2025 22:29

jrprice reviewed Jan 7, 2025

View reviewed changes

greggman force-pushed the fix-fwidth branch 2 times, most recently from 97afdbb to d43a1ce Compare January 8, 2025 21:13

jrprice approved these changes Jan 9, 2025

View reviewed changes

Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers.

8f1cce9

Modified so this test doesn't use storage buffers by having it return values from a fragment shader as rgba32uint

greggman force-pushed the fix-fwidth branch from d43a1ce to 8f1cce9 Compare January 9, 2025 02:19

greggman enabled auto-merge (squash) January 9, 2025 02:20

greggman merged commit 077ffee into gpuweb:main Jan 9, 2025
1 check passed

greggman deleted the fix-fwidth branch January 10, 2025 17:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers. #4128

Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers. #4128

greggman commented Jan 2, 2025

jrprice Jan 7, 2025

greggman Jan 8, 2025

greggman commented Jan 8, 2025

jrprice left a comment

Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers. #4128

Compat: Refactor fwidth/Fine/Coarse for 0 storage buffers. #4128

Conversation

greggman commented Jan 2, 2025

jrprice Jan 7, 2025

Choose a reason for hiding this comment

greggman Jan 8, 2025

Choose a reason for hiding this comment

greggman commented Jan 8, 2025

jrprice left a comment

Choose a reason for hiding this comment