Optimize skip_slice by recording serialized length #1836

Gankra · 2017-10-10T15:19:47Z

For @jrmuizel to test.

This change is

glennw · 2017-10-12T06:01:37Z

@gankro do we want to review / merge this?

glennw · 2017-10-12T06:01:43Z

@gankro do we want to review / merge this?

Gankra · 2017-10-13T05:16:37Z

Still need to do perf eval, gotten distracted the last few days.

jrmuizel · 2017-10-17T19:38:48Z

I tried performance testing this. I couldn't get solid numbers. WebRender seems to have some weird bimodal timing behaviour that makes measuring header.

glennw · 2017-10-19T06:57:30Z

@gankro I can try to do some profiling with this tomorrow, if that would be helpful?

glennw · 2017-10-20T02:12:34Z

I did some simple profiling on one of our benchmarks, with the following:

vblank_mode=0 ../target/release/wrench -r show benchmarks/text-rendering.yaml

This uses -r to replay the display list every frame, and vblank_mode=0 disables vsync on Intel GPUs. The benchmark draws 64 reasonably long text runs of varying colors and font sizes, so it contains a lot of glyphs in the display list.

Without: ~325 fps.
With: ~385 fps.

So, at least in this test case, it's a very significant win. It'd be interesting to see if others can reproduce similar results.

With this patch, this is what the profile graph looks look:

The two areas I've marked with a red dashed line are time spent inside deserialization functions / iterators, so hopefully there's still quite a lot of wins to be found there. There's certainly a couple of other big targets in that profile I can look at performance-wise in the future too.

Gankra · 2017-10-20T21:08:33Z

Wow that's really surprising since text is one of the slices that shouldn't actually need this optimization (with a Sufficiently Smart Compiler).

Seems pretty convincing to me it's a good win. Let's ship it.

glennw · 2017-10-20T22:10:22Z

@bors-servo r+

bors-servo · 2017-10-20T22:10:23Z

📌 Commit 13bfa65 has been approved by glennw

bors-servo · 2017-10-20T22:10:25Z

⌛ Testing commit 13bfa65 with merge 517ef53...

@jrmuizel

Optimize skip_slice by recording serialized length For @jrmuizel to test.  --- This change is [<img src="https://reviewable.io/review_button.svg" height="34" align="absmiddle" alt="Reviewable"/>](https://reviewable.io/reviews/servo/webrender/1836)

bors-servo · 2017-10-21T00:38:38Z

☀️ Test successful - status-appveyor, status-travis
Approved by: glennw
Pushing 517ef53 to master...

jrmuizel · 2017-10-24T21:54:00Z

serde-rs/serde#855 is probably the next place where we're going to get deserialization wins. @gankro will look at it when he gets a chance.

Optimize skip_slice by recording serialized length

13bfa65

bors-servo merged commit 13bfa65 into servo:master Oct 21, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize skip_slice by recording serialized length #1836

Optimize skip_slice by recording serialized length #1836

Gankra commented Oct 10, 2017 •

edited by larsbergstrom

Loading

glennw commented Oct 12, 2017

glennw commented Oct 12, 2017

Gankra commented Oct 13, 2017

jrmuizel commented Oct 17, 2017

glennw commented Oct 19, 2017

glennw commented Oct 20, 2017

Gankra commented Oct 20, 2017

glennw commented Oct 20, 2017

bors-servo commented Oct 20, 2017

bors-servo commented Oct 20, 2017

bors-servo commented Oct 21, 2017

jrmuizel commented Oct 24, 2017

Optimize skip_slice by recording serialized length #1836

Optimize skip_slice by recording serialized length #1836

Conversation

Gankra commented Oct 10, 2017 • edited by larsbergstrom Loading

glennw commented Oct 12, 2017

glennw commented Oct 12, 2017

Gankra commented Oct 13, 2017

jrmuizel commented Oct 17, 2017

glennw commented Oct 19, 2017

glennw commented Oct 20, 2017

Gankra commented Oct 20, 2017

glennw commented Oct 20, 2017

bors-servo commented Oct 20, 2017

bors-servo commented Oct 20, 2017

bors-servo commented Oct 21, 2017

jrmuizel commented Oct 24, 2017

Gankra commented Oct 10, 2017 •

edited by larsbergstrom

Loading