Add PR and Release benchmark with new changes in framework #2871

roshkhatri · 2025-11-25T00:29:49Z

This adds the workflow improvements for PR and Release benchmark where it runs on
c8g.metal-48xl for ARM64 and c7i.metal-48xl for X86

Cluster mode: disabled
TLS: disabled
io-threads: 1, 9
Pipelining: 1, 10
Clients: 1600
Benchmark Treads: 90
Data size: 16 ,96
Commands: SET, GET

c8g.metal-48xl Spec: https://aws.amazon.com/ec2/instance-types/c8g/
c7i.metal.48xl Spec: https://aws.amazon.com/ec2/instance-types/c7i/

vCPU: 192
NUMA nodes: 2
Memory (GiB): 384
Network Bandwidth (Gbps): 50

PR benchmarking will be executed on ARM64 machine as it has been seen to be more consistent.
Additionally, it runs 5 iterations for each tests and posts the average and other statistical metrics like

CI99%: 99% Confidence Interval - range where the true population mean is likely to fall
PI99%: 99% Prediction Interval - range where a single future observation is likely to fall
CV: Coefficient of Variation - relative variability (σ/μ × 100%)

Note: Values with (n=X, σ=Y, CV=Z%, CI99%=±W%, PI99%=±V%) indicate averages from X runs with standard deviation Y, coefficient of variation Z%, 99% confidence interval margin of error ±W% of the mean, and 99% prediction interval margin of error ±V% of the mean. CI bounds [A, B] and PI bounds [C, D] show the actual interval ranges.

For comparing between versions, it adds a workflow which runs on both ARM64 and X86 machine. It will also post the comparison between the versions like this: #2580 (comment)

Signed-off-by: Roshan Khatri <[email protected]>

codecov · 2025-11-25T02:12:14Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 72.43%. Comparing base (8ea7f13) to head (cec12f6).
⚠️ Report is 3 commits behind head on unstable.

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #2871      +/-   ##
============================================
- Coverage     72.44%   72.43%   -0.01%     
============================================
  Files           128      128              
  Lines         70415    70439      +24     
============================================
+ Hits          51011    51026      +15     
- Misses        19404    19413       +9

see 19 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

sarthakaggarwal97 · 2025-11-25T21:52:45Z

@roshkhatri looks like we are adding x86 as well? I am not sure if we should add x86 runs if we are not confident on the stability and numbers yet.

rainsupreme

LGTM! 👍

I wrote a few notes but you don't have to fix them in this PR.

rainsupreme · 2025-11-26T00:35:28Z

.github/benchmark_configs/benchmark-config-x86.json

+    "io-threads": [1,9],
+    "benchmark-threads": 90,
+    "server_cpu_range": "0-8",
+    "client_cpu_range": "144-191,48-95"


curious why the range is split. Is it something to do with NUMA nodes..?

Yes for X86 the NUMA nodes have cores split

.github/workflows/benchmark-on-label.yml

rainsupreme · 2025-11-26T00:42:26Z

.github/workflows/benchmark-release.yml

+          python-version: "3.10"
+          cache: "pip"
+
+      - name: Install dependencies


I feel like I've seen this checkout and dependency setup stuff in multiple places. Is there some way we could deduplicate it and avoid the possibility of bugs from having accidental differences between them over time?

rainsupreme · 2025-11-26T00:54:35Z

.github/workflows/benchmark-release.yml

+        with:
+          path: artifacts
+
+      - name: Combine results and create comprehensive report


for the record, I'm not a fan of putting "business logic" in yml files like this. I'd prefer this to be in a script that we call, but I'm not going to make a big fuss in this PR. It's not worse than what you're replacing.

Thats true, I was thinking the other way, where I didnt want to add another script which will be used just for one simple purpose 😅

roshkhatri · 2025-11-26T03:06:39Z

@roshkhatri looks like we are adding x86 as well? I am not sure if we should add x86 runs if we are not confident on the stability and numbers yet.

Yes, we would still like to get the benchmark numbers for X86, while doing the releases. The PR only used ARM64 though

Signed-off-by: Roshan Khatri <[email protected]>

rainsupreme

Updates look fine 👍

github-actions bot assigned roshkhatri Nov 25, 2025

Align PR and Release benchmarking with new changes

e4d129c

Signed-off-by: Roshan Khatri <[email protected]>

roshkhatri force-pushed the improve-pr-benchmark branch from a7f8b5c to e4d129c Compare November 25, 2025 00:30

yaml formatting

12e4fe1

Signed-off-by: Roshan Khatri <[email protected]>

roshkhatri marked this pull request as ready for review November 25, 2025 02:02

roshkhatri changed the title ~~Align PR and Release benchmarking with new changes~~ Add PR and Release benchmark with new changes in framework Nov 25, 2025

rainsupreme approved these changes Nov 26, 2025

View reviewed changes

Update .github/workflows/benchmark-on-label.yml

cec12f6

Signed-off-by: Roshan Khatri <[email protected]>

roshkhatri requested a review from zuiderkwast December 1, 2025 23:13

rainsupreme approved these changes Dec 2, 2025

View reviewed changes

sarthakaggarwal97 approved these changes Dec 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add PR and Release benchmark with new changes in framework #2871

Add PR and Release benchmark with new changes in framework #2871

Uh oh!

roshkhatri commented Nov 25, 2025 •

edited by zuiderkwast

Loading

Uh oh!

codecov bot commented Nov 25, 2025 •

edited

Loading

Uh oh!

sarthakaggarwal97 commented Nov 25, 2025

Uh oh!

rainsupreme left a comment

Uh oh!

rainsupreme Nov 26, 2025

Uh oh!

roshkhatri Nov 26, 2025

Uh oh!

Uh oh!

rainsupreme Nov 26, 2025

Uh oh!

rainsupreme Nov 26, 2025

Uh oh!

roshkhatri Nov 26, 2025

Uh oh!

roshkhatri commented Nov 26, 2025

Uh oh!

rainsupreme left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add PR and Release benchmark with new changes in framework #2871

Are you sure you want to change the base?

Add PR and Release benchmark with new changes in framework #2871

Uh oh!

Conversation

roshkhatri commented Nov 25, 2025 • edited by zuiderkwast Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sarthakaggarwal97 commented Nov 25, 2025

Uh oh!

rainsupreme left a comment

Choose a reason for hiding this comment

Uh oh!

rainsupreme Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

roshkhatri Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rainsupreme Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

rainsupreme Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

roshkhatri Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

roshkhatri commented Nov 26, 2025

Uh oh!

rainsupreme left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

roshkhatri commented Nov 25, 2025 •

edited by zuiderkwast

Loading

codecov bot commented Nov 25, 2025 •

edited

Loading