Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document how to benchmark Dragonfly #101

Closed
wants to merge 2 commits into from
Closed
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
85 changes: 85 additions & 0 deletions docs/development/benchmarking.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,85 @@
---
sidebar_position: 3
---

# Benchmarking

Do you have an existing Redis environment and would like to see if Dragonfly could be a better
replacement? <br/>
Are you developing a service and would like to determine which cloud instance type to
allocate for Dragonfly? <br/>
Do you wonder how many replicas you need to support your workload?

If so, read on, because this page is for you!

## Choosing an Environment

A benchmark is done to assess the performance aspects of a system. In the case of Dragonfly, a
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Was this paragraph generated by ChatGPT? :)

I thought about providing specific requirements of how to say reach 1M qps on m5 family instance.

  1. Specifiying Linux os version, even distribution, mentioning that Dragonfly runs best with iouring,
  2. talking about what's the minimal instance size that allows reaching 1M qps on m5.
  3. Talking about running the load test on a separate machine. machine size for the load test .
  4. Specifying that it is better to choose a bigger size for load test machine to avoid bottlenecks on the client side.
  5. Provide rule of thumb for --threads should not be higher than number of vcpus on that machine.
  6. Provide suggested configuration for memtier to reach 1M qps on Dragonfly.
  7. Provide suggested configuration + instance type to reach 2M qps on GCP instance etc

i.e. keep everything very technical and specific.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

or any other instance family and interesting target goal.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was not generated by ChatGPT, but I've been called worse :)

re/ 1: done
re/ 2: what is indeed the minimal instance?
re/ 3: I already talk about it below
re/ 4: done
re/ 5: done (but I think you meant --proactor_threads?)
re/ 6+7: do you have these? or would you like me to run them until I figure it out?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry 😞 , I was joking.

re/5 - I meant -t on memtier side. Dragonfly actually spans all the cpus automatically. memtier always uses 4 by default.

benchmark is commonly used to assess the CPU and memory performance & utilization.

Depending on the goals of your benchmark, you should choose the machine size accordingly. For a
production mimicking benchmark, you should use a machine size and traffic load similar to that of
your busiest production timing, or even higher to allow for some cushion.

If you do not use a cloud instance, it might be a good idea to configure your CPU's governance to
performance by issuing:

```shell
sudo apt install linux-tools-common linux-tools-generic
sudo cpupower frequency-set --governor performance
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are you sure you need it on cloud instances?
i have never done it, actually.

Copy link
Contributor

@romange romange Jun 15, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry I missed the sentence before :(
I would frame the whole page around the cloud or on-prem datacenter server. I do not advise benchmarking locally.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, I removed this whole paragraph with the command line example

```

Then, when you're done with the benchmark you could reboot your machine or run the following:

```shell
sudo cpupower frequency-set --governor powersave
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would not suggest it unless you saw difference in Dragonfly's performance with and without this option on the cloud.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed as per discussed

```

## Setting Up Dragonfly

Dragonfly can run in [Docker](/getting-started/docker) or directly installed as a
[binary](/getting-started/binary) on your machine. See the [Getting Started](/getting-started) page
for other options and the latest documentation.

## Reducing Noise

Ideally, a benchmark should be run in as similar as possible environment as the production setup.

In busy production deployments, it is common to run Dragonfly in its own machine (virtual or
dedicated). If you plan to do so in your production setup as well (which we highly recommend),
consider running the benchmark in a similar way.

In practice, it means that any other systems in your setup (like other services & databases) should
run in other machines. Importantly, also the software that sends the traffic should run in another
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i would even make the last sentence bold or move it to the start because it's the most relevant sentence here for practical purposes.

machine.

## Sending Traffic

If your service already has existing benchmarking tools, or ways to record and replay production
traffic, you should definitely use them. That would be the closest estimation to what a real
production deployment with a backing Dragonfly would look like.

If, like many others, you do not (yet) have such a tool, you could either write your own tool to
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets be opinionated and say - we usually use memtier - and this is how we do it.
I would not suggest folks to write their own loadtest tool :)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can also mention redis-benchmark - from my experience (have not been using it for the last two years) it's less efficient than memtier but it has more predefined loadtest options specific to redis.
but each such statement must be checked verified and run personally by you.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would actually suggest that users write their on load tests, as an end-to-end kind of thing, but that's really beside the point here :)

simulate production traffic or use an existing tool like `memtier_benchmark`.

When writing your own tool, try to recreate the production traffic as closely as possible. Use the
same commands (like `SET`, `GET`, `SADD`, etc), with the expected ratio between them, and the
expected key and value sizes.

If you choose to use an existing benchmarking tool, a popular and mature one is
[`memtier_benchmark`](https://github.com/RedisLabs/memtier_benchmark). It's an Open Source tool for
generic load generation and benchmarking with many features. Check out their documentation page for
more details, but as a quick reference you could use:

```shell
memtier_benchmark \
--server=<IP / Host> \
--threads=<thread count> \
--clients=<clients per thread> \
--requests=<requests per client>
```

## Having Troubles? Anything Unclear?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

btw, every docs page has "edit page" button at the end...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shall I remove this then?


Improving our documentation and helping the community is always of the higher priority for us, so
please feel free to reach out!