Set thread counts for client & server using env variables with better defaults #1993

Closed
pomo-mondreganto wants to merge 3 commits

Conversation

pomo-mondreganto

No description provided.

@pomo-mondreganto
Author

pomo-mondreganto commented Nov 29, 2023

A bit of a backstory: we've encountered two issues using sccache in our cluster.

First, when running in a CPU-limited container environment, tokio detects the total number of CPUs on the host instead of the container limit, which would be the correct value (this behaviour exists in all schedulers known to me, e.g. the Golang one). Second, sccache spawns a gigantic tokio scheduler with num_cpus worker threads for each compilation process (client invocation). In our setup (128 logical CPUs, a 64-CPU container limit, 64 concurrent compilation jobs) sccache spawned 128 * 64 threads during compilation, which is completely unnecessary and overflows the thread limit for the container.
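
For context, the change boils down to letting an environment variable cap the worker-thread count instead of always sizing the runtime from num_cpus. A minimal sketch of that idea, reusing the `SCCACHE_CLIENT_WORKER_THREADS` variable this PR introduces (illustrative only; the actual sccache code paths differ):

```rust
use std::env;
use tokio::runtime::Runtime;

/// Build a client-side runtime, preferring an explicit thread count from the
/// environment and falling back to the detected CPU count otherwise.
/// (Sketch only; function name and structure are for illustration.)
fn build_client_runtime() -> std::io::Result<Runtime> {
    let threads = env::var("SCCACHE_CLIENT_WORKER_THREADS")
        .ok()
        .and_then(|v| v.parse::<usize>().ok())
        .unwrap_or_else(num_cpus::get);

    tokio::runtime::Builder::new_multi_thread()
        .worker_threads(threads)
        .enable_all()
        .build()
}
```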

@glandium
Collaborator

> First, when running in a CPU-limited container environment, tokio detects the total number of CPUs on the host instead of the container limit, which would be the correct value (this behaviour exists in all schedulers known to me, e.g. the Golang one).

num_cpus uses sched_getaffinity, and should return the right number. What does your container do to limit CPU?

@pomo-mondreganto
Author

> > First, when running in a CPU-limited container environment, tokio detects the total number of CPUs on the host instead of the container limit, which would be the correct value (this behaviour exists in all schedulers known to me, e.g. the Golang one).
>
> num_cpus uses sched_getaffinity, and should return the right number. What does your container do to limit CPU?

It's a regular Kubernetes installation with the containerd runtime; the containers are limited via the CPU requests/limits set by k8s.

@pomo-mondreganto
Author

I've just checked that in a container with the following settings:

Limits:
  cpu:                3
  ephemeral-storage:  100Gi
  memory:             4Gi
Requests:
  cpu:                3
  ephemeral-storage:  100Gi
  memory:             4Gi

sched_getaffinity returns all cores:

>>> import os
>>> os.sched_getaffinity(0)
{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127}

@glandium
Collaborator

I guess k8s allocates fewer scheduling slots rather than fewer CPUs.
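
That matches how Kubernetes enforces a CPU limit: it becomes a CFS bandwidth quota in the container's cgroup rather than a restricted affinity mask, so sched_getaffinity (and therefore num_cpus) still sees every core on the node. A rough sketch of how the effective limit could be derived from the cgroup v2 quota, assuming the standard `/sys/fs/cgroup/cpu.max` layout (illustrative only, not part of this PR):

```rust
use std::fs;

/// Best-effort read of the cgroup v2 CPU quota, which is what a Kubernetes
/// cpu limit translates to, as opposed to a CPU affinity mask.
/// Returns None when unlimited or when the file is missing/unparsable.
/// (Sketch only; cgroup v1 fallbacks are omitted.)
fn cgroup_cpu_limit() -> Option<usize> {
    let contents = fs::read_to_string("/sys/fs/cgroup/cpu.max").ok()?;
    let mut parts = contents.split_whitespace();
    let quota: f64 = parts.next()?.parse().ok()?; // "max" fails to parse -> None
    let period: f64 = parts.next()?.parse().ok()?;
    Some((quota / period).ceil() as usize)
}
```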

@codecov-commenter

codecov-commenter commented Nov 30, 2023

Codecov Report

Attention: 7 lines in your changes are missing coverage. Please review.

Comparison is base (fb0ab0c) 31.11% compared to head (ba4e92b) 30.89%.

Files            Patch %   Lines
src/commands.rs  0.00%     1 Missing and 3 partials ⚠️
src/server.rs    0.00%     1 Missing and 2 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1993      +/-   ##
==========================================
- Coverage   31.11%   30.89%   -0.23%     
==========================================
  Files          51       51              
  Lines       19419    19425       +6     
  Branches     9341     9356      +15     
==========================================
- Hits         6043     6001      -42     
- Misses       7785     7797      +12     
- Partials     5591     5627      +36     

☔ View full report in Codecov by Sentry.

@pomo-mondreganto
Author

Can someone take a look at the "cancelled" test? There's no output and no reason given for the cancellation.

@sylvestre
Collaborator

Don't bother, it is fine :)

@pomo-mondreganto
Author

So can I consider my work finished here? :)

@pomo-mondreganto
Author

Any plans on merging this? I might be adding some more features in the near future, and I'd prefer implementing them on top of upstream rather than my fork.

@@ -161,6 +161,7 @@ set(CMAKE_MSVC_DEBUG_INFORMATION_FORMAT Embedded)

And you can build code as usual without any additional flags in the command line, useful for IDEs.

To limit the number of threads the sccache process spawns, use the `SCCACHE_SERVER_WORKER_THREADS` and `SCCACHE_CLIENT_WORKER_THREADS` environment variables for the server and client processes respectively.
Collaborator


please explain in which contexts someone might need this

@sylvestre
Collaborator

it would be nice to add tests which verify that it works correctly

@sylvestre
Collaborator

ping ?

@sylvestre
Collaborator

please reopen when ready

@sylvestre sylvestre closed this Feb 20, 2024