feat(shard distributor): Persist Shard-Level Metrics With Guarded Updates for Load Balancing #7354

AndreasHolt · 2025-10-20T10:21:26Z

What changed?

introduce store.ShardMetrics (smoothed load + timestamps) and persist it under store/<namespace>/shards/<shardID>/metrics. SmoothedLoad will keep an EWMA of shard load, LastUpdateTime will be used to dynamically update the alpha value used in the EWMA, and LastMoveTime will be used to support cooldown logic that limits shard churn
extend the etcd store so AssignShard/AssignShards write metrics alongside ownership, refresh LastMoveTime when reusing existing metrics, and apply per-shard metric updates after the main transaction to stay within etcd’s limit of 128 operations per transaction
extend GetState to read the new metric keys and expose them in NamespaceState, allowing the leader to use them for future rebalancing decisions
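
To illustrate the transaction-limit point, here is a minimal sketch of splitting a list of pending per-shard updates into batches that each fit in one etcd transaction. The helper name `chunk` and the constant `maxOpsPerTxn` are hypothetical and not taken from this PR; 128 is etcd's default for the server's `--max-txn-ops` setting. The PR itself may apply the updates differently (for example one transaction per shard).

```go
package main

// maxOpsPerTxn is an assumed constant matching etcd's default
// per-transaction operation limit (--max-txn-ops, 128 by default).
const maxOpsPerTxn = 128

// chunk splits items into consecutive batches of at most size elements.
// It is a hypothetical helper: each returned batch could be sent as its
// own transaction after the main ownership transaction has committed.
func chunk[T any](items []T, size int) [][]T {
	if size <= 0 {
		return nil
	}
	var out [][]T
	for len(items) > size {
		// The three-index slice caps capacity so that appending to one
		// batch can never overwrite the start of the next one.
		out = append(out, items[:size:size])
		items = items[size:]
	}
	if len(items) > 0 {
		out = append(out, items)
	}
	return out
}
```

With 300 pending updates, `chunk(updates, maxOpsPerTxn)` yields batches of 128, 128, and 44 entries. Note that a guarded update usually needs a compare as well as a put per key, so the effective number of shards per batch may be lower than the operation limit.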

Why?
reported_shards is keyed by executor. That works for reporting the latest heartbeat, but it breaks down the moment a shard moves: the new owner can’t see the old owner’s smoothed load or timestamps, and the leader has to collect executor-specific parts just to reason about shard state. By giving each shard its own metrics key:

the data survives ownership changes. New executors and the leader can pick up where the previous owner left off
the leader can read NamespaceState and compute balancing or throttling decisions without looking for per-executor heartbeats
we can store both an EWMA (so short spikes hopefully won’t cause thrashing) and timestamps: last_update_time is used for the decay value (alpha) when applying the next sample, and last_move_time is what we’ll use for cooldowns before moving a shard again.
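
As a rough sketch of how the timestamps could drive the EWMA and the cooldown, the Go snippet below derives alpha from the time elapsed since the last update. Everything beyond the three field names is an assumption: the field types, the time constant `tau`, the formula `alpha = 1 - exp(-dt/tau)` (a common choice for irregularly spaced samples), and the helpers `ApplySample` and `CanMove`. The actual logic is left to the follow-up PR.

```go
package main

import (
	"math"
	"time"
)

// ShardMetrics mirrors the fields named in this PR; the types are assumed.
type ShardMetrics struct {
	SmoothedLoad   float64
	LastUpdateTime time.Time
	LastMoveTime   time.Time
}

// ApplySample folds one load sample into the EWMA (hypothetical helper).
// The weight of the new sample grows with the time since the last update:
// alpha = 1 - exp(-dt/tau), where tau is an assumed time constant.
func ApplySample(m ShardMetrics, load float64, now time.Time, tau time.Duration) ShardMetrics {
	if m.LastUpdateTime.IsZero() {
		// First sample: there is nothing to smooth against yet.
		m.SmoothedLoad = load
		m.LastUpdateTime = now
		return m
	}
	if !now.After(m.LastUpdateTime) {
		// Stale or out-of-order sample: keep the stored metrics as they are.
		return m
	}
	dt := now.Sub(m.LastUpdateTime)
	alpha := 1 - math.Exp(-dt.Seconds()/tau.Seconds())
	m.SmoothedLoad = alpha*load + (1-alpha)*m.SmoothedLoad
	m.LastUpdateTime = now
	return m
}

// CanMove reports whether the cooldown since the last move has elapsed
// (hypothetical helper). A shard that has never moved is always eligible.
func CanMove(m ShardMetrics, now time.Time, cooldown time.Duration) bool {
	return m.LastMoveTime.IsZero() || now.Sub(m.LastMoveTime) >= cooldown
}
```

With `tau` set to 30 seconds, a sample arriving 30 seconds after the previous one gets a weight of about 0.63, so a long gap between heartbeats lets the new sample dominate while closely spaced samples are smoothed more heavily.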

A follow-up pull request will wire heartbeats to update the metrics each time.

How did you test it?
Integration tests with etcd (added new test cases to ./service/sharddistributor/store/etcd/etcdstore_test.go)
go test ./service/sharddistributor/store/etcd/executorstore
Also tested it by logging values while running the ephemeral service (which simulates executors and shards)

Potential risks
Added pressure to etcd and extra read operations when preparing metric updates

Release notes
Shard distributor now persists shard metrics in etcd (smoothed load and timestamps) for future load balancing logic.

Documentation Changes

Signed-off-by: Andreas Holt <[email protected]>

… is being reassigned in AssignShard Signed-off-by: Andreas Holt <[email protected]>

…to not overload etcd's 128 max ops per txn Signed-off-by: Andreas Holt <[email protected]>

…s txn and retry monotonically Signed-off-by: Andreas Holt <[email protected]>

…ents Signed-off-by: Andreas Holt <[email protected]>

eleonoradgr · 2025-10-21T11:27:03Z

service/sharddistributor/store/etcd/etcdkeys/etcdkeys.go

+}
+
+func BuildShardKey(prefix string, namespace, shardID, keyType string) (string, error) {
+	if keyType != ShardAssignedKey && keyType != ShardMetricsKey {


where/when is this used?

eleonoradgr · 2025-10-21T11:27:58Z

service/sharddistributor/store/etcd/etcdkeys/etcdkeys.go

 	return parts[0], parts[1], nil
 }
+
+func BuildShardPrefix(prefix string, namespace string) string {


We need to have tests for BuildShardPrefix, BuildShardKey and ParseShardKey :)

eleonoradgr · 2025-10-21T11:32:50Z