Open
Labels
area/control-plane (Control plane related), effort/2m (Effort for issue is around 2 months), kind/bug (Bug), priority/2 (Priority; lower number equals higher priority)
Description
How to categorize this issue?
/area control-plane
/kind bug
/priority 1
What happened:
We hit a live scalability issue: because of invalid credentials, the etcd database filled up.
The machine-controller-manager kept updating MachineDeployments and MachineSets. The MachineDeployment status contains an entry for each Machine together with its lastError.
(issues-canary/issues/7190 internally)
What you expected to happen:
machine-controller-manager should adhere to controller best practices, such as exponential backoff and skipping no-op (status) updates when the status has not changed.
How to reproduce it (as minimally and precisely as possible):
- Use the virtual mcm provider with a local api-server and etcd to simulate credential failure for a large number of machines (> 1000).
- Check the size of the etcd db.
Anything else we need to know?: