-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flaky Migrate e2e tests: try explicitly killing processes before setting up new cluster #17394
Flaky Migrate e2e tests: try explicitly killing processes before setting up new cluster #17394
Conversation
…t fixes the flakiness seen by some residual processes, especially in the migrate tests Signed-off-by: Rohit Nayak <[email protected]>
Review ChecklistHello reviewers! 👋 Please follow this checklist when reviewing this Pull Request. General
Tests
Documentation
New flags
If a workflow is added or modified:
Backward compatibility
|
cmd := exec.Command("pkill", fmt.Sprintf(".*%s.*", bin)) | ||
err := cmd.Run() | ||
if err != nil { | ||
log.Infof("Error killing %s: %v", bin, err) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Log lines are for checking if we have residual processes. If this works we can remove the log lines asap.
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #17394 +/- ##
=======================================
Coverage 67.52% 67.53%
=======================================
Files 1581 1581
Lines 253948 253948
=======================================
+ Hits 171480 171499 +19
+ Misses 82468 82449 -19 ☔ View full report in Codecov by Sentry. |
This approach is not working. |
Description
There has been a lot of flakiness in the e2e VReplication Migrate tests. Each of the ones I checked had a process not starting because a previous one had not been stopped. This happened even for the first test in the CI workflow. So possibly this is happening from previous workflows in reused test VMs or there is a bug somewhere in our cluster management.
Trying explicitly
pkill
ing the processes a test cluster creates to see if we can work around this.Related Issue(s)
Checklist
Deployment Notes