[Chassis] reboot-cause is not as expected when reboot through sup (normal & abnormal) #118
Comments
@arista-nwolfe, @kenneth-arista, @arlakshm for vis
@Javier-Tan is there a sonic-mgmt test covering this that is failing because of the difference in reboot reason?
@arista-nwolfe These are some of the ones we're seeing fail
Looking at the driver code, the likely reason for the reboot cause listing is sonic/arista/utils/sonic_reboot.py, line 22 in 526fcf6.
Under the hood, this ends up running the reboot command on the LCs (https://github.com/aristanetworks/sonic/blob/master/arista/utils/rpc/api.py#L91). If we hadn't implemented our own platform-specific reboot handler, it looks like the code would have fallen back to the stock SONiC reboot. Tagging @Staphylo in case he knows the history on why we try to be more graceful about the supervisor reboot instead of using the SONiC default reboot.
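The fallback described above (use a platform-specific reboot handler when one is present, otherwise use the generic reboot) could be sketched roughly as follows. This is a minimal illustration only, not the Arista driver code or the actual SONiC reboot script: the function name `pick_reboot_command`, the handler file name, and the default command are hypothetical placeholders.

```python
import os
import tempfile


def pick_reboot_command(platform_handler, default_command):
    """Return the argument list to run for a reboot.

    platform_handler: path to a hypothetical platform-specific reboot script.
    default_command: hypothetical generic fallback, given as a list of arguments.
    """
    # Prefer the platform handler only if it exists and is executable.
    if os.path.isfile(platform_handler) and os.access(platform_handler, os.X_OK):
        return [platform_handler]
    # Otherwise fall back to the generic command (copied so callers can mutate it).
    return list(default_command)


if __name__ == "__main__":
    default = ["reboot"]  # placeholder for whatever the generic path would run
    with tempfile.TemporaryDirectory() as tmp:
        handler = os.path.join(tmp, "platform_reboot")  # hypothetical file name

        # No handler installed yet: the generic command is selected.
        print(pick_reboot_command(handler, default))

        # Install an executable handler: it now takes precedence.
        with open(handler, "w") as f:
            f.write("#!/bin/sh\n")
        os.chmod(handler, 0o755)
        print(pick_reboot_command(handler, default))
```

The point of the sketch is that whichever branch is taken determines what ends up recorded as the reboot cause, so a platform handler that reboots differently from the generic path can plausibly produce a different cause string.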
Thanks, this makes sense. It would be good if we could align the reboot cause with typical SONiC behaviour for testing and standardization processes, if possible.
@arista-nwolfe does Arista have a workaround for the reboot check failures?
We do a graceful reboot on the linecards because we were having issues with ext4 filesystem corruption on the linecards, which we believe was caused by ungraceful reboots.
Hi team,
When we reboot from the supervisor (both normally and abnormally), we aren't getting the expected reboot-cause in show reboot-cause / show reboot-cause history.
Normal reboot through sup:
Abnormal reboot through sup (memory exhaustion from nohup bash -c "sleep 5 && tail /dev/zero" &):