Fix signal handler race condition on metrics.write() #2893

alindima · 2022-02-10T12:00:59Z

Reason for This PR

FIxes the root cause of intermittent failure of test_custom_seccomp.py::test_failing_fiter
View commit messages.

Description of Changes

View commit messages.

This functionality can be added in rust-vmm.

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license.

PR Checklist

[Author TODO: Meet these criteria.]
[Reviewer TODO: Verify that these criteria are met. Request changes if not]

All commits in this PR are signed (git commit -s).
The issue which led to this PR has a clear conclusion.
This PR follows the solution outlined in the related issue.
The description of changes is clear and encompassing.
Any required documentation changes (code and docs) are included in this PR.
Any newly added unsafe code is properly documented.
Any API changes follow the Runbook for Firecracker API changes.
Any user-facing changes are mentioned in CHANGELOG.md.
All added/changed functionality is tested.

raduiliescu · 2022-02-10T13:32:23Z

For the second commit I doubt this is the best approach. I am hardly familiar with RUST, but adding acquire/release a lock, on operations done on the hotpath is not OK. Maybe you can take the second commit out, and we fix just the issue with intermittent test.
One other approach with metrics on multiple threads is to have that metric split per thread, and when dumping just do an add.

alindima · 2022-02-10T13:39:27Z

I am hardly familiar with RUST, but adding acquire/release a lock, on operations done on the hotpath is not OK.

There are no locks added. These are just atomic instruction flags that prevent the CPU from reordering loads of a value before stores on the same atomic.
On x86 this should have no effect because of the stronger memory model. On ARM it should just prevent the CPU from reordering the atomic instructions in ways that violate the load-after-store rule across different threads.

More details about this: https://doc.rust-lang.org/nomicon/atomics.html#hardware-reordering

serban300 · 2022-02-11T08:10:30Z

src/logger/src/metrics.rs

+        let snapshot = self.count();
+        let res = serializer.serialize_u64(snapshot as u64 - self.1.load(Ordering::Acquire) as u64);

        if res.is_ok() {
-            self.1.store(snapshot, Ordering::Relaxed);
+            self.1.store(snapshot, Ordering::Release);


I'm not sure if this is correct. If we can serialize from multiple threads, I think snapshot can be different on each one and the value that ends up being assigned to self.1 in the end (self.1.store(snapshot, Ordering::Release);) won't necessarily be the most recent one. So we can end up in the situation where self.0 < self.1.

A solution would be to have another atomic bool that is marking whether there is a serialization in progress, and if there is, don't do anything. Although this wouldn't guarantee the serialization of the latest values.

self.0 is incremented atomically from any thread. If we have Relaxed semantics, it may have different values on multiple threads when serialising.
If using Acquire semantics, like we do in self.count(), we can be sure that any writes to self.0 with Release semantics (like we do in add()) are going to be executed and visible before the load.

I believe that the scenario you are describing is possible prior to this PR, due to the Relaxed ordering.

A solution would be to have another atomic bool that is marking whether there is a serialization in progress, and if there is, don't do anything. Although this wouldn't guarantee the serialization of the latest values.

This is essentially simulating a mutex

anyway, I believe that the usage of the metrics system in signal handlers needs a redesign with thread-safety in mind.
Since this second commit is pretty controversial and may be introducing some performance degradation on ARM, I will remove it from this PR to unblock it and I will open a new issue for it

Would something like the following be possible ?

+-----------------------------------+-----------------------------------+ | Thread 1 | Thread 2 | +-----------------------------------+-----------------------------------+ | let snapshot = self.count(); -> 5 | ... | +-----------------------------------+-----------------------------------+ | ... | inc metric -> 6 | +-----------------------------------+-----------------------------------+ | ... | let snapshot = self.count(); -> 6 | +-----------------------------------+-----------------------------------+ | ... | store snapshot in self.1 -> 6 | +-----------------------------------+-----------------------------------+ | store snapshot in self.1 -> 5 | | +-----------------------------------+-----------------------------------+

Actually this is not possible since we only write the metrics from the VMM thread or the signal handler.
In the signal handler we now only use StoreMetrics so this would not be possible

If the type is not thread safe than Sync should not be implemented for it. Otherwise we're breaking the Rust safety principles and Rust cannot protect and benefit of the "fearless concurrency" anymore. The problem seems to be that we're doing more things than we should be doing in a context of the signal handler, so maybe a solution would be to simplify the signal handler. This will come at the expense of maybe metrics being missed, but it is arguably better than opening potentially other safety problems.

We have discussed removing logging and metrics systems from the signal handlers.
One idea is to use specialised files for dumping abrupt exit information from signal and panic handlers (much like a core dump).

This is a separate feature and will be worked on in the future. It requires careful thought and more investment.

This PR addresses the issue surgically, since it is creating much trouble for us with intermittent test failures.
We are not introducing problems with this PR, we are applying a patch over the known occurrence of this problem.

As a patch I think it's ok. Do we have an issue for doing a proper fix ?

Created an issue for it: #2899

It doesn't seem sufficient to me because if I had to add a new metric I can imagine that it would be very easy to miss these comments. I wouldn't check the comments for the Metrics.write() or serialize() methods. It would be the same even if the serialize() method would be unsafe.

I realised that the signal handler generation macro is hardcoded to use the store function, which is only defined for StoreMetrics. It would require a separate function in order to add an IncMetric to a signal handler, like it is for SIGPIPE. This should be even more evident in code reviews

The METRICS.write() method operates on the false assumption that it is only being called from one thread (VMM). This is not true since it is called from the signal handlers as well (which may be executed on any thread). The race condition appears due to the fact that metrics state is mutated during serialization for SharedIncMetrics (old value gets assigned the new value after the difference is serialized). The METRICS.write() method only holds the lock when writing to the file, but there is no lock held while serialising the metrics to JSON. VMM THREAD (1/MINUTE) VCPU THREAD (SIGNAL HANDLER) new_val=1 dif=new_val-old_val(=1) old_val=new_val (=1) dif=new_val-old_val (=0) write(dif) (=0) exit() Because of the exit(), the VMM thread no longer gets a chance to write the metric. We now use SharedStoreMetric for deadly signal metrics, which does not have this type of race condition because it is not mutated on serialization. It also does not make sense to have a SharedIncMetric for deadly signal metrics, which can only ever be 0 or 1. This was the root-cause of the intermittent failure in test_custom_seccomp.py::test_failing_fiter Signed-off-by: alindima <[email protected]>

alindima self-assigned this Feb 10, 2022

alindima force-pushed the fix_signal_handler branch from e2dcfee to b4b5513 Compare February 10, 2022 12:01

alindima requested review from alsrdn and georgepisaltu February 10, 2022 12:02

serban300 reviewed Feb 11, 2022

View reviewed changes

alindima force-pushed the fix_signal_handler branch from b4b5513 to ee8c9d6 Compare February 11, 2022 09:14

alindima force-pushed the fix_signal_handler branch from ee8c9d6 to f7b7813 Compare February 11, 2022 09:42

alindima mentioned this pull request Feb 14, 2022

Redesign metrics system with thread-safety in mind #2899

Open

andreeaflorescu mentioned this pull request Feb 14, 2022

Use component defined metrics #1759

Open

serban300 approved these changes Feb 14, 2022

View reviewed changes

alsrdn approved these changes Feb 14, 2022

View reviewed changes

alindima merged commit 8f9ec61 into firecracker-microvm:main Feb 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix signal handler race condition on metrics.write() #2893

Fix signal handler race condition on metrics.write() #2893

Uh oh!

alindima commented Feb 10, 2022 •

edited

Loading

Uh oh!

raduiliescu commented Feb 10, 2022

Uh oh!

alindima commented Feb 10, 2022 •

edited

Loading

Uh oh!

serban300 Feb 11, 2022

Uh oh!

alindima Feb 11, 2022

Uh oh!

alindima Feb 11, 2022

Uh oh!

serban300 Feb 11, 2022

Uh oh!

alindima Feb 11, 2022

Uh oh!

andreeaflorescu Feb 11, 2022

Uh oh!

alindima Feb 11, 2022

Uh oh!

serban300 Feb 11, 2022

Uh oh!

alindima Feb 14, 2022

Uh oh!

alindima Feb 14, 2022

Uh oh!

Uh oh!

Fix signal handler race condition on metrics.write() #2893

Fix signal handler race condition on metrics.write() #2893

Uh oh!

Conversation

alindima commented Feb 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reason for This PR

Description of Changes

License Acceptance

PR Checklist

Uh oh!

raduiliescu commented Feb 10, 2022

Uh oh!

alindima commented Feb 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alindima commented Feb 10, 2022 •

edited

Loading

alindima commented Feb 10, 2022 •

edited

Loading