[illumos-utils] Use tokio::process::Command, not std::process::Command #8141

smklein · 2025-05-12T23:53:11Z

Many commands in illumos-utils bottomed out in a call to "std::process::Command", eventually
exec-ing a process. These commands were often used from asynchronous code, even though they
could block the calling thread.

This PR converts many, but not all, of these calls to tokio::process::Command, which waits on
the child process asynchronously instead of blocking the calling thread.

smklein · 2025-05-12T23:55:52Z

sled-diagnostics/src/logs.rs

-                "error" => ?e,
-            );
+        let DiagnosticsSnapshot { log, snapshot, .. } = self;
+        tokio::task::spawn({


To confirm: is it critical that the snapshot blocks the drop method? Async drop does not exist in Rust.

This API, as implemented, spawns the background task but does not wait for it to complete.

smklein · 2025-05-13T16:48:17Z

illumos-utils/src/dladm.rs

@@ -6,12 +6,13 @@

 use crate::link::{Link, LinkKind};
 use crate::zone::IPADM;


The changes in this file - and others within illumos-utils/src/... - are representative of "what actually is changing in this PR":

- std::process::Command -> tokio::process::Command
- This makes the function async.
- The rest of the changes are fall-out from this, identifying blocks of code as asynchronous.
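
As a rough illustration of that fall-out, here is a minimal std-only sketch. All names in it (`run_dladm`, `show_link`, `link_exists`, `block_on`) are hypothetical and not taken from illumos-utils, and the leaf function only formats a string where the real code would await a tokio::process::Command. Once the leaf is async, each caller has to `.await` it and so becomes async as well.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// Hypothetical leaf function: stands in for code that used to call
// std::process::Command and now awaits tokio::process::Command instead.
async fn run_dladm(args: &[&str]) -> Result<String, String> {
    // A real implementation would spawn the process and await its output.
    Ok(format!("dladm {}", args.join(" ")))
}

// The caller must `.await` the leaf, so it becomes async too.
async fn show_link(name: &str) -> Result<String, String> {
    run_dladm(&["show-link", name]).await
}

// The same change propagates one level further up.
async fn link_exists(name: &str) -> bool {
    show_link(name).await.is_ok()
}

// Waker that does nothing; enough for futures that never really wait.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

// Tiny busy-polling executor so the sketch runs without tokio.
fn block_on<F: Future>(fut: F) -> F::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
        std::thread::yield_now();
    }
}
```

Calling `block_on(link_exists("net0"))` drives the whole chain; in the real code base the tokio runtime plays that role.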

papertigers · 2025-05-13T18:08:03Z

sled-diagnostics/src/logs.rs

            // Free all of the log_snapshots
            drop(log_snapshots);

-            let snapshots = get_sled_diagnostics_snapshots(zfs_filesystem);
+            let snapshots =
+                get_sled_diagnostics_snapshots(zfs_filesystem).await;
            assert!(snapshots.is_empty(), "no snapshots left behind");


I don't think it's critical that the drop method is synchronous. However, it does introduce a race condition in the test here; we may be able to work around it.

I've changed the behavior here to use a more explicit drop, and to log errors if this wasn't handled. I also tried to refactor above to make this case less likely, by cleaning up snapshots at a higher level in the call to get_zone_logs.
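
A minimal sketch of the "explicit destroy, complain in Drop" pattern, under stated assumptions: the `Snapshot` type, its fields, and `block_on` are hypothetical stand-ins rather than the code in sled-diagnostics, and a counter takes the place of the real error log so the behavior can be checked.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// Hypothetical stand-in for a snapshot handle.
struct Snapshot {
    name: String,
    destroyed: bool,
    // Counts handles dropped without an explicit destroy; real code
    // would log an error instead.
    leaked: Arc<AtomicUsize>,
}

impl Snapshot {
    fn new(name: &str, leaked: Arc<AtomicUsize>) -> Self {
        Snapshot { name: name.to_string(), destroyed: false, leaked }
    }

    // Explicit cleanup that the caller awaits before dropping the handle.
    async fn destroy(&mut self) -> Result<(), String> {
        // Real code would await the snapshot-destroy command here.
        self.destroyed = true;
        Ok(())
    }
}

impl Drop for Snapshot {
    fn drop(&mut self) {
        if !self.destroyed {
            // Drop cannot await, so it only reports the missed cleanup.
            self.leaked.fetch_add(1, Ordering::SeqCst);
            eprintln!("snapshot {} dropped without destroy()", self.name);
        }
    }
}

// Waker that does nothing; enough for futures that never really wait.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

// Tiny busy-polling executor so the sketch runs without tokio.
fn block_on<F: Future>(fut: F) -> F::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
        std::thread::yield_now();
    }
}
```

Because the caller awaits `destroy()` itself, the cleanup has finished by the time the next statement runs, which is what removes the race in a test that checks for leftover snapshots.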

jgallagher · 2025-05-14T19:13:11Z

illumos-utils/src/dladm.rs

@@ -175,6 +176,7 @@ pub struct Dladm(());
 /// Describes the API for interfacing with Data links.
 ///
 /// This is a trait so that it can be faked out for tests.
+#[async_trait::async_trait]


This is fine, but I'm curious (and maybe this is a "Friday afternoon Rust chat" question more than a PR question) - is there a reason to prefer async_trait over returning impl Future<Output = ...> + Send now that Rust supports that?

I think this is slightly more painful for default methods - I see elsewhere we have the pattern of:

    trait Api {
        fn foobar(&self) -> impl Future<Output = Baz> + Send;
    }

    impl Api for MyStruct {
        async fn foobar(&self) -> Baz { ... }
    }

But if I try to use a default impl of:

    trait Api {
        fn foobar(&self) -> impl Future<Output = Baz> + Send {
            // Do anything async
        }
    }

I'll get the error that ".await" is only allowed inside async functions and blocks, and the body of foobar is neither.

Might be able to get away with

    trait Api {
        fn foobar(&self) -> impl Future<Output = Baz> + Send {
            async {
                // Do anything async
            }
        }
    }

but that might also be kind of a pain if you're doing anything with self. Anyway - it's a good point; async_trait does seem to be smoother still, at least for this case.
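
For reference, a std-only sketch of the default-method variant discussed above. The names (`Api`, `base`, `doubled`, `MyStruct`, `block_on`) are hypothetical. The `Sync` supertrait is an assumption added here so that a `Send` future may hold `&self`, which is one concrete form of the "pain" when the default body uses self.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// Hypothetical trait. `Sync` is required so that `&self` can be captured
// by a future that must be `Send`.
trait Api: Sync {
    // Required method: implementors may write this as a plain `async fn`.
    fn base(&self) -> impl Future<Output = u32> + Send;

    // Default method: the body itself is not async, so it returns an
    // `async` block, and `.await` is used inside that block.
    fn doubled(&self) -> impl Future<Output = u32> + Send {
        async move { self.base().await * 2 }
    }
}

struct MyStruct(u32);

impl Api for MyStruct {
    async fn base(&self) -> u32 {
        self.0
    }
}

// Waker that does nothing; enough for futures that never really wait.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

// Tiny busy-polling executor so the sketch runs without tokio.
fn block_on<F: Future>(fut: F) -> F::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
        std::thread::yield_now();
    }
}
```

With async_trait the default method could be written as an ordinary `async fn` body, which is the smoother experience the thread refers to; the trade-off is that async_trait boxes the returned future.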

jgallagher · 2025-05-14T19:17:05Z

illumos-utils/src/zone.rs

        zone: Option<&'a str>,
        addrobj: &AddrObject,
        addrtype: AddressRequest,
    ) -> Result<IpNetwork, EnsureAddressError> {
-        |zone, addrobj, addrtype| -> Result<IpNetwork, anyhow::Error> {
-            match Self::get_address_impl(zone, addrobj) {
+        #[allow(clippy::redundant_closure_call)]


Do we need this redundant closure? (If so maybe worth a short comment explaining why)

I'll avoid it by pulling this out to a new method instead. We're using it to make translation of the error type more convenient.
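
A small sketch of the two shapes being compared, using hypothetical names (`SumError`, `parse_sum_closure`, `parse_sum_impl`, `parse_sum`) and integer parsing in place of the address lookup. The first version uses an immediately invoked closure so that `?` can target a convenient inner error type; the second moves that body into a helper, which gives the same error translation without the closure.

```rust
use std::num::ParseIntError;

// Hypothetical public error type, standing in for a type like
// EnsureAddressError.
#[derive(Debug, PartialEq)]
struct SumError {
    reason: String,
}

// Before: an immediately invoked closure gives `?` a convenient inner
// error type, and the result is translated once at the end. Clippy
// reports this shape as `redundant_closure_call`.
fn parse_sum_closure(a: &str, b: &str) -> Result<i64, SumError> {
    (|| -> Result<i64, ParseIntError> {
        Ok(a.parse::<i64>()? + b.parse::<i64>()?)
    })()
    .map_err(|err| SumError { reason: err.to_string() })
}

// After: the closure body becomes a helper with the inner error type.
fn parse_sum_impl(a: &str, b: &str) -> Result<i64, ParseIntError> {
    Ok(a.parse::<i64>()? + b.parse::<i64>()?)
}

// The public function translates the helper's error exactly once.
fn parse_sum(a: &str, b: &str) -> Result<i64, SumError> {
    parse_sum_impl(a, b).map_err(|err| SumError { reason: err.to_string() })
}
```

Both versions return the same results; the helper form also works unchanged when the body needs to be async, where an immediately invoked closure is more awkward.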

jgallagher · 2025-05-14T19:22:02Z

sled-diagnostics/src/logs.rs

+
+    async fn destroy(&mut self) -> Result<(), LogError> {
+        if !self.destroyed {
+            self.destroyed = true;


Should this be set after Zfs::destroy_snapshot() returns Ok(_)? (As written the caller couldn't retry destruction, although maybe that doesn't matter at the moment.)

Sure, I'll move it down. I don't think we're trying to call this again from any pub codepaths, but I don't see a problem with making it retryable either.
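
The ordering question can be sketched as follows. This is a hedged, synchronous stand-in (the method in the PR is async): the `Snapshot` type, the `failures_left` field, and the `destroy_snapshot` method here are hypothetical and only simulate a destroy call that can fail.

```rust
// Hypothetical stand-in for the snapshot handle.
struct Snapshot {
    destroyed: bool,
    // Number of upcoming destroy attempts that fail, to simulate errors.
    failures_left: u32,
}

impl Snapshot {
    // Stand-in for the fallible destroy call.
    fn destroy_snapshot(&mut self) -> Result<(), String> {
        if self.failures_left > 0 {
            self.failures_left -= 1;
            return Err("destroy failed".to_string());
        }
        Ok(())
    }

    fn destroy(&mut self) -> Result<(), String> {
        if !self.destroyed {
            self.destroy_snapshot()?;
            // Mark as destroyed only after the call succeeded, so that a
            // failed attempt leaves the handle retryable.
            self.destroyed = true;
        }
        Ok(())
    }
}
```

If the flag were set before the fallible call, a failed first attempt would turn every later `destroy()` into a silent no-op.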

smklein added 2 commits May 12, 2025 15:43

Make dladm commands async

9c19ebc

Make more illumos commands async

31cad18

smklein changed the title ~~Async illumos utils~~ [illumos-utils] Use tokio::process::Commands instead of std::process::Command May 12, 2025

smklein changed the title ~~[illumos-utils] Use tokio::process::Commands instead of std::process::Command~~ [illumos-utils] Use tokio::process::Command, not std::process::Command May 12, 2025

smklein commented May 12, 2025

illumos

e546327

smklein commented May 13, 2025

smklein marked this pull request as ready for review May 13, 2025 16:52

hawkw approved these changes May 13, 2025

papertigers reviewed May 13, 2025

smklein added 4 commits May 13, 2025 11:14

Merge branch 'main' into async-illumos-utils

dcde221

Better async drop

3739d5e

Merge branch 'main' into async-illumos-utils

60e6b07

Merge branch 'main' into async-illumos-utils

cd8363c

smklein commented May 14, 2025

sled-hardware/src/illumos/partitions.rs (resolved)

sled-agent/config-reconciler/src/dataset_serialization_task.rs (resolved)

jgallagher reviewed May 14, 2025

feedback

d776aa8

smklein merged commit 19c56cf into main May 16, 2025
16 checks passed

smklein deleted the async-illumos-utils branch May 16, 2025 19:14
