Skip to content

Commit e894a03

Browse files
authored
perf: Use Hashbrown for array_distinct (#20538)
## Which issue does this PR close? N/A ## Rationale for this change #20364 recently optimized `array_distinct` to use batched row conversion. As part of that PR, `std::HashSet` was used. This PR just replaces `std::HashSet` with `hashbrown::HashSet`, which measurably improves performance. ## What changes are included in this PR? ## Are these changes tested? Yes. ## Are there any user-facing changes? No.
1 parent e684994 commit e894a03

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

datafusion/functions-nested/src/set_ops.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,8 +34,8 @@ use datafusion_expr::{
3434
ColumnarValue, Documentation, ScalarUDFImpl, Signature, Volatility,
3535
};
3636
use datafusion_macros::user_doc;
37+
use hashbrown::HashSet;
3738
use std::any::Any;
38-
use std::collections::HashSet;
3939
use std::fmt::{Display, Formatter};
4040
use std::sync::Arc;
4141

0 commit comments

Comments
 (0)