Implement avx512 compressstore intrinsics #1273
Conversation
@@ -38007,6 +38127,34 @@ extern "C" {
    #[link_name = "llvm.x86.avx512.mask.compress.pd.128"]
    fn vcompresspd128(a: f64x2, src: f64x2, mask: u8) -> f64x2;

    #[link_name = "llvm.x86.avx512.mask.compress.store.d.512"]
    fn vcompressd_mem(mem: *mut i8, data: i32x16, mask: u16);
Is there a better naming convention for these intrinsics? The asm mnemonic is the same for mem and reg operations.
The recommended way to figure this out is to look at what IR clang generates: https://godbolt.org/z/nvaxM4MGh
In this case it is calling the `llvm.masked.compressstore.v2f64` intrinsic, which unfortunately can't be called directly from Rust because it uses an `i1` vector, which can't be represented with Rust types.
This is the reason why #1254 implemented some of the AVX512 intrinsics using inline assembly instead. I think this is the right approach in this case as well.
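For reference, a minimal sketch of what such an inline-assembly implementation might look like, assuming Rust's `asm!` with the `kreg` and `zmm_reg` register classes (in the spirit of #1254, not the code in this PR):

```rust
use core::arch::asm;
use core::arch::x86_64::{__m512i, __mmask16};

#[inline]
#[target_feature(enable = "avx512f")]
pub unsafe fn _mm512_mask_compressstoreu_epi32(base_addr: *mut u8, k: __mmask16, a: __m512i) {
    // Memory-destination form of vpcompressd: the i32 lanes of `a` selected by
    // mask `k` are stored contiguously at `base_addr` (no alignment required).
    asm!(
        "vpcompressd [{base}]{{{mask}}}, {src}",
        base = in(reg) base_addr,
        mask = in(kreg) k,
        src = in(zmm_reg) a,
        options(nostack, preserves_flags),
    );
}
```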
These LLVM intrinsics seem to work, though, and I saw them used with plain integer masks in LLVM test cases: https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/X86/avx512-intrinsics-upgrade.ll#L8938 (I'm not sure whether there is official documentation for them, though). I wanted to avoid asm unless absolutely necessary.
Test failures in CI seem unrelated to the changes in this PR.
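For context, the semantics of the memory form with a plain integer mask can be described with a small scalar model (a sketch for illustration, not code from this PR):

```rust
/// Scalar model of a masked compress-store over sixteen i32 lanes: the lanes of
/// `data` whose mask bit is set are written contiguously starting at `dst`
/// (lowest lane first); lanes with a clear mask bit are skipped entirely.
unsafe fn compress_store_model(dst: *mut i32, data: &[i32; 16], mask: u16) {
    let mut out = dst;
    for (i, &lane) in data.iter().enumerate() {
        if mask & (1u16 << i) != 0 {
            out.write_unaligned(lane);
            out = out.add(1);
        }
    }
}
```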
#[inline]
#[target_feature(enable = "avx512f")]
#[cfg_attr(test, assert_instr(vpcompressd))]
pub unsafe fn _mm512_mask_compressstoreu_epi32(base_addr: *mut i8, k: __mmask16, a: __m512i) {
Intel's intrinsic guide uses a `void*` for `base_addr`, while the LLVM intrinsics use an `i8*`. Using a pointer of the correct datatype would be more ergonomic, but I'm not sure whether that might prevent using the intrinsics for actually unaligned data.
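To illustrate the ergonomics point, a hypothetical usage example assuming the `*mut i8` signature from the excerpt above (`append_positive` is just a made-up helper name):

```rust
use std::arch::x86_64::*;

// Compress the strictly positive lanes of a 16-lane i32 block onto the end of `dst`.
#[target_feature(enable = "avx512f")]
unsafe fn append_positive(src: &[i32; 16], dst: &mut Vec<i32>) {
    let v = _mm512_loadu_epi32(src.as_ptr());
    // Mask of lanes to keep: lanes whose value is greater than zero.
    let k = _mm512_cmpgt_epi32_mask(v, _mm512_setzero_si512());
    let n = k.count_ones() as usize;
    dst.reserve(n);
    // The destination has to be cast to the pointer type of the signature under
    // discussion (`*mut i8` in this PR); the store itself is unaligned, so any
    // byte address is acceptable.
    _mm512_mask_compressstoreu_epi32(dst.as_mut_ptr().add(dst.len()) as *mut i8, k, v);
    dst.set_len(dst.len() + n);
}
```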
The convention here is to use `*mut u8` where C uses void pointers. LLVM's `i8` doesn't mean anything, since LLVM IR types don't have signs: LLVM's `i8` is used for both Rust's `u8` and `i8`.
I missed that convention when implementing the masked load/store intrinsics, and there are several more intrinsics that don't already follow it. I can take a look at adjusting the `stdarch-verify` test to catch this and change the existing type differences. Since AVX512 is still unstable, that should be possible, I guess.
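To illustrate the kind of check meant here, a hypothetical helper in the spirit of such a `stdarch-verify` adjustment (not the tool's actual API; the real test compares parsed signatures from Intel's data files rather than raw strings):

```rust
// Hypothetical check: given the parameter type from Intel's intrinsic data and
// the corresponding Rust parameter type, flag `void*` parameters that are not
// exposed as `*mut u8` / `*const u8` on the Rust side.
fn void_ptr_follows_convention(intel_ty: &str, rust_ty: &str) -> bool {
    match intel_ty {
        "void*" => rust_ty == "*mut u8",
        "void const*" | "const void*" => rust_ty == "*const u8",
        _ => true, // non-void-pointer parameters are checked elsewhere
    }
}
```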
Implement avx512f compressstore intrinsics.