mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics #1067

minybot · 2021-03-09T14:03:33Z

f16c: _mm256_cvtps_ph; mm_cvtps_ph

rust-highfive · 2021-03-09T14:03:37Z

(rust-highfive has picked a reviewer for you, use r? to override)

minybot · 2021-03-09T14:23:41Z

At f16c.rs
_mm256_cvtps_ph(a: __m256, imm_rounding: i32). The current imm_rounding is set to 0-7.
I checked the Clang, it accepts 0-255.
Any suggestion?

Amanieu · 2021-03-09T15:47:55Z

The instruction definition here says that bits 3 to 7 are ignored by the CPU. I think to be safe we should only allow imm3, we can always relax it later if necessary.

minybot · 2021-03-09T16:14:39Z

The instruction definition here says that bits 3 to 7 are ignored by the CPU. I think to be safe we should only allow imm3, we can always relax it later if necessary.
Ok. I will finish f16c.

lqd · 2021-03-09T16:20:34Z

I'm still wondering about the fact that we "* 8" the immediates that are supposed to be in bytes and <= 16 for the shifts

Amanieu · 2021-03-09T16:32:00Z

It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.

minybot · 2021-03-09T16:52:35Z

It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.
Ok. I will modify it to similar to _mm_slli_si128.

minybot · 2021-03-09T18:57:07Z

It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.

It seems mm256_slli_si256 = mm256_bslli_epi128?

Amanieu · 2021-03-09T22:20:30Z

Yes, see #1012.

Amanieu · 2021-03-09T22:22:59Z

crates/core_arch/src/x86/f16c.rs

@@ -4,7 +4,7 @@

 use crate::{
    core_arch::{simd::*, x86::*},
-    hint::unreachable_unchecked,
+    //    hint::unreachable_unchecked,


Deleted commented code.

Amanieu · 2021-03-09T22:23:28Z

crates/core_arch/src/x86/avx2.rs

-    }
-    transmute(constify_imm8!(imm8 * 8, call))
+    let r = vpslldq(a, IMM8 * 8);
+    transmute(r)


You can just call _mm256_bslli_epi128 here.

Amanieu · 2021-03-09T22:23:40Z

crates/core_arch/src/x86/avx2.rs

-    }
-    transmute(constify_imm8!(imm8 * 8, call))
+    let r = vpsrldq(a, IMM8 * 8);
+    transmute(r)


You can just call _mm256_bsrli_epi128 here.

minybot · 2021-03-09T22:24:11Z

Yes, see #1012.

Thanks. I think my bsrli_epi128 and bslli_epi128 having problems. I need to check them first.

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128

77e893a

rust-highfive assigned Amanieu Mar 9, 2021

_mm256_cvtps_ph; mm_cvtps_ph

6d83ff5

lqd mentioned this pull request Mar 9, 2021

Convert the last avx512f and avx512vpclmulqdq intrinsics #1068

Merged

fix mm256_bslli_epi128, mm256_bsrli_epi128

7dc9e91

Amanieu reviewed Mar 9, 2021

View reviewed changes

fix mm256_srli_si256, mm256_slli_si256, mm512_bsrli_epi128

a1952a0

Amanieu merged commit 3559569 into rust-lang:master Mar 10, 2021

minybot deleted the avx2 branch March 10, 2021 14:29

marcgalois mentioned this pull request May 18, 2021

Regression on nightly in AVX2 byte shift intrinsics rust-lang/rust#85446

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics #1067

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics #1067

minybot commented Mar 9, 2021 •

edited

Loading

rust-highfive commented Mar 9, 2021

minybot commented Mar 9, 2021

Amanieu commented Mar 9, 2021

minybot commented Mar 9, 2021

lqd commented Mar 9, 2021

Amanieu commented Mar 9, 2021

minybot commented Mar 9, 2021

minybot commented Mar 9, 2021

Amanieu commented Mar 9, 2021

Amanieu Mar 9, 2021

Amanieu Mar 9, 2021

Amanieu Mar 9, 2021

minybot commented Mar 9, 2021

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics #1067

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics #1067

Conversation

minybot commented Mar 9, 2021 • edited Loading

rust-highfive commented Mar 9, 2021

minybot commented Mar 9, 2021

Amanieu commented Mar 9, 2021

minybot commented Mar 9, 2021

lqd commented Mar 9, 2021

Amanieu commented Mar 9, 2021

minybot commented Mar 9, 2021

minybot commented Mar 9, 2021

Amanieu commented Mar 9, 2021

Amanieu Mar 9, 2021

Choose a reason for hiding this comment

Amanieu Mar 9, 2021

Choose a reason for hiding this comment

Amanieu Mar 9, 2021

Choose a reason for hiding this comment

minybot commented Mar 9, 2021

minybot commented Mar 9, 2021 •

edited

Loading