74 Commits

Author SHA1 Message Date
ziyizhang-1
50cd4ef0c5 initial commit to enable amx
AMX Intrinsics:

amx-tile:
  - _tile_loadconfig
  - _tile_storeconfig
  - _tile_loadd
  - _tile_release
  - _tile_stored
  - _tile_stream_loadd
  - _tile_zero
amx-int8:
  - _tile_dpbssd
  - _tile_dpbsud
  - _tile_dpbusd
  - _tile_dpbuud
amx-bf16:
  - _tile_dpbf16ps
amx-fp16
  - _tile_dpfp16ps
amx-complex
  - _tile_cmmimfp16ps
  - _tile_cmmrlfp16ps
2024-08-03 19:02:09 +01:00
sayantn
4a13560ede Update Intrinsics List to v3.6.9
Add `#[inline]` to avx512ifma intrinsics
Fix the test equality.
Remove the stability attributes in simd types and test functions
2024-07-26 12:20:06 +01:00
sayantn
3cf2b7d74f AVX512FP16 Part 9: Remaining avx512fp16 and avxneconvert 2024-07-26 12:20:06 +01:00
sayantn
c878b773d5 AVX512FP16 Part 0: Types 2024-07-26 12:20:06 +01:00
sayantn
f101974941 Added verification for doc comments 2024-07-08 00:32:43 +02:00
sayantn
c862e4e487 Added a bf16 type 2024-07-06 11:00:34 +02:00
sayantn
3de8e86491 Implemented the missing AVX512BF16 intrinsics 2024-07-06 11:00:34 +02:00
sayantn
f22fab559e Implemented VEX versions
Modified stdarch-test to accept VEX versions
2024-07-06 11:00:34 +02:00
sayantn
ed1df99f03 Added support for AMD verification
Added a custom cpuid file for sde, which enables SSE4a, XOP, TBM and VP2INTERSECT. Fixed `xsave` tests
2024-06-30 21:45:56 +02:00
sayantn
818df2f7d0 Some fixes as asked by @Amanieu 2024-06-29 19:16:48 +02:00
sayantn
fa22a9aeda Add the missing BMI1, SSE2, SSE4.1 and AVX2 intrinsics 2024-06-29 19:16:48 +02:00
sayantn
d65d1a8ae6 Fixed some more intrinsics
Added some tests, Fixed incorrect target-features, and verification code for target-features. Removed all MMX support from verification.
2024-06-29 19:16:48 +02:00
sayantn
ad7cf91833 Fixed many intrinsics
fixed reduce-add and reduce-mul. and load/store of mask32 and mask64. added preserves-flags to mov asm. fixed the missing list. fixed `_mm_loadu_si64`. Added `assert_instr`
2024-06-29 19:16:48 +02:00
sayantn
d26d3a7481 Update Intrinsics list
Updated the intrinsics list from version 3.4 to 3.6.8. Added a missing-x86.md file to track progress.
2024-06-29 19:16:48 +02:00
sayantn
52f8b0c1a9 AVX512DQ Part 5: Range. Fixed intrinsic verification. 2024-06-18 19:13:13 +02:00
Eduardo Sánchez Muñoz
ff7e6c7ac6 Implement ARM __ssat and __usat functions 2024-04-23 15:58:46 +02:00
Eduardo Sánchez Muñoz
40237e7f83 Use char constants for single-character patterns 2023-11-18 20:28:46 -08:00
Eduardo Sánchez Muñoz
57479f8c9b Remove unneeded borrows 2023-11-18 20:28:46 -08:00
Amanieu d'Antras
ea27e5cfcb Fix various compilation errors 2023-10-29 20:39:50 +01:00
Eduardo Sánchez Muñoz
6698a597c0 Bump serde-xml-rs to 0.6 2023-10-10 14:47:43 +01:00
Eduardo Sánchez Muñoz
690cd51e5a Bump syn to 2.0
Also ensure that `assert_instr` attribute has been correctly parsed in stdarch-verify.
2023-10-10 14:47:43 +01:00
Josh Triplett
87c70444d6 Remove ud2 intrinsic (in favor of asm! or abort as needed) 2023-05-25 23:30:24 +02:00
Adam Gemmell
ba671a06c1 Use arm_intrinsics.json in stdarch-verify 2023-05-15 17:34:11 +02:00
bwmf2
1c18225f32 Fix typo 2023-02-18 20:02:17 +01:00
Amanieu d'Antras
c80d9794e0
Don't require AVX512 for 128/256-bit GFNI & VPCLMULQDQ intrinsics (#1349) 2022-10-30 01:56:06 +01:00
Amanieu d'Antras
55f9fcda26
Don't require AVX512 for 256-bit VAES intrinsics (#1348) 2022-10-27 19:51:16 +01:00
Yuri Astrakhan
81c221f058
Edition 2021, apply clippy::uninlined_format_args fix (#1339) 2022-10-25 20:17:23 +01:00
Amanieu d'Antras
ca1f7cc1a6
Add missing vtst_p16 and vtstq_p16 intrinsics (#1257) 2021-11-20 20:51:37 +00:00
Sparrow Li
7c3bd04537
complete armv8 instructions (#1256) 2021-11-19 01:24:36 +00:00
Sparrow Li
be5e1be224
Add remaining insturctions (#1250)
* add vmmla vusmmla vsm4e vsm3 vrax1 vxar vsha512 vbcax veor3 neon instructions

* update runtime feature detect

* correct tests

* add `vrnd32x` `vrnd64x`

* add MISSING.md
2021-11-10 15:19:59 +00:00
Sparrow Li
9df48f1e57
Complete the remaining neon instructions (#1230) 2021-10-21 10:52:05 +01:00
Sparrow Li
328553ef64
Complete vld1 instructions with some corrections (#1216) 2021-09-18 14:13:24 +01:00
Sparrow Li
4baf95fddd
add vldx neon instructions (#1200) 2021-08-24 19:51:30 +01:00
Nils Hasenbanck
3ecc56b329
Add vst1_* neon intrinsics. (#1171) 2021-05-27 07:40:45 +01:00
SparrowLii
911ace84b2 Add vqrdmulh, vqrdmlah, vqrdmlsh neon instructions 2021-05-06 15:44:54 +01:00
scottmcm
54a2d8b82a
Remove #![feature(try_trait)] from a test (#1142)
I'm working on `try_trait_v2` which will break this, so I'm going
around removing uses from the rustc tree where I can.
2021-04-26 00:45:20 +01:00
Sebastian Thiel
43126c3f65
[DRAFT] intrinsics for all architectures appear in rustdoc (#1104) 2021-04-17 13:46:33 +01:00
Amanieu d'Antras
6405058a6f
Don't import from core (#1115)
This breaks when core_arch is included as a module in core.
2021-04-13 04:31:37 +01:00
Joshua Nelson
7bab2c0695
Deny 2018 idiom lints (#1108)
This lint is allow by default, which is why this wasn't spotted earlier.
It's denied by rust-lang/rust, so it's good to warn about it here so it
can be fixed more quickly.
2021-04-07 05:46:39 +01:00
Sparrow Li
64f84788c9
add vreinterpret neon instructions (#1101) 2021-03-30 06:51:01 +01:00
surechen
7947cb8bac
add neon instruction vmov_n_* and vmovq_n_* (#1085) 2021-03-17 09:34:49 +00:00
Amanieu d'Antras
3fb437b867
Add stricter validation of const arguments on x86 intrinsics (#1025) 2021-02-28 18:07:42 +00:00
Rémy Rakic
d6a22093aa
Convert SSE 4.2 intrinsics to const generics (#1023) 2021-02-28 17:26:55 +00:00
Amanieu d'Antras
b4023b1ffe
Initial conversion to const generics (#1018) 2021-02-27 14:25:19 +00:00
minybot
b7acc2e1da
Avx512f avx512vl (#1001) 2021-02-14 17:59:05 +00:00
kangshan1157
936e1add97
Implement avx512bf16 intrinsics (#998) 2021-02-10 23:29:27 +00:00
minybot
1f93bbe3b9
avx512vnni (#982) 2021-01-13 14:09:19 +00:00
minybot
6dc569d9d9
avx512vbmi2 (#979) 2021-01-11 16:52:19 +00:00
minybot
981e250f89
Avx512vbmi (#977) 2021-01-04 00:35:51 +00:00
DevJPM
c5ccd02907
Add AVX512BITALG (#964)
This adds the AVX512BITALG intrinsics.
It also patches the verification against the Intel Intrinsic Guide
because Rust uses a different naming.
Added intrinsics match _mm(256|512)?(_maskz?)?_popcnt_epi(8|16) for the popcount ones
and _mm(256|512)?(_mask)?_bitshuffle_epi64_mask
2020-12-09 00:28:58 +00:00