CodePudding
Home
front end
Back-end
Net
Software design
Enterprise
Blockchain
Mobile
Software engineering
database
OS
other
Tags
>
avx
09-05
Mobile
Quickest way to shift/rotate byte vector with SIMD
08-28
front end
Does anyone have an example where _mm256_stream_load_si256 (non-tempral load to bypasse cache) actua
06-04
front end
How to set all the values in AVX ymm register to be the same (all are 0/1/specific value)?
05-30
Enterprise
Best way to mask a single bit in AVX2?
05-01
database
Implementing matrix operation using AVX in C
04-12
Enterprise
count number of unique values in a 128bit avx vector, or detecting if all elements are equal?
03-17
Software engineering
__attribute__ ((vector_size)) magic being destroyed by union
02-23
Back-end
AVX performance slower for bitwise xor op and popcount
12-24
Software design
AVX2: CountTrailingZeros on 8 bit elements in AVX register
12-21
Blockchain
GEMM kernel implemented using AVX2 is faster than AVX2/FMA on a Zen 2 CPU
12-21
OS
GEMM kernel implemented using AVX2 is faster than AVX2/FMA on a Zen 2 CPU
12-08
Blockchain
How to load into __m256 from a float* but reading backwards in memory as opposed to forwards?
11-20
Enterprise
Efficiently shift-or large bit vector
11-20
front end
How is the lvalue problem solved for SIMD inline asm with memory output operands in a 2D array?
11-17
other
In assembly, how to add integers without destroying either operand?
10-08
Enterprise
When source registers in avx instruction can be reused
09-30
Mobile
int8 x uint8 matrix-vector product with column-major layout
Links:
CodePudding