CodePudding
Home
front end
Back-end
Net
Software design
Enterprise
Blockchain
Mobile
Software engineering
database
OS
other
Tags
>
micro-optimization
08-28
front end
LEA vs MOV INC for incrementing a variable and storing it in another one?
08-25
Software design
lea or mov inc for incrementing a variable and storing it in another one?
08-24
Software design
What's the most efficient way to swap 4 16-bit integers on a 64-bit processor?
07-26
Mobile
Fastest way to take the average of two signed integers in x86 assembly?
07-25
Software design
Efficiently loading both RAX and R8 with the same small positive number
07-20
Enterprise
`test` vs `cmp` for one-bit registers comparison
06-27
OS
Why does gcc implement fmin and fmax in three different ways?
06-08
Enterprise
Is there a better way to any detect bits that are set in a 16-byte array of flags?
05-28
Software engineering
Implement a function that rotates 90 degrees in a two-dimensional array that represents a color imag
05-20
Mobile
How do you reason about fluctuations in benchmarking data?
05-20
Blockchain
Optimize lookup tables to simple ALU
04-30
Blockchain
Fastest way to set highest order bit of rax register to lowest order bit in rdx register
03-26
Enterprise
Is it still worth using the Quake fast inverse square root algorithm nowadays on x86-64?
03-02
OS
Copy bit of one register to another register (x86-64 asm)
12-21
Blockchain
GEMM kernel implemented using AVX2 is faster than AVX2/FMA on a Zen 2 CPU
12-21
OS
GEMM kernel implemented using AVX2 is faster than AVX2/FMA on a Zen 2 CPU
12-14
Blockchain
Intel JCC Erratum - what is the effect of prefixes used for mitigation?
12-01
Software engineering
Do 32-bit and 64-bit registers cause differences in CPU micro architecture?
11-30
Mobile
string_view Vs const char* performance
11-16
OS
When joining four 1-byte vars into one 4-byte word, which is a faster way to shift and OR ? (compari
11-14
front end
How to write a custom exception class derived from std::invalid_argument?
11-06
front end
C different using declarations for different concepts
11-06
front end
C 20 Likely and UnLikely?
10-08
Enterprise
When source registers in avx instruction can be reused
09-30
other
Why does clang's epilogue use `add $N, %rsp` instead of `mov %rbp, %rsp` to restore `%rsp`?
09-22
OS
x86_64 best way to reduce 64 bit register to 32 bit retaining zero or non-zero status
Links:
CodePudding