What are the calling conventions for UNIX & Linux system calls (and user-space functions) on i386 and x86-64? October 22, 2022 by Tarik
Why would introducing useless MOV instructions speed up a tight loop in x86_64 assembly? October 10, 2022 by Tarik
Why does GCC use multiplication by a strange number in implementing integer division? October 5, 2022 by Tarik
Why does this code execute more slowly after strength-reducing multiplications to loop-carried additions? September 28, 2022 by Tarik
Why does GCC generate 15-20% faster code if I optimize for size instead of speed? September 14, 2022 by Tarik
How do I achieve the theoretical maximum of 4 FLOPs per cycle? September 9, 2022 by Tarik