How can I code Java to allow SSE use and bounds-check elimination (or other advanced optimizations)?
Not a full answer, I simply don’t have time to do the detailed benchmarks your question needs but hopefully useful. Know your enemy You are targeting a combination of the JVM (in essence the JIT) and the underlying CPU/Memory subsystem. Thus “This is faster on JVM X” is not likely to be valid in all … Read more