First of all, the problem has nothing to do with the module system (directly).
I noticed that even with JDK 9 the first warmup iteration of newInstance
was as fast as with JDK 8.
# Fork: 1 of 1
# Warmup Iteration 1: 10,578 ns/op <-- Fast!
# Warmup Iteration 2: 246,426 ns/op
# Warmup Iteration 3: 242,347 ns/op
This means something has broken in JIT compilation.
-XX:+PrintCompilation
confirmed that the benchmark was recompiled after the first iteration:
10,762 ns/op
# Warmup Iteration 2: 1541 689 ! 3 java.lang.Class::newInstance (160 bytes) made not entrant
1548 692 % 4 bench.generated.NewInstance_newInstance_jmhTest::newInstance_avgt_jmhStub @ 13 (56 bytes)
1552 693 4 bench.generated.NewInstance_newInstance_jmhTest::newInstance_avgt_jmhStub (56 bytes)
1555 662 3 bench.generated.NewInstance_newInstance_jmhTest::newInstance_avgt_jmhStub (56 bytes) made not entrant
248,023 ns/op
Then -XX:+UnlockDiagnosticVMOptions -XX:+PrintInlining
pointed to the inlining problem:
1577 667 % 4 bench.generated.NewInstance_newInstance_jmhTest::newInstance_avgt_jmhStub @ 13 (56 bytes)
@ 17 bench.NewInstance::newInstance (6 bytes) inline (hot)
! @ 2 java.lang.Class::newInstance (160 bytes) already compiled into a big method
“already compiled into a big method” message means that the compiler has failed to inline Class.newInstance
call because the compiled size of the callee is larger than InlineSmallCode
value (which is 2000 by default).
When I reran the benchmark with -XX:InlineSmallCode=2500
, it became fast again.
Benchmark Mode Cnt Score Error Units
NewInstance.newInstance avgt 5 8,847 ± 0,080 ns/op
NewInstance.operatorNew avgt 5 5,042 ± 0,177 ns/op
You know, JDK 9 now has G1 as the default GC. If I fall back to Parallel GC, the benchmark will also be fast even with the default InlineSmallCode
.
Rerun JDK 9 benchmark with -XX:+UseParallelGC
:
Benchmark Mode Cnt Score Error Units
NewInstance.newInstance avgt 5 8,728 ± 0,143 ns/op
NewInstance.operatorNew avgt 5 4,822 ± 0,096 ns/op
G1 requires to put some barriers whenever an object store happens, that’s why the compiled code becomes a bit larger, so that Class.newInstance
exceeds the default InlineSmallCode
limit. Another reason why compiled Class.newInstance
has become larger is that the reflection code had been slightly rewritten in JDK 9.
TL;DR JIT has failed to inline
Class.newInstance
, becauseInlineSmallCode
limit has been exceeded. The compiled version ofClass.newInstance
has become larger due to changes in reflection code in JDK 9 and because the default GC has been changed to G1.