Why does my training loss have regular spikes?
I've figured it out myself.

TL;DR: Make sure your loss magnitude is independent of your mini-batch size.

The long explanation: In my case the issue was Keras-specific after all. Maybe the solution to this problem will be useful for someone at some point. It turns out that Keras divides the loss by the mini-batch size. …
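
To illustrate the TL;DR, here is a minimal Python sketch of how a regular spike could arise. It rests on assumptions that are not spelled out in the text above: that the loss function already averages over the batch on its own, that the result is then divided by the mini-batch size a second time, and that the dataset size is not a multiple of the batch size, so the last mini-batch of each epoch is smaller. The helper names `batch_sizes` and `reported_loss` are hypothetical and not part of Keras.

```python
def batch_sizes(num_samples, batch_size):
    """Sizes of the mini-batches in one epoch; the last one may be smaller."""
    full, rest = divmod(num_samples, batch_size)
    return [batch_size] * full + ([rest] if rest else [])


def reported_loss(per_sample_losses, double_normalize):
    """Mean per-sample loss, optionally divided by the batch size again."""
    n = len(per_sample_losses)
    loss = sum(per_sample_losses) / n  # the loss function averages on its own
    if double_normalize:
        # Assumption for illustration: a second division by the batch size
        # happens somewhere else in the training loop.
        loss = loss / n
    return loss


sizes = batch_sizes(num_samples=100, batch_size=32)
per_sample = 2.0  # pretend every sample has the same loss

good = [reported_loss([per_sample] * n, double_normalize=False) for n in sizes]
bad = [reported_loss([per_sample] * n, double_normalize=True) for n in sizes]

print(sizes)  # [32, 32, 32, 4]
print(good)   # [2.0, 2.0, 2.0, 2.0]
print(bad)    # [0.0625, 0.0625, 0.0625, 0.5]
```

With 100 samples and a batch size of 32, the last batch holds only 4 samples. The doubly normalized value jumps by a factor of 8 once per epoch even though the per-sample loss never changed, while the singly normalized value stays flat. If your spikes line up with the last batch of each epoch, this kind of batch-size dependence is worth checking.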