That is because the actual Z value is the sum of the elevation and the translationZ
From the docs, the elevation is “base z depth of the view” and this is a static variable, while translationZ is dynamic.
So elevation is your start value and for animations you should use translationZ.
Source