CONTENTS

Solution: Compute the First Adam Update by Hand

Solution

m1=0.1(4)=0.4m_1=0.1(4)=0.4 and v1=0.001(16)=0.016v_1=0.001(16)=0.016. Bias correction gives m^1=0.4/0.1=4\hat m_1=0.4/0.1=4 and v^1=0.016/0.001=16\hat v_1=0.016/0.001=16. The step is 0.14/16=0.1-0.1\cdot4/\sqrt{16}=-0.1.

Takeaways

  • Bias correction exactly removes the initial zero bias at t=1t=1.
  • The first Adam step has sign equal to the gradient sign.
  • The scale is controlled by α\alpha when ϵ=0\epsilon=0.
Solution - Compute the First Adam Update by Hand | q4quant.studio