Use of this document

This is a study note for understanding the hyper-parameters (p1, p2) of the Adam optimizer (adaptive moment estimation).
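To make the role of the two hyper-parameters concrete, here is a minimal sketch of a single Adam update for one scalar parameter. It assumes that p1 and p2 are the exponential decay rates of the first and second moment estimates (often written β1 and β2 elsewhere); the function name `adam_step`, the argument names, and the default values 0.9 and 0.999 are illustrative choices, not taken from this note.

```python
import math

def adam_step(theta, grad, m, v, t, lr=0.001, p1=0.9, p2=0.999, eps=1e-8):
    """One Adam update for a scalar parameter (hypothetical helper).

    t is the 1-based step counter; m and v start at 0.0.
    p1, p2 are assumed to be the decay rates of the moment estimates.
    """
    # First moment: exponential moving average of the gradient (decay p1).
    m = p1 * m + (1 - p1) * grad
    # Second moment: exponential moving average of the squared gradient (decay p2).
    v = p2 * v + (1 - p2) * grad ** 2
    # Bias correction compensates for m and v being initialized at zero.
    m_hat = m / (1 - p1 ** t)
    v_hat = v / (1 - p2 ** t)
    # Parameter update, scaled by the root of the second moment estimate.
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v

# Example: first step on f(x) = x^2 at x = 1.0, where the gradient is 2.0.
theta, m, v = adam_step(theta=1.0, grad=2.0, m=0.0, v=0.0, t=1)
```

In this sketch a p1 closer to 1 makes the gradient average smoother (more momentum-like), and a p2 closer to 1 makes the per-parameter scaling change more slowly. On the very first step the bias-corrected ratio is close to the sign of the gradient, so the parameter moves by roughly `lr` regardless of the gradient's magnitude.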

jupyter_Adam