How Do Adam and Training Strategies Help BNNs Optimization?
AdamBNN
This is the PyTorch implementation of our paper “How Do Adam and Training Strategies Help BNNs Optimization?”, published in ICML 2021. In this work, we explore the intrinsic reasons why Adam is superior to other optimizers like SGD for BNN optimization and provide analytical explanations that support specific training strategies. By visualizing the optimization trajectory, we show that the optimization lies in an extremely rugged loss landscape and that the second-order momentum in Adam is crucial to revitalize the […]