1 code implementation • 22 Mar 2024 • Yifan Zhang, Weiqi Chen, Zhaoyang Zhu, Dalin Qin, Liang Sun, Xue Wang, Qingsong Wen, Zhang Zhang, Liang Wang, Rong Jin
For the state-of-the-art (SOTA) model, the MSE is reduced by $33.3\%$.
1 code implementation • 4 Oct 2021 • Zhaoyang Zhu, Haozhe Sun, Chi Zhang
Adam is widely used to train neural networks.