no code implementations • 31 May 2023 • Zhaocheng Liu, Zhongxiang Fan, Jian Liang, Dongying Kong, Han Li
However, it is still unknown whether a multi-epoch training paradigm could achieve better results, as the best performance is usually achieved by one-epoch training.