no code implementations • 28 Dec 2021 • Mohammad K. Ebrahimpour, Gang Qian, Allison Beach
On the other hand, proxy-based loss functions often converge significantly faster during training, but they tend not to fully explore the rich relations among data points.
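To make the trade-off concrete, the following is a minimal sketch of a generic proxy-based loss, assuming a softmax over negative squared distances to one learnable proxy per class (in the spirit of losses such as Proxy-NCA). It is not the loss proposed by the authors, and the names `proxy_loss`, `squared_distance`, and `count_comparisons` are hypothetical, introduced only for illustration. Each sample is compared with the class proxies and never with another sample, which suggests why such losses are cheap per batch yet blind to sample-to-sample relations.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def proxy_loss(embeddings, labels, proxies):
    """Mean softmax cross-entropy over negative squared distances to proxies.

    Assumption: one proxy per class, and labels index into `proxies`.
    Samples are only compared with proxies, never with each other.
    """
    total = 0.0
    for emb, label in zip(embeddings, labels):
        logits = [-squared_distance(emb, p) for p in proxies]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        m = max(logits)
        log_norm = m + math.log(sum(math.exp(v - m) for v in logits))
        total += log_norm - logits[label]
    return total / len(embeddings)


def count_comparisons(batch_size, num_classes):
    """Distance computations per batch: (sample-proxy pairs, all sample pairs)."""
    proxy_based = batch_size * num_classes
    pair_based = batch_size * (batch_size - 1) // 2
    return proxy_based, pair_based


# Toy usage: two 2-D embeddings, each sitting exactly on its own class proxy.
embeddings = [[0.0, 0.0], [1.0, 1.0]]
labels = [0, 1]
proxies = [[0.0, 0.0], [1.0, 1.0]]
loss = proxy_loss(embeddings, labels, proxies)
comparisons = count_comparisons(64, 10)
```

In this sketch the number of distance computations grows with batch size times number of classes, whereas a loss over all sample pairs grows quadratically with batch size. The same structure that keeps the proxy-based loss cheap also means it carries no term relating two data points directly.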