1 code implementation • ICCV 2023 • Hyunjae Lee, Heon Song, Hyeonsoo Lee, Gi-hyeon Lee, Suyeong Park, Donggeun Yoo
On the other hand, Self-Distillation (SD) only transfers partial knowledge learned by the task model itself.
Bayesian Optimization • Learning with noisy labels