no code implementations • 11 Sep 2020 • Ji-Yue Wang, Pei Zhang, Wen-feng Pang, Jie Li
The experiment results confirm that the TC can help LsrKD and MrKD to boost training, especially on the networks they are failed.
Self-Knowledge Distillation