no code implementations • ICLR 2018 • Chong Wang, Xipeng Lan, Yangang Zhang
The idea is to make a small student network imitate the target of a large teacher network, then the student network can be competitive to the teacher one.
no code implementations • 9 Sep 2017 • Chong Wang, Xue Zhang, Xipeng Lan
However, as the number of identities becomes extremely large, the training will suffer from bad local minima because effective hard triplets are difficult to be found.