no code implementations • ACL 2022 • Dengji Guo, Zhengrui Ma, Min Zhang, Yang Feng
Regularization methods that apply input perturbation have drawn considerable attention and have been widely explored for NMT in recent years.
no code implementations • ACL 2021 • Yang Feng, Shuhao Gu, Dengji Guo, Zhengxin Yang, Chenze Shao
Meanwhile, we force the conventional decoder to simulate the behaviors of the seer decoder via knowledge distillation.
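As a rough illustration of the distillation step described in this snippet, the sketch below computes a per-position KL divergence between a teacher distribution (standing in for the seer decoder) and a student distribution (standing in for the conventional decoder). The function names, the `temperature` parameter, and the choice of KL(teacher || student) are assumptions made for illustration; the snippet does not specify the exact loss the authors use.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over one list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_sum_exp for x in scaled]


def kd_loss(student_logits, teacher_logits, temperature=1.0):
    """Mean per-position KL(teacher || student).

    Both arguments are lists of per-position logit lists over the same
    vocabulary. In the setting of the snippet, the teacher would be the
    seer decoder and the student the conventional decoder (assumed roles).
    """
    total = 0.0
    for s_row, t_row in zip(student_logits, teacher_logits):
        log_p_t = log_softmax(t_row, temperature)
        log_p_s = log_softmax(s_row, temperature)
        # KL = sum_v p_t(v) * (log p_t(v) - log p_s(v))
        total += sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_p_t, log_p_s))
    return total / len(student_logits)
```

In training, a term like this would typically be added to the usual cross-entropy loss on the student, with only the student's parameters updated by it; how the two terms are weighted is likewise not stated in the snippet.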