no code implementations • 2 Aug 2024 • Jiwoo Ryu, Hao-Wen Dong, Jongmin Jung, Dasaem Jeong
The NMT consists of two transformers: the main decoder that models a sequence of compound tokens and the sub-decoder for modeling sub-tokens of each compound token.