1 code implementation • 5 Sep 2023 • Javier Ferrando, Matthias Sperber, Hendra Setiawan, Dominic Telaar, Saša Hasan
Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of input-output behavior.
no code implementations • 4 Sep 2023 • Telmo Pessoa Pires, António V. Lopes, Yannick Assogba, Hendra Setiawan
The Transformer architecture has two main non-embedding components: Attention and the Feed Forward Network (FFN).
no code implementations • 20 May 2023 • Hendra Setiawan
We propose utilizing n-best reranking to enhance Sequence-Level Knowledge Distillation (Kim and Rush, 2016) where we extract pseudo-labels for student model's training data from top n-best hypotheses and leverage a diverse set of models with different inductive biases, objective functions or architectures, including some publicly-available large language models, to pick the highest-quality hypotheses as labels.
no code implementations • 20 Dec 2022 • Mozhdeh Gheini, Tatiana Likhomanenko, Matthias Sperber, Hendra Setiawan
Self-training has been shown to be helpful in addressing data scarcity for many domains, including vision, speech, and language.
2 code implementations • Findings (ACL) 2022 • Orion Weller, Matthias Sperber, Telmo Pires, Hendra Setiawan, Christian Gollan, Dominic Telaar, Matthias Paulik
Code switching (CS) refers to the phenomenon of interchangeably using words and phrases from different languages.
1 code implementation • 24 Jul 2020 • Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik
To address various shortcomings of this paradigm, recent work explores end-to-end trainable direct models that translate without transcribing.
no code implementations • ACL 2020 • Hendra Setiawan, Matthias Sperber, Udhay Nallasamy, Matthias Paulik
Variational Neural Machine Translation (VNMT) is an attractive framework for modeling the generation of target translations, conditioned not only on the source sentence but also on some latent random variables.
no code implementations • IJCNLP 2015 • Hendra Setiawan, Zhongqiang Huang, Jacob Devlin, Thomas Lamar, Rabih Zbib, Richard Schwartz, John Makhoul
We present a three-pronged approach to improving Statistical Machine Translation (SMT), building on recent success in the application of neural networks to SMT.