Search Results for author: Hendra Setiawan

Found 11 papers, 3 papers with code

Automating Behavioral Testing in Machine Translation

1 code implementation5 Sep 2023 Javier Ferrando, Matthias Sperber, Hendra Setiawan, Dominic Telaar, Saša Hasan

Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of input-output behavior.

Machine Translation Translation

One Wide Feedforward is All You Need

no code implementations4 Sep 2023 Telmo Pessoa Pires, António V. Lopes, Yannick Assogba, Hendra Setiawan

The Transformer architecture has two main non-embedding components: Attention and the Feed Forward Network (FFN).

Position

Accurate Knowledge Distillation with n-best Reranking

no code implementations20 May 2023 Hendra Setiawan

We propose utilizing n-best reranking to enhance Sequence-Level Knowledge Distillation (Kim and Rush, 2016) where we extract pseudo-labels for student model's training data from top n-best hypotheses and leverage a diverse set of models with different inductive biases, objective functions or architectures, including some publicly-available large language models, to pick the highest-quality hypotheses as labels.

Knowledge Distillation Translation

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data

no code implementations20 Dec 2022 Mozhdeh Gheini, Tatiana Likhomanenko, Matthias Sperber, Hendra Setiawan

Self-training has been shown to be helpful in addressing data scarcity for many domains, including vision, speech, and language.

Data Augmentation Pseudo Label +2

Consistent Transcription and Translation of Speech

1 code implementation24 Jul 2020 Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik

To address various shortcomings of this paradigm, recent work explores end-to-end trainable direct models that translate without transcribing.

speech-recognition Speech Recognition +1

Variational Neural Machine Translation with Normalizing Flows

no code implementations ACL 2020 Hendra Setiawan, Matthias Sperber, Udhay Nallasamy, Matthias Paulik

Variational Neural Machine Translation (VNMT) is an attractive framework for modeling the generation of target translations, conditioned not only on the source sentence but also on some latent random variables.

Machine Translation NMT +2

Statistical Machine Translation Features with Multitask Tensor Networks

no code implementations IJCNLP 2015 Hendra Setiawan, Zhongqiang Huang, Jacob Devlin, Thomas Lamar, Rabih Zbib, Richard Schwartz, John Makhoul

We present a three-pronged approach to improving Statistical Machine Translation (SMT), building on recent success in the application of neural networks to SMT.

Machine Translation Tensor Networks +1

Cannot find the paper you are looking for? You can Submit a new open access paper.