no code implementations • NAACL (CMCL) 2021 • Kartik Sharma, Niyati Bafna, Samar Husain
The models differ in their use of prior context during the prediction process – the context is either noisy or noise-free.
no code implementations • LREC 2022 • Zdeněk Žabokrtský, Niyati Bafna, Jan Bodnár, Lukáš Kyjánek, Emil Svoboda, Magda Ševčíková, Jonáš Vidra
Our work aims at developing a multilingual data resource for morphological segmentation.
no code implementations • ICON 2021 • Niyati Bafna, Martin Vastl, Ondřej Bojar
Technical terms may require special handling when the target audience is bilingual, depending on the cultural and educational norms of the society in question.
no code implementations • ICON 2019 • Niyati Bafna, Dipti Sharma
English-Hindi machine translation systems have difficulty interpreting verb phrase ellipsis (VPE) in English, and commit errors in translating sentences with VPE.
no code implementations • 31 May 2025 • Niyati Bafna, Matthew Wiesner
Prior research indicates that LID model performance significantly declines on accented speech; however, the specific causes, extent, and characterization of these errors remain under-explored.
no code implementations • 27 Jan 2025 • Niyati Bafna, Emily Chang, Nathaniel R. Robinson, David R. Mortensen, Kenton Murray, David Yarowsky, Hale Sirin
Most of the world's languages and dialects are low-resource, and lack support in mainstream machine translation (MT) models.
1 code implementation • 19 Jun 2024 • Niyati Bafna, Kenton Murray, David Yarowsky
While large language models exhibit certain cross-lingual generalization capabilities, they suffer from performance degradation (PD) on unseen closely-related languages (CRLs) and dialects relative to their high-resource language neighbour (HRLN).
1 code implementation • 16 Mar 2024 • Niyati Bafna, Philipp Koehn, David Yarowsky
While Transformer-based neural machine translation (NMT) is very effective in high-resource settings, many languages lack the necessary large parallel corpora to benefit from it.
1 code implementation • 23 May 2023 • Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot, Rachel Bawden
Most existing approaches for unsupervised bilingual lexicon induction (BLI) depend on good quality static or contextual embeddings requiring large monolingual corpora for both languages.