no code implementations • 25 Mar 2022 • Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq Joty
In this work, we introduce a two-stage curriculum training framework for NMT where we fine-tune a base NMT model on subsets of data, selected by both deterministic scoring using pre-trained methods and online scoring that considers prediction scores of the emerging NMT model.
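The deterministic-scoring stage described above amounts to ranking sentence pairs with an external scorer and fine-tuning on the top fraction. A minimal sketch, assuming a toy corpus and a stand-in scorer (the real framework uses pre-trained model scores, not the word-overlap stub here):

```python
# Hypothetical sketch of score-based curriculum data selection for NMT
# fine-tuning. The scorer and keep fraction are illustrative stand-ins,
# not the paper's actual scoring functions.

def select_subset(pairs, scorer, keep_frac):
    """Keep the top-scoring fraction of (source, target) sentence pairs."""
    scored = sorted(pairs, key=scorer, reverse=True)
    k = max(1, int(len(scored) * keep_frac))
    return scored[:k]

# Stand-in deterministic scorer: source/target word overlap.
def overlap_score(pair):
    src, tgt = pair
    return len(set(src.split()) & set(tgt.split()))

data = [
    ("the cat sat", "the cat sat"),
    ("a b c", "x y z"),
    ("hello world", "hello world"),
]
stage1 = select_subset(data, overlap_score, keep_frac=0.5)
```

In the two-stage setup, a second selection pass would re-score the surviving pairs with prediction scores from the emerging NMT model itself and fine-tune again on that subset.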
no code implementations • Findings (ACL) 2021 • Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty
We show that AUGVIC helps to attenuate the discrepancies between relevant and distant-domain monolingual data in traditional back-translation.
no code implementations • 1 Jan 2021 • M Saiful Bari, Tasnim Mohiuddin, Shafiq Joty
Transfer learning has yielded state-of-the-art (SoTA) results in many supervised NLP tasks.
no code implementations • CL 2020 • Tasnim Mohiuddin, Shafiq Joty
Cross-lingual word embeddings learned from monolingual embeddings play a crucial role in many downstream tasks, ranging from machine translation to transfer learning.

no code implementations • EACL 2021 • Tasnim Mohiuddin, Prathyusha Jwalapuram, Xiang Lin, Shafiq Joty
Although coherence modeling has come a long way in developing novel models, evaluating them on the downstream applications for which they are purportedly developed has largely been neglected.
no code implementations • ACL 2021 • M Saiful Bari, Tasnim Mohiuddin, Shafiq Joty
We propose UXLA, a novel unsupervised data augmentation framework for zero-resource transfer learning scenarios.
no code implementations • EMNLP 2020 • Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty
Most of the successful and predominant methods for bilingual lexicon induction (BLI) are mapping-based, where a linear mapping function is learned with the assumption that the word embedding spaces of different languages exhibit similar geometric structures (i.e., are approximately isomorphic).
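Under that isomorphism assumption, the standard mapping-based baseline has a closed-form solution: the orthogonal Procrustes map computed from a seed dictionary via SVD. A minimal sketch with toy synthetic embeddings (dimensions and data are assumptions for illustration):

```python
import numpy as np

# Sketch of the mapping-based BLI setup: recover a linear map W such that
# X @ W ≈ Y for seed-dictionary embedding pairs (X, Y). Toy data: the
# "target" space is an exact rotation of the "source" space.
rng = np.random.default_rng(0)
d, n = 4, 50
X = rng.normal(size=(n, d))                   # source-language embeddings
Q_true, _ = np.linalg.qr(rng.normal(size=(d, d)))
Y = X @ Q_true                                # target embeddings (rotated)

# Orthogonal Procrustes: W = U V^T, from the SVD of X^T Y.
U, _, Vt = np.linalg.svd(X.T @ Y)
W = U @ Vt
```

With noiseless, exactly isomorphic spaces this recovers the rotation; the paper's point is that real embedding spaces violate the isomorphism assumption, which is where such mappings degrade.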
no code implementations • IJCNLP 2019 • Han Cheol Moon, Tasnim Mohiuddin, Shafiq Joty, Xu Chi
In this paper, we propose a unified coherence model that incorporates sentence grammar, inter-sentence coherence relations, and global coherence patterns into a common neural framework.
1 code implementation • NAACL 2019 • Tasnim Mohiuddin, Shafiq Joty
Adversarial training has shown impressive success in learning bilingual dictionaries without any parallel data by mapping monolingual embeddings to a shared space.
no code implementations • NAACL 2019 • Tasnim Mohiuddin, Thanh-Tung Nguyen, Shafiq Joty
We address the problem of speech act recognition (SAR) in asynchronous conversations (forums, emails).
no code implementations • CL 2018 • Shafiq Joty, Tasnim Mohiuddin
Participants in an asynchronous conversation (e.g., forum, e-mail) interact with each other at different times, performing certain communicative acts, called speech acts (e.g., question, request).
1 code implementation • ACL 2018 • Tasnim Mohiuddin, Shafiq Joty, Dat Tien Nguyen
We propose a novel coherence model for written asynchronous conversations (e.g., forums, emails), and show its applications in coherence assessment and thread reconstruction tasks.