Search Results for author: Tasnim Mohiuddin

Found 12 papers, 2 papers with code

Data Selection Curriculum for Neural Machine Translation

no code implementations25 Mar 2022 Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq Joty

In this work, we introduce a two-stage curriculum training framework for NMT where we fine-tune a base NMT model on subsets of data, selected by both deterministic scoring using pre-trained methods and online scoring that considers prediction scores of the emerging NMT model.

Machine Translation NMT +1

AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

no code implementations Findings (ACL) 2021 Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

We show that AUGVIC helps to attenuate the discrepancies between relevant and distant-domain monolingual data in traditional back-translation.

Data Augmentation Machine Translation +2

Unsupervised Word Translation with Adversarial Autoencoder

no code implementations CL 2020 Tasnim Mohiuddin, Shafiq Joty

Crosslingual word embeddings learned from monolingual embeddings have a crucial role in many downstream tasks, ranging from machine translation to transfer learning.

Machine Translation Transfer Learning +3

Rethinking Coherence Modeling: Synthetic vs. Downstream Tasks

no code implementations EACL 2021 Tasnim Mohiuddin, Prathyusha Jwalapuram, Xiang Lin, Shafiq Joty

Although coherence modeling has come a long way in developing novel models, their evaluation on downstream applications for which they are purportedly developed has largely been neglected.

Benchmarking Coherence Evaluation +7

LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

no code implementations EMNLP 2020 Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

Most of the successful and predominant methods for bilingual lexicon induction (BLI) are mapping-based, where a linear mapping function is learned with the assumption that the word embedding spaces of different languages exhibit similar geometric structures (i. e., approximately isomorphic).

Bilingual Lexicon Induction Cross-Lingual Word Embeddings +1

A Unified Neural Coherence Model

no code implementations IJCNLP 2019 Han Cheol Moon, Tasnim Mohiuddin, Shafiq Joty, Xu Chi

In this paper, we propose a unified coherence model that incorporates sentence grammar, inter-sentence coherence relations, and global coherence patterns into a common neural framework.

Machine Translation Sentence +1

Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

1 code implementation NAACL 2019 Tasnim Mohiuddin, Shafiq Joty

Adversarial training has shown impressive success in learning bilingual dictionary without any parallel data by mapping monolingual embeddings to a shared space.

Translation Word Translation

Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach

no code implementations CL 2018 Shafiq Joty, Tasnim Mohiuddin

Participants in an asynchronous conversation (e. g., forum, e-mail) interact with each other at different times, performing certain communicative acts, called speech acts (e. g., question, request).

Sentence Word Embeddings

Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

1 code implementation ACL 2018 Tasnim Mohiuddin, Shafiq Joty, Dat Tien Nguyen

We propose a novel coherence model for written asynchronous conversations (e. g., forums, emails), and show its applications in coherence assessment and thread reconstruction tasks.

Cannot find the paper you are looking for? You can Submit a new open access paper.