no code implementations • EAMT 2022 • Nishant Kambhatla, Logan Born, Anoop Sarkar
We propose a novel technique that combines alternative subword tokenizations of a single source-target language pair, allowing us to leverage multilingual neural translation training methods.
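The core idea can be sketched as follows: pair one target sentence with several segmentations of the same source, each marked with a pseudo-language tag, so standard multilingual NMT training machinery applies unchanged. This is an illustrative sketch only; the tag names are hypothetical, and toy segmenters (whitespace vs. character-level) stand in for real subword tokenizers such as BPE.

```python
def tag_variants(src: str, tgt: str):
    """Produce multiple tagged segmentations of one source sentence,
    each paired with the same target, mimicking multilingual training
    data. Toy segmenters and tag names are illustrative assumptions."""
    word_toks = src.split()                  # whitespace "tokenizer"
    char_toks = list(src.replace(" ", "_"))  # character-level "tokenizer"
    return [
        (["<word>"] + word_toks, tgt),
        (["<char>"] + char_toks, tgt),
    ]

pairs = tag_variants("das haus", "the house")
```

Each tagged variant then enters training as if it came from a distinct source "language".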
no code implementations • EMNLP 2020 • Ashkan Alinejad, Anoop Sarkar
Directly translating from speech to text using an end-to-end approach is still challenging for many language pairs due to insufficient data.
1 code implementation • EMNLP 2021 • Ashkan Alinejad, Hassan S. Shavarani, Anoop Sarkar
In simultaneous machine translation, finding an agent with an optimal sequence of read and write actions that maintains high translation quality while minimizing the average lag in producing target tokens remains an extremely challenging problem.
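For reference, the simplest fixed policy in this action space is wait-k (read k source tokens, then alternate writes and reads). The sketch below enumerates that action sequence; it is an illustrative baseline, not the learned agent proposed in the paper.

```python
def wait_k_actions(src_len: int, tgt_len: int, k: int):
    """Fixed wait-k policy: READ k source tokens up front, then
    alternate WRITE/READ; once the source is exhausted, WRITE the
    remaining target tokens. Illustrative baseline only."""
    actions, read, written = [], 0, 0
    while written < tgt_len:
        if read < min(k + written, src_len):
            actions.append("READ")
            read += 1
        else:
            actions.append("WRITE")
            written += 1
    return actions
```

A learned agent replaces this fixed schedule with read/write decisions conditioned on the translation state, trading lag against quality adaptively.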
1 code implementation • 17 Apr 2024 • Nicolas Ong, Hassan Shavarani, Anoop Sarkar
Despite remarkable strides made in the development of entity linking systems in recent years, a comprehensive comparative analysis of these systems using a unified framework is notably absent.
1 code implementation • 23 Oct 2023 • Hassan S. Shavarani, Anoop Sarkar
Entity linking is a prominent thread of research focused on structured data creation by linking spans of text to an ontology or knowledge source.
Ranked #1 on Entity Linking on AIDA/testc (using extra training data)
1 code implementation • ACL 2022 • Nishant Kambhatla, Logan Born, Anoop Sarkar
We propose a novel data-augmentation technique for neural machine translation based on ROT-$k$ ciphertexts.
Ranked #9 on Machine Translation on IWSLT2014 German-English
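A ROT-$k$ cipher simply rotates each letter $k$ places through the alphabet, so the augmentation can be sketched as below: pair an enciphered copy of the source with the unchanged target. How cipher keys are chosen and how ciphered and plain data are mixed during training follows the paper; this shows only the cipher and a naive augmentation step.

```python
def rot_k(text: str, k: int) -> str:
    """Apply a ROT-k substitution cipher to the letters of `text`,
    leaving non-alphabetic characters (spaces, punctuation) unchanged."""
    out = []
    for ch in text:
        if ch.islower():
            out.append(chr((ord(ch) - ord("a") + k) % 26 + ord("a")))
        elif ch.isupper():
            out.append(chr((ord(ch) - ord("A") + k) % 26 + ord("A")))
        else:
            out.append(ch)
    return "".join(out)

# Naive augmentation of a toy parallel corpus:
# enciphered source paired with the original target.
corpus = [("das haus ist klein", "the house is small")]
augmented = corpus + [(rot_k(src, 13), tgt) for src, tgt in corpus]
```

Because ROT-$k$ is a bijection on the alphabet, the enciphered source preserves token boundaries and word order while presenting entirely novel surface forms.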
1 code implementation • EACL 2021 • Hassan S. Shavarani, Anoop Sarkar
Adding linguistic information (syntax or semantics) to neural machine translation (NMT) has mostly focused on using point estimates from pre-trained models.
no code implementations • EACL 2021 • Pooya Moradi, Nishant Kambhatla, Anoop Sarkar
While the attention heatmaps produced by neural machine translation (NMT) models seem insightful, there is little evidence that they reflect a model's true internal reasoning.
no code implementations • AACL 2020 • Pooya Moradi, Nishant Kambhatla, Anoop Sarkar
Can we trust that the attention heatmaps produced by a neural machine translation (NMT) model reflect its true internal reasoning?
no code implementations • CoNLL 2019 • Zhenqi Zhu, Anoop Sarkar
Supertagging is a sequence prediction task where each word is assigned a piece of complex syntactic structure called a supertag.
1 code implementation • WS 2019 • Pooya Moradi, Nishant Kambhatla, Anoop Sarkar
Attention models have become a crucial component in neural machine translation (NMT).
1 code implementation • 17 Sep 2019 • Jetic Gū, Hassan S. Shavarani, Anoop Sarkar
Neural machine translation (NMT) systems require large amounts of high quality in-domain parallel corpora for training.
1 code implementation • WS 2019 • Logan Born, Kate Kelley, Nishant Kambhatla, Carolyn Chen, Anoop Sarkar
We describe a first attempt at using techniques from computational linguistics to analyze the undeciphered proto-Elamite script.
no code implementations • EMNLP 2018 • Ashkan Alinejad, Maryam Siahbani, Anoop Sarkar
Simultaneous speech translation aims to maintain translation quality while minimizing the delay between reading input and incrementally producing the output.
no code implementations • WS 2018 • Zhelun Wu, Nishant Kambhatla, Anoop Sarkar
Automated filters are commonly used by online services to stop users from sending age-inappropriate or bullying messages, or from asking others to expose personal information.
no code implementations • WS 2018 • Golnar Sheikhshabbafghi, Inanc Birol, Anoop Sarkar
Here we report on a pipeline built on Embeddings from Language Models (ELMo) and a deep learning package for natural language processing (AllenNLP).
no code implementations • EMNLP 2018 • Jetic Gū, Hassan S. Shavarani, Anoop Sarkar
The addition of syntax-aware decoding in Neural Machine Translation (NMT) systems requires an effective tree-structured neural network, a syntax-aware attention model and a language generation model that is sensitive to sentence structure.
no code implementations • ACL 2018 • Logan Born, Anoop Sarkar
We show that an epsilon-free, chain-free synchronous context-free grammar (SCFG) can be converted into a weakly equivalent synchronous tree-adjoining grammar (STAG) which is prefix lexicalized.
no code implementations • EACL 2017 • Maryam Siahbani, Anoop Sarkar
Phrase-based and hierarchical phrase-based (Hiero) translation models differ radically in the way reordering is modeled.
no code implementations • 16 Apr 2015 • Mark Schmidt, Reza Babanezhad, Mohamed Osama Ahmed, Aaron Defazio, Ann Clifton, Anoop Sarkar
We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs).
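SAG keeps the most recent gradient for every training example and steps along their running average, giving SGD-cost iterations with linear convergence on strongly convex finite sums. A minimal sketch of the update on a toy least-squares objective (standing in for the CRF likelihood, whose gradient computation is more involved):

```python
import numpy as np

def sag_least_squares(X, y, steps=5000, lr=0.03, seed=0):
    """Stochastic Average Gradient (SAG) on the finite-sum objective
    f(w) = (1/n) * sum_i 0.5 * (x_i^T w - y_i)^2.
    A toy stand-in for the CRF objective; hyperparameters are
    illustrative, not the paper's settings."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    grads = np.zeros((n, d))   # memory: last seen gradient per example
    grad_sum = np.zeros(d)     # running sum of the stored gradients
    for _ in range(steps):
        i = rng.integers(n)
        g = (X[i] @ w - y[i]) * X[i]  # fresh gradient for example i
        grad_sum += g - grads[i]      # swap out the stale entry
        grads[i] = g
        w -= lr * grad_sum / n        # step along the average gradient
    return w
```

Unlike plain SGD, the averaged gradient goes to zero exactly at the optimum of the finite sum, so no step-size decay is needed for convergence.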