CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

microsoft/CodeXGLUE 9 Feb 2021

Benchmark datasets have a significant impact on accelerating research in programming language tasks.

Contextual Neural Model for Translating Bilingual Multi-Speaker Conversations

sameenmaruf/Bi-MSMT WS 2018

In this work, we propose the task of translating Bilingual Multi-Speaker Conversations, and explore neural architectures which exploit both source and target-side conversation histories for this task.

Pre-training via Paraphrasing

lucidrains/marge-pytorch NeurIPS 2020

The objective noisily captures aspects of paraphrase, translation, multi-document summarization, and information retrieval, allowing for strong zero-shot performance on several tasks.

CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task

ssun32/CLIReval ACL 2020

We present CLIReval, an easy-to-use toolkit for evaluating machine translation (MT) with the proxy task of cross-lingual information retrieval (CLIR).

Rethinking Document-level Neural Machine Translation

sunzewei2715/Doc2Doc_NMT Findings (ACL) 2022

This paper does not aim at introducing a novel model for document-level neural machine translation.

UDAAN - Machine Learning based Post-Editing tool for Document Translation

ayushbits/udaan-post-editing 3 Mar 2022

Replacements are based on lexicon alignment between the source and target texts.