Natural Language Processing

Multilingual NLP

36 papers with code • 0 benchmarks • 4 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Multilingual NLP

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find Multilingual NLP models and implementations

nlp-uoregon/trankit

2 papers

712

Datasets

Most implemented papers

Most implemented Social Latest No code

Unsupervised Cross-lingual Representation Learning at Scale

facebookresearch/XLM • • ACL 2020

We also present a detailed empirical analysis of the key factors that are required to achieve these gains, including the trade-offs between (1) positive transfer and capacity dilution and (2) the performance of high and low resource languages at scale.

Paper
Code

Language-agnostic BERT Sentence Embedding

FreddeFrallan/Multilingual-CLIP • • ACL 2022

While BERT is an effective method for learning monolingual sentence embeddings for semantic similarity and embedding based transfer learning (Reimers and Gurevych, 2019), BERT based cross-lingual sentence embeddings have yet to be explored.

Paper
Code

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

tigerresearch/tigerbot • • 9 Nov 2022

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions.

Paper
Code

PMIndia -- A Collection of Parallel Corpora of Languages of India

bhaddow/pmindia-crawler • 27 Jan 2020

Parallel text is required for building high-quality machine translation (MT) systems, as well as for other multilingual NLP applications.

Paper
Code

XeroAlign: Zero-Shot Cross-lingual Transformer Alignment

huawei-noah/noah-research • • Findings (ACL) 2021

The introduction of pretrained cross-lingual language models brought decisive improvements to multilingual NLP tasks.

Paper
Code

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

yihongl1u/colexificationnet • 22 May 2023

ColexNet's nodes are concepts and its edges are colexifications.

Paper
Code

MMCR4NLP: Multilingual Multiway Corpora Repository for Natural Language Processing

zhiqu22/adapnoncenter • • 3 Oct 2017

Multilinguality is gradually becoming ubiquitous in the sense that more and more researchers have successfully shown that using additional languages help improve the results in many Natural Language Processing tasks.

Paper
Code