Cross-Lingual Document Classification

12 papers with code • 10 benchmarks • 2 datasets

Cross-lingual document classification refers to the task of using data and models available for one language for which ample such resources are available (e.g., English) to solve classification tasks in another, commonly low-resource, language.

Most implemented papers

Bridging the domain gap in cross-lingual document classification

laiguokun/xlu-data 16 Sep 2019

We consider the setting of semi-supervised cross-lingual understanding, where labeled data is available in a source language (English), but only unlabeled data is available in the target language.

Multilingual and cross-lingual document classification: A meta-learning approach

mrvoh/meta_learning_multilingual_doc_classification EACL 2021

The great majority of languages in the world are considered under-resourced for the successful application of deep learning methods.