Multilingual text classification

14 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

gatenlp/peft_fft_multilingual 14 Aug 2023

Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient.

0
14 Aug 2023

Exploring Multilingual Text Data Distillation

harshp1802/text-dataset-distillation 9 Aug 2023

In the paper, we propose several data distillation techniques for multilingual text classification datasets using language-model-based learning methods.

2
09 Aug 2023

SLICER: Sliced Fine-Tuning for Low-Resource Cross-Lingual Transfer for Named Entity Recognition

fdschmidt93/SLICER Proceedings of the Conference on Empirical Methods in Natural Language Processing 2022

Large multilingual language models generally demonstrate impressive results in zero-shot cross-lingual transfer, yet often fail to successfully transfer to low-resource languages, even for token-level prediction tasks like named entity recognition (NER).

2
01 Oct 2022

Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification

xiaoleihuang/domainfairness NAACL 2022

Existing approaches to mitigate demographic biases evaluate on monolingual data, however, multilingual data has not been examined.

2
12 Apr 2022

Practical Transformer-based Multilingual Text Classification

sentropytechnologies/hateval2019-relabeled NAACL 2021

Transformer-based methods are appealing for multilingual text classification, but common research benchmarks like XNLI (Conneau et al., 2018) do not reflect the data availability and task variety of industry applications.

2
01 Jun 2021

NLP-CUET@DravidianLangTech-EACL2021: Offensive Language Detection from Multilingual Code-Mixed Text using Transformers

eftekhar-hossain/CUET_NLP-EACL_2021 EACL (DravidianLangTech) 2021

In the task, datasets provided in three languages including Tamil, Malayalam and Kannada code-mixed with English where participants are asked to implement separate models for each language.

1
28 Feb 2021

NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner

eftekhar-hossain/CUET_NLP-EACL_2021 EACL (LTEDI) 2021

We propose three distinct models to identify hope speech in English, Tamil and Malayalam language to serve this purpose.

1
28 Feb 2021

CogALex-VI Shared Task: Transrelation - A Robust Multilingual Language Model for Multilingual Relation Identification

Text2TCS/Transrelation 12 Dec 2020

We describe our submission to the CogALex-VI shared task on the identification of multilingual paradigmatic relations building on XLM-RoBERTa (XLM-R), a robustly optimized and multilingual BERT model.

2
12 Dec 2020

The Multilingual Amazon Reviews Corpus

mojave-pku/uniprompt EMNLP 2020

We present the Multilingual Amazon Reviews Corpus (MARC), a large-scale collection of Amazon reviews for multilingual text classification.

7
06 Oct 2020

A Multi-cascaded Deep Model for Bilingual SMS Classification

haroonshakeel/bilingual_sms_classification 29 Nov 2019

Our model achieves high accuracy for classification on this dataset and outperforms the previous model for multilingual text classification, highlighting language independence of McM.

0
29 Nov 2019