Multilingual text classification
14 papers with code • 0 benchmarks • 2 datasets
Benchmarks
These leaderboards are used to track progress in Multilingual text classification
Latest papers with no code
L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
This research contributes significantly to expanding the pool of available text classification datasets and also makes it possible to develop topic classification models for Indian regional languages.
Comparative Analysis of Multilingual Text Classification & Identification through Deep Learning and Embedding Visualization
This research conducts a comparative study on multilingual text classification methods, utilizing deep learning and embedding visualization.
Model and Evaluation: Towards Fairness in Multilingual Text Classification
The multilingual text representation module uses a multilingual pre-trained language model to represent the text, the language fusion module makes the semantic spaces of different languages tend to be consistent through contrastive learning, and the text debiasing module uses contrastive learning to make the model unable to identify sensitive attributes' information.
MiLMo:Minority Multilingual Pre-trained Language Model
To solve the problem of scarcity of datasets on minority languages and verify the effectiveness of the MiLMo model, this paper constructs a minority multilingual text classification dataset named MiTC, and trains a word2vec model for each language.
muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem
In this paper, we are presenting our solution to Multilingual Abusive Comment Identification Problem on Moj, an Indian video-sharing social networking service, powered by ShareChat.
Graph Neural Network Enhanced Language Models for Efficient Multilingual Text Classification
To overcome these challenges, we propose a multilingual disaster related text classification system which is capable to work under \{mono, cross and multi\} lingual scenarios and under limited supervision.
Multilingual Text Classification for Dravidian Languages
On the other hand, in view of the problem that the model cannot well recognize and utilize the correlation among languages, we further proposed a language-specific representation module to enrich semantic information for the model.
A Primer on Pretrained Multilingual Language Models
Multilingual Language Models (\MLLMs) such as mBERT, XLM, XLM-R, \textit{etc.}
Multilingual Epidemiological Text Classification: A Comparative Study
We conduct a comparative study of different machine and deep learning text classification models using a dataset comprising news articles related to epidemic outbreaks from six languages, four low-resourced and two high-resourced, in order to analyze the influence of the nature of the language, the structure of the document, and the size of the data.
Evaluating Transformer-Based Multilingual Text Classification
As NLP tools become ubiquitous in today's technological landscape, they are increasingly applied to languages with a variety of typological structures.