LinCE (Linguistic Code-switching Evaluation Dataset)

Introduced by Aguilar et al. in LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation

LinCE is a centralized benchmark for linguistic code-switching evaluation. It combines ten corpora covering four code-switched language pairs (Spanish-English, Nepali-English, Hindi-English, and Modern Standard Arabic-Egyptian Arabic) and four tasks (language identification, named entity recognition, part-of-speech tagging, and sentiment analysis).

Source: LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation
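
To make the notion of code-switching in the description concrete, the following is a minimal sketch of what token-level language identification output might look like and how switch points could be counted from it. The sentence, the tag names (`en`, `es`, `other`), and the helper `count_switch_points` are all hypothetical illustrations; they are not taken from the LinCE label set or from any LinCE tooling.

```python
from typing import List, Sequence, Tuple

# Hypothetical token-level annotation: (token, language tag).
# The tag names are illustrative assumptions, not the LinCE label set.
Tagged = List[Tuple[str, str]]


def count_switch_points(tagged: Tagged, languages: Sequence[str] = ("en", "es")) -> int:
    """Count positions where consecutive language-tagged tokens change language.

    Tokens whose tag is not in `languages` (e.g. punctuation) are skipped,
    so they neither create nor hide a switch.
    """
    langs = [tag for _, tag in tagged if tag in languages]
    return sum(1 for prev, cur in zip(langs, langs[1:]) if prev != cur)


# Made-up Spanish-English example sentence.
example: Tagged = [
    ("I", "en"), ("want", "en"), ("to", "en"), ("go", "en"),
    ("a", "es"), ("la", "es"), ("playa", "es"),
    ("tomorrow", "en"), (".", "other"),
]

print(count_switch_points(example))  # → 2
```

The sentence switches from English to Spanish and back again, so the sketch reports two switch points. A real evaluation on the benchmark would use its own tag inventory and scoring rules.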

License


  • Unknown
