Datasets

9,499 machine learning datasets
Filter by Task
MORPH Question Answering 7 Misinformation 6 Sentiment Analysis 6 Cross-Lingual Transfer 4 Image Classification 4 Text Classification 4 Abstractive Text Summarization 3 Language Modelling 3 Machine Translation 3 Natural Language Inference 3 Natural Language Understanding 3 Part-Of-Speech Tagging 3 Reading Comprehension 3 Sarcasm Detection 3 Speech Recognition 3 Text Summarization 3 Token Classification 3 Chinese Named Entity Recognition 2 Classification 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Dialect Identification 2 Fake News Detection 2 Information Retrieval 2 Language Identification 2 Named Entity Recognition (NER) 2 Open-Domain Question Answering 2 Question Generation 2 Speech-to-Text Translation 2 Text Categorization 2 Text-To-Speech Synthesis 2 Word Embeddings 2 Zero-Shot Cross-Lingual Transfer 2 Arabic Sentiment Analysis 1 Arabic Text Diacritization 1 Audio Source Separation 1 Automatic Phoneme Recognition 1 Automatic Post-Editing 1 Automatic Speech Recognition 1 Chinese Reading Comprehension 1 Chinese Sentence Pair Classification 1 Code Generation 1 Common Sense Reasoning 1 Coreference Resolution 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-lingual zero-shot dependency parsing 1 Data Augmentation 1 Dependency Parsing 1 Document Summarization 1 Domain Adaptation 1 Entity Embeddings 1 Entity Typing 1 FG-1-PG-1 1 FLUE 1 Fact Checking 1 Fact Verification 1 Few-Shot Audio Classification 1 Few-shot NER 1 Generalized Zero-Shot Learning 1 Handwriting Recognition 1 Hate Speech Detection 1 Instruction Following 1 Intent Classification 1 Irony Identification 1 Irregular Text Recognition 1 LABELED_DEPENDENCIES 1 LEMMA 1 Machine Reading Comprehension 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 
Max-Shot Cross-Lingual Visual Reasoning 1 Morphological Analysis 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual NLP 1 Multilingual text classification 1 Multiple-choice 1 Named Entity Recognition 1 Natural Questions 1 Node Classification 1 Object Detection 1 Optical Character Recognition (OCR) 1 POS 1 Pretrained Multilingual Language Models 1 Propaganda detection 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Relation Classification 1 Relation Extraction 1 SENTS 1 Scene Text Recognition 1 Semantic Role Labeling 1 Semantic Similarity 1 Sentence Embeddings 1 Sequence-to-sequence Language Modeling 1 Short Text Clustering 1 Slot Filling 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken Language Understanding 1 Stance Detection 1 Story Completion 1 Style Transfer 1 Super-Resolution 1 TAG 1 Text Effects Transfer 1 Translation 1 Translation eng-hrv 1 Translation eng-srp_Cyrl 1 Transliteration 1 UNLABELED_DEPENDENCIES 1 Urdu Speech Recognition 1 Vietnamese Machine Reading Comprehension 1 Visual Reasoning 1 Weakly-Supervised Named Entity Recognition 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-shot Cross-lingual Fact-checking 1 coreference-resolution 1 speech-recognition 1 tabular-classification 1 text annotation 1 text2text-generation 1
Filter by Language
Arabic Afrikaans 1 Akkadian 1 Akuntsu 1 Albanian 1 Amharic 1 Ancient Greek 1 Apurinã 1 Armenian 1 Assyrian Neo-Aramaic 1 Bambara 1 Basque 1 Belarusian 1 Bhojpuri 1 Breton 1 Bulgarian 1 Catalan 1 Chinese 1 Chukot 1 Church Slavic 1 Coptic 1 Croatian 1 Czech 1 Danish 1 Dutch 1 English 1 Erzya 1 Estonian 1 Faroese 1 Finnish 1 French 1 Galician 1 German 1 Gothic 1 Hebrew 1 Hindi 1 Hungarian 1 Icelandic 1 Indonesian 1 Irish 1 Italian 1 Japanese 1 Karelian 1 Kazakh 1 Khunsari 1 Komi-Permyak 1 Komi-Zyrian 1 Korean 1 Latin 1 Latvian 1 Literary Chinese 1 Lithuanian 1 Livvi 1 Maltese 1 Manx 1 Marathi 1 Mbyá Guaraní 1 Modern Greek 1 Moksha 1 Mundurukú 1 Nayini 1 Nigerian Pidgin 1 Northern Kurdish 1 Northern Sami 1 Norwegian 1 Old French 1 Old Russian 1 Old Turkish 1 Persian 1 Polish 1 Portuguese 1 Romanian 1 Russia Buriat 1 Russian 1 Sanskrit 1 Scottish Gaelic 1 Serbian 1 Skolt Sami 1 Slovak 1 Slovenian 1 Soi 1 South Levantine Arabic 1 Spanish 1 Swedish 1 Swedish Sign Language 1 Swiss German 1 Tagalog 1 Tamil 1 Telugu 1 Thai 1 Tupinambá 1 Turkish 1 Uighur 1 Ukrainian 1 Upper Sorbian 1 Urdu 1 Vietnamese 1 Warlpiri 1 Welsh 1 Wolof 1 Yoruba 1 Yue Chinese 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 American Sign Language 0 Ancient Hebrew 0 Aragonese 0 Argentine Sign Language 0 Arpitan 0 Assamese 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bangala 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Bavarian 0 Bemba (Zambia) 0 Bengali 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Buginese 0 Burmese 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chuvash 0 Congo Swahili 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dhivehi 0 Dimli (individual language) 0 Dogri (individual language) 0 Dogri (macrolanguage) 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 Esperanto 0 Ewe 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Filipino 0 Fon 0 Friulian 0 Fulah 0 Gagauz 0 Gan Chinese 0 Ganda 0 Geez 0
Georgian 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Greek 0 Greek Sign Language 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Halh Mongolian 0 Hausa 0 Hawaiian 0 Herero 0 Hiri Motu 0 Ido 0 Igbo 0 Iloko 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Jamaican Creole English 0 Javanese 0 Jejueo 0 Kabardian 0 Kabuverdianu 0 Kabyle 0 Kachin 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Kongo 0 Krio 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Lozi 0 Lunda 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Lushai 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Mandarin Chinese 0 Manipuri 0 Maori 0 Marshallese 0 Mazanderani 0 Mesopotamian Arabic 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek (1453-) 0 Mongolian 0 Moroccan Arabic 0 Multilingual 0 Najdi Arabic 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Fulfulde 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Luri 0 Northern Uzbek 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0
Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Plateau Malagasy 0 Pontic 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romansh 0 Rundi 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Serbo-Croatian 0 Shan 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Somali 0 South Azerbaijani 0 Southern Pashto 0 Southern Sotho 0 Sranan Tongo 0 Standard Arabic 0 Standard Latvian 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swiss-German Sign Language 0 Tahitian 0 Tai 0 Tajik 0 Tatar 0 Tetum 0 Tibetan 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tonga (Zambia) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 West Central Oromo 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Zaza 0 Zeeuws 0 Zhuang 0 Zulu 0