Datasets

9,499 machine learning datasets
Filter by Task (clear)
Word Embeddings Question Answering 7 Misinformation 6 Sentiment Analysis 6 Cross-Lingual Transfer 4 Image Classification 4 Text Classification 4 Abstractive Text Summarization 3 Language Modelling 3 Machine Translation 3 Natural Language Inference 3 Natural Language Understanding 3 Part-Of-Speech Tagging 3 Reading Comprehension 3 Sarcasm Detection 3 Speech Recognition 3 Text Summarization 3 Token Classification 3 Chinese Named Entity Recognition 2 Classification 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Dialect Identification 2 Fake News Detection 2 Information Retrieval 2 Language Identification 2 Named Entity Recognition (NER) 2 Open-Domain Question Answering 2 Question Generation 2 Speech-to-Text Translation 2 Text Categorization 2 Text-To-Speech Synthesis 2 Zero-Shot Cross-Lingual Transfer 2 Arabic Sentiment Analysis 1 Arabic Text Diacritization 1 Audio Source Separation 1 Automatic Phoneme Recognition 1 Automatic Post-Editing 1 Automatic Speech Recognition 1 Chinese Reading Comprehension 1 Chinese Sentence Pair Classification 1 Code Generation 1 Common Sense Reasoning 1 Coreference Resolution 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-lingual zero-shot dependency parsing 1 Data Augmentation 1 Dependency Parsing 1 Document Summarization 1 Domain Adaptation 1 Entity Embeddings 1 Entity Typing 1 FG-1-PG-1 1 FLUE 1 Fact Checking 1 Fact Verification 1 Few-Shot Audio Classification 1 Few-shot NER 1 Generalized Zero-Shot Learning 1 Handwriting Recognition 1 Hate Speech Detection 1 Instruction Following 1 Intent Classification 1 Irony Identification 1 Irregular Text Recognition 1 LABELED_DEPENDENCIES 1 LEMMA 1 MORPH 1 Machine Reading Comprehension 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Morphological Analysis 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual NLP 1 Multilingual text classification 1 Multiple-choice 1 Named Entity Recognition 1 Natural Questions 1 Node Classification 1 Object Detection 1 Optical Character Recognition (OCR) 1 POS 1 Pretrained Multilingual Language Models 1 Propaganda detection 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Relation Classification 1 Relation Extraction 1 SENTS 1 Scene Text Recognition 1 Semantic Role Labeling 1 Semantic Similarity 1 Sentence Embeddings 1 Sequence-to-sequence Language Modeling 1 Short Text Clustering 1 Slot Filling 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken Language Understanding 1 Stance Detection 1 Story Completion 1 Style Transfer 1 Super-Resolution 1 TAG 1 Text Effects Transfer 1 Translation 1 Translation eng-hrv 1 Translation eng-srp_Cyrl 1 Transliteration 1 UNLABELED_DEPENDENCIES 1 Urdu Speech Recognition 1 Vietnamese Machine Reading Comprehension 1 Visual Reasoning 1 Weakly-Supervised Named Entity Recognition 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-shot Cross-lingual Fact-checking 1 coreference-resolution 1 speech-recognition 1 tabular-classification 1 text annotation 1 text2text-generation 1
Filter by Language (clear)
Arabic English 18 Spanish 7 Bengali 5 Chinese 5 German 5 French 4 Dutch 3 Italian 3 Japanese 3 Russian 3 Aragonese 2 Armenian 2 Assamese 2 Bavarian 2 Bishnupriya 2 Breton 2 Chechen 2 Egyptian Arabic 2 Gujarati 2 Indonesian 2 Kannada 2 Malayalam 2 Marathi 2 Oriya (macrolanguage) 2 Polish 2 Portuguese 2 South Azerbaijani 2 Tamil 2 Tatar 2 Telugu 2 Turkish 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Afrikaans 1 Akan 1 Albanian 1 Amharic 1 Arpitan 1 Asturian 1 Avaric 1 Aymara 1 Azerbaijani 1 Bambara 1 Banjar 1 Bashkir 1 Basque 1 Belarusian 1 Bislama 1 Bosnian 1 Buginese 1 Bulgarian 1 Burmese 1 Catalan 1 Cebuano 1 Central Bikol 1 Central Khmer 1 Central Kurdish 1 Chamorro 1 Cherokee 1 Cheyenne 1 Choctaw 1 Church Slavic 1 Chuvash 1 Cornish 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Croatian 1 Czech 1 Danish 1 Dhivehi 1 Dimli (individual language) 1 Dzongkha 1 Eastern Mari 1 Erzya 1 Esperanto 1 Estonian 1 Ewe 1 Extremaduran 1 Faroese 1 Fiji Hindi 1 Fijian 1 Finnish 1 Friulian 1 Fulah 1 Gagauz 1 Galician 1 Gan Chinese 1 Ganda 1 Georgian 1 Gilaki 1 Goan Konkani 1 Gothic 1 Guarani 1 Haitian 1 Hakka Chinese 1 Hausa 1 Hawaiian 1 Hebrew 1 Herero 1 Hindi 1 Hiri Motu 1 Hungarian 1 Icelandic 1 Ido 1 Igbo 1 Iloko 1 Interlingua (International Auxiliary Language Association) 1 Interlingue 1 Inuktitut 1 Inupiaq 1 Irish 1 Jamaican Creole English 1 Javanese 1 Kabardian 1 Kabyle 1 Kalaallisut 1 Kalmyk 1 Kanuri 1 Kara-Kalpak 1 Karachay-Balkar 1 Kashmiri 1 Kashubian 1 Kazakh 1 Kikuyu 1 Kinyarwanda 1 Kirghiz 1 Komi 1 Komi-Permyak 1 Kongo 1 Korean 1 Kuanyama 1 Kurdish 1 Kölsch 1 Ladino 1 Lak 1 Lao 1 Latgalian 1 Latin 1 Latvian 1 Lezghian 1 Ligurian 1 Limburgan 1 Lingala 1 Lithuanian 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Luxembourgish 1 Macedonian 1 Maithili 1 Malagasy 1 Malay (macrolanguage) 1 Maltese 1 Manx 1 Maori 1 Marshallese 1 Mazanderani 1 Min Dong Chinese 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Modern Greek (1453-) 1 Moksha 1 Mongolian 1 Multilingual 1 Narom 1 Nauru 1 Navajo 1 Ndonga 1 Neapolitan 1 Nepali (macrolanguage) 1 Newari 1 Northern Frisian 1 Northern Luri 1 Northern Sami 1 Norwegian 1 Norwegian Nynorsk 1 Novial 1 Nyanja 1 Occitan (post 1500) 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Oromo 1 Ossetian 1 Pali 1 Pampanga 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Persian 1 Pfaelzisch 1 Picard 1 Piemontese 1 Pitcairn-Norfolk 1 Pontic 1 Punjabi 1 Pushto 1 Quechua 1 Romanian 1 Romansh 1 Rundi 1 Russia Buriat 1 Rusyn 1 Samoan 1 Sango 1 Sanskrit 1 Sardinian 1 Saterfriesisch 1 Scots 1 Scottish Gaelic 1 Serbian 1 Serbo-Croatian 1 Shona 1 Sichuan Yi 1 Sicilian 1 Silesian 1 Sindhi 1 Sinhala 1 Slovak 1 Slovenian 1 Somali 1 Southern Sotho 1 Sranan Tongo 1 Sundanese 1 Swahili (macrolanguage) 1 Swati 1 Swedish 1 Tagalog 1 Tahitian 1 Tajik 1 Tetum 1 Thai 1 Tibetan 1 Tigrinya 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tswana 1 Tulu 1 Tumbuka 1 Turkmen 1 Tuvinian 1 Twi 1 Udmurt 1 Uighur 1 Ukrainian 1 Upper Sorbian 1 Urdu 1 Uzbek 1 Venda 1 Venetian 1 Veps 1 Vietnamese 1 Vlaams 1 Vlax Romani 1 Volapük 1 Walloon 1 Waray (Philippines) 1 Welsh 1 Western Frisian 1 Western Mari 1 Western Panjabi 1 Wolof 1 Wu Chinese 1 Xhosa 1 Yakut 1 Yiddish 1 Yoruba 1 Zeeuws 1 Zhuang 1 Zulu 1 Akkadian 0 Akuntsu 0 American Sign Language 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Argentine Sign Language 0 Assyrian Neo-Aramaic 0 Bangala 0 Bangladeshi Sign Language 0 Bemba (Zambia) 0 Bhojpuri 0 Bodo (India) 0 Central Pashto 0 Chavacano 0 Chukot 0 Congo Swahili 0 Coptic 0 Dogri (individual language) 0 Dogri (macrolanguage) 0 Filipino 0 Fon 0 Geez 0 German Sign Language 0 Greek 0 Greek Sign Language 0 Gulf Arabic 0 Hakha Chin 0 Halh Mongolian 0 Iranian Persian 0 Jejueo 0 Kabuverdianu 0 Kachin 0 Karelian 0 Khunsari 0 Komi-Zyrian 0 Krio 0 Literary Chinese 0 Lozi 0 Lunda 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Lushai 0 Malay (individual language) 0 Mandarin Chinese 0 Manipuri 0 Mbyá Guaraní 0 Mesopotamian Arabic 0 Modern Greek 0 Moroccan Arabic 0 Mundurukú 0 Najdi Arabic 0 Naxi 0 Nayini 0 Nepali (individual language) 0 Nigerian Fulfulde 0 Nigerian Pidgin 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Uzbek 0 Norwegian Bokmål 0 Odia 0 Old French 0 Old Russian 0 Old Turkish 0 Plateau Malagasy 0 Portuguse 0 Rajasthani 0 Saidi Arabic 0 Santali 0 Shan 0 Skolt Sami 0 Soi 0 South Levantine Arabic 0 Southern Pashto 0 Standard Arabic 0 Standard Latvian 0 Swahili 0 Swedish Sign Language 0 Swiss German 0 Swiss-German Sign Language 0 Tai 0 Tonga (Zambia) 0 Tunisian Arabic 0 Tupinambá 0 Turkish Sign Language 0 Votic 0 Warlpiri 0 West Central Oromo 0 Yue Chinese 0 Zaza 0