Datasets

9,498 machine learning datasets
Filter by Task (clear)
Chinese Reading Comprehension Machine Translation 9 Question Answering 6 Cross-Lingual Transfer 5 Image Classification 4 Language Modelling 4 Natural Language Inference 4 Text Classification 4 Abstractive Text Summarization 3 Domain Adaptation 3 Natural Language Understanding 3 Sentiment Analysis 3 Speech Recognition 3 Text Summarization 3 Token Classification 3 Translation 3 Word Embeddings 3 Audio Source Separation 2 Code Generation 2 Cross-Lingual NER 2 Cross-Lingual POS Tagging 2 Entity Alignment 2 Gesture Generation 2 Information Retrieval 2 Knowledge Graphs 2 Language Identification 2 Misinformation 2 Multi-modal Entity Alignment 2 Multilingual NLP 2 Named Entity Recognition (NER) 2 Outlier Detection 2 Part-Of-Speech Tagging 2 Relation Extraction 2 Zero-Shot Cross-Lingual Transfer 2 3D Face Animation 1 Accented Speech Recognition 1 Anomaly Detection 1 Arithmetic Reasoning 1 Automatic Phoneme Recognition 1 Automatic Speech Recognition 1 Bias Detection 1 Blind Super-Resolution 1 Body Detection 1 Causal Inference 1 Chord Recognition 1 Citation Recommendation 1 Clustering Algorithms Evaluation 1 Code Summarization 1 Coreference Resolution 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Document Classification 1 Cross-Lingual Entity Linking 1 Cross-Lingual Natural Language Inference 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Dependency Parsing 1 Document Classification 1 Document Summarization 1 Document Translation 1 Entity Linking 1 FLUE 1 Face Detection 1 Facial Expression Recognition (FER) 1 Fake News Detection 1 Few-Shot Audio Classification 1 Few-shot NER 1 Fine-Grained Image Classification 1 Image Captioning 1 Image Retrieval 1 Image Super-Resolution 1 Image/Document Clustering 1 Intent Classification 1 Keyword Spotting 1 Knowledge Graph Completion 1 Knowledge Graph Embedding 1 Knowledge Graph Embeddings 1 LABELED_DEPENDENCIES 1 LEMMA 1 Low-Resource Neural Machine Translation 1 MORPH 1 Machine Reading Comprehension 1 Math Word Problem Solving 1 Mathematical Reasoning 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Morphological Analysis 1 Multi-task Language Understanding 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual Named Entity Recognition 1 Multilingual text classification 1 Multiple-choice 1 Music Information Retrieval 1 Named Entity Recognition 1 Natural Questions 1 News Recommendation 1 Node Classification 1 Object Detection 1 Open-Domain Question Answering 1 POS 1 Paraphrase Generation 1 Paraphrase Identification 1 Pretrained Multilingual Language Models 1 Reading Comprehension 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Recommendation Systems 1 Relation Classification 1 Relation Linking 1 SENTS 1 Semantic Parsing 1 Semantic Similarity 1 Semantic Textual Similarity 1 Slot Filling 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Speech-to-Text Translation 1 Spoken Language Understanding 1 Spoken language identification 1 TAG 1 Table Recognition 1 Text Generation 1 Text-To-SQL 1 Text-To-Speech Synthesis 1 UNLABELED_DEPENDENCIES 1 Urdu Speech Recognition 1 Vietnamese Machine Reading Comprehension 1 Visual Reasoning 1 Vocal ensemble separation 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 speech-recognition 1 text annotation 1
Filter by Language (clear)
Japanese English 5 Chinese 4 Afrikaans 1 Albanian 1 Arabic 1 Armenian 1 Assamese 1 Azerbaijani 1 Bambara 1 Basque 1 Bengali 1 Bulgarian 1 Burmese 1 Catalan 1 Cebuano 1 Central Khmer 1 Central Kurdish 1 Croatian 1 Czech 1 Danish 1 Dutch 1 Egyptian Arabic 1 Estonian 1 Finnish 1 French 1 Fulah 1 Ganda 1 Georgian 1 German 1 Greek 1 Guarani 1 Gujarati 1 Gulf Arabic 1 Haitian 1 Halh Mongolian 1 Hausa 1 Hebrew 1 Hindi 1 Hungarian 1 Icelandic 1 Igbo 1 Iloko 1 Indonesian 1 Iranian Persian 1 Italian 1 Javanese 1 Kabuverdianu 1 Kachin 1 Kannada 1 Kazakh 1 Kinyarwanda 1 Kirghiz 1 Korean 1 Lao 1 Latvian 1 Lingala 1 Lithuanian 1 Luo (Kenya and Tanzania) 1 Macedonian 1 Malagasy 1 Malay (individual language) 1 Malay (macrolanguage) 1 Malayalam 1 Maltese 1 Mandarin Chinese 1 Maori 1 Marathi 1 Mesopotamian Arabic 1 Mongolian 1 Moroccan Arabic 1 Multilingual 1 Najdi Arabic 1 Nepali (individual language) 1 Nepali (macrolanguage) 1 Nigerian Fulfulde 1 North Azerbaijani 1 North Levantine Arabic 1 Northern Uzbek 1 Norwegian 1 Norwegian Bokmål 1 Nyanja 1 Odia 1 Oriya (macrolanguage) 1 Oromo 1 Pedi 1 Persian 1 Plateau Malagasy 1 Polish 1 Portuguese 1 Punjabi 1 Romanian 1 Russian 1 Serbian 1 Serbo-Croatian 1 Shan 1 Shona 1 Sindhi 1 Sinhala 1 Slovak 1 Slovenian 1 Somali 1 South Azerbaijani 1 Southern Pashto 1 Southern Sotho 1 Spanish 1 Standard Arabic 1 Standard Latvian 1 Sundanese 1 Swahili 1 Swati 1 Swedish 1 Tagalog 1 Tajik 1 Tamil 1 Telugu 1 Thai 1 Tibetan 1 Tigrinya 1 Tosk Albanian 1 Tsonga 1 Tswana 1 Turkish 1 Ukrainian 1 Urdu 1 Uzbek 1 Vietnamese 1 Waray (Philippines) 1 West Central Oromo 1 Wolof 1 Xhosa 1 Yoruba 1 Zulu 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Akkadian 0 Akuntsu 0 American Sign Language 0 Amharic 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Aragonese 0 Argentine Sign Language 0 Arpitan 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Bangala 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Bavarian 0 Belarusian 0 Bemba (Zambia) 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Central Bikol 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Congo Swahili 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dhivehi 0 Dimli (individual language) 0 Dogri (individual language) 0 Dogri (macrolanguage) 0 Dzongkha 0 Eastern Mari 0 Erzya 0 Esperanto 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Fon 0 Friulian 0 Gagauz 0 Galician 0 Gan Chinese 0 Geez 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek Sign Language 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Ido 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Irish 0 Jamaican Creole English 0 Jejueo 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kalmyk 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Khunsari 0 Kikuyu 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Krio 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Latin 0 Lezghian 0 Ligurian 0 Limburgan 0 Literary Chinese 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Lozi 0 Lunda 0 Luo (Cameroon) 0 Lushai 0 Luxembourgish 0 Maithili 0 Manipuri 0 Manx 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Mundurukú 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Newari 0 Nigerian Pidgin 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Norwegian Nynorsk 0 Novial 0 Occitan (post 1500) 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Pontic 0 Portuguse 0 Pushto 0 Quechua 0 Rajasthani 0 Romansh 0 Rundi 0 Russia Buriat 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Skolt Sami 0 Soi 0 South Levantine Arabic 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Swedish Sign Language 0 Swiss German 0 Swiss-German Sign Language 0 Tahitian 0 Tai 0 Tatar 0 Tetum 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tonga (Zambia) 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkish Sign Language 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Upper Sorbian 0 Venda 0 Venetian 0 Veps 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Warlpiri 0 Welsh 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wu Chinese 0 Yakut 0 Yiddish 0 Yue Chinese 0 Zaza 0 Zeeuws 0 Zhuang 0