Datasets

11,849 machine learning datasets
Filter by Language (clear)
Nauru English 3843 Chinese 440 German 192 French 184 Spanish 151 Russian 136 Japanese 106 Arabic 103 Italian 95 Portuguese 90 Hindi 82 Vietnamese 70 Korean 67 Bengali 66 Turkish 60 Persian 54 Dutch 50 Tamil 49 Indonesian 44 Polish 44 Czech 41 Danish 35 Finnish 35 Telugu 35 Romanian 34 Thai 33 Urdu 33 Marathi 31 Hungarian 29 Multilingual 29 Swedish 29 Greek 26 Gujarati 26 Mandarin Chinese 25 Estonian 24 Hebrew 24 Malayalam 23 Bulgarian 22 Ukrainian 22 Basque 21 Kannada 19 Punjabi 19 Catalan 18 Croatian 18 Swahili 18 Lithuanian 17 Slovak 17 Latvian 16 Serbian 16 Slovenian 16 Kazakh 15 Norwegian 15 Amharic 14 Iranian Persian 14 Albanian 12 Kurdish 12 Assamese 11 Sinhala 11 Tagalog 11 Yoruba 11 Armenian 10 Azerbaijani 10 Filipino 10 Irish 10 Macedonian 10 Welsh 10 American Sign Language 9 Burmese 9 Georgian 9 Maltese 9 Mongolian 9 Sanskrit 9 Breton 8 Galician 8 Hausa 8 Igbo 8 Odia 8 Oriya (macrolanguage) 8 Esperanto 7 Nepali (macrolanguage) 7 Oromo 7 Uzbek 7 Afrikaans 6 Bambara 6 Belarusian 6 Central Khmer 6 Guarani 6 Icelandic 6 Javanese 6 Malagasy 6 Nepali (individual language) 6 Nigerian Pidgin 6 Serbo-Croatian 6 Somali 6 Sundanese 6 Western Panjabi 6 Wolof 6 Bosnian 5 Central Kurdish 5 Fon 5 Ganda 5 Haitian 5 Latin 5 Malay (individual language) 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Sindhi 5 Standard Arabic 5 Tibetan 5 Tigrinya 5 Aymara 4 Bangala 4 Chechen 4 Dhivehi 4 Egyptian Arabic 4 Ewe 4 Kabyle 4 Lingala 4 Norwegian Bokmål 4 Tatar 4 Tetum 4 Tswana 4 Twi 4 Upper Sorbian 4 Xhosa 4 Aragonese 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Cebuano 3 Chuvash 3 Erzya 3 Faroese 3 Fulah 3 Goan Konkani 3 Iloko 3 Interlingue 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Luo (Kenya and Tanzania) 3 Maithili 3 Nyanja 3 Occitan (post 1500) 3 Romansh 3 Rundi 3 Russia Buriat 3 Sardinian 3 South Azerbaijani 3 Swiss German 3 Turkmen 3 Uighur 3 Yiddish 3 Zulu 3 Argentine Sign Language 2 Asturian 2 Avaric 2 Bangladeshi Sign Language 2 Bhojpuri 2 Central Bikol 2 Central Pashto 2 Cherokee 2 Church Slavic 2 Cornish 2 Corsican 2 Dimli (individual language) 2 Eastern Mari 2 German Sign Language 2 Gothic 2 Ido 2 Inuktitut 2 Jamaican Creole English 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Maori 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Mossi 2 Naxi 2 Neapolitan 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Old Spanish 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Shona 2 Sichuan Yi 2 Sicilian 2 Southern Pashto 2 Swati 2 Swiss-German Sign Language 2 Tai 2 Tajik 2 Tsonga 2 Turkish Sign Language 2 Tuvinian 2 Udmurt 2 Venda 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ambonese Malay 1 Ancient Greek 1 Ancient Hebrew 1 Andaman Creole Hindi 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Cree 1 Creek 1 Crimean Tatar 1 Cusco Quechua 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 French Sign Language 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Kabardian 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kupang Malay 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Makasar 1 Malayic Dayak 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Navajo 1 Nayini 1 Ndonga 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tosk Albanian 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Uab Meto 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Kabuverdianu 0 Kachin 0 Lingua Franca 0 Mesopotamian Arabic 0 Najdi Arabic 0 Nigerian Fulfulde 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Huishui Hmong 0 Northern Uzbek 0 Plateau Malagasy 0 Portuguse 0 Saidi Arabic 0 Santali 0 Shan 0 Standard Latvian 0 Thai Song 0 Tunisian Sign Language 0 West Central Oromo 0