Datasets

6,527 machine learning datasets
Filter by Task
Speech Recognition 7 Language Identification 5 Machine Translation 5 Named Entity Recognition 5 Language Modelling 4 Cross-Lingual Transfer 3 Natural Language Inference 3 Question Answering 3 Sentiment Analysis 3 Abstractive Text Summarization 2 Automatic Speech Recognition 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Fake News Detection 2 Misinformation 2 Natural Language Understanding 2 Part-Of-Speech Tagging 2 Speaker Recognition 2 Text Classification 2 Text Summarization 2 Token Classification 2 Abusive Language 1 Aggression Identification 1 Audio-Visual Synchronization 1 Chinese Sentence Pair Classification 1 Chunking 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-lingual zero-shot dependency parsing 1 Dependency Parsing 1 Dialogue Generation 1 Domain Adaptation 1 Fact Verification 1 Few-shot NER 1 Handwriting Recognition 1 Hate Speech Detection 1 Image Classification 1 Keyword Spotting 1 Low-Resource Neural Machine Translation 1 Multi-task Audio Source Seperation 1 Multimodal Machine Translation 1 Multiple Choice Question Answering (MCQA) 1 Multlingual Neural Machine Translation 1 NER 1 News Classification 1 Node Classification 1 Optical Character Recognition 1 Reading Comprehension 1 Resynthesis 1 Sarcasm Detection 1 Sentence Embeddings 1 Speech Denoising 1 Speech Enhancement 1 Spoken language identification 1 Transliteration 1 Video Captioning 1 Vision and Language Navigation 1 Visual Question Answering 1 Wikipedia Summarization 1 Word Embeddings 1 audio-visual learning 1
Filter by Language (clear)
Hindi English 1644 Chinese 258 German 128 French 102 Spanish 89 Russian 83 Japanese 69 Italian 65 Arabic 57 Portuguese 54 Korean 50 Turkish 43 Vietnamese 34 Dutch 32 Tamil 30 Persian 29 Bengali 28 Indonesian 28 Polish 28 Czech 27 Danish 24 Finnish 24 Romanian 24 Telugu 22 Malayalam 21 Urdu 21 Multilingual 20 Thai 20 Mandarin Chinese 18 Marathi 18 Estonian 16 Swedish 16 Basque 15 Gujarati 15 Hebrew 15 Hungarian 15 Bulgarian 14 Kannada 14 Greek 13 Punjabi 13 Kazakh 12 Norwegian 12 Ukrainian 12 Catalan 11 Slovak 11 Slovenian 11 Croatian 10 Latvian 10 Serbian 10 Swahili 10 Albanian 9 Amharic 9 Armenian 9 Assamese 9 Lithuanian 9 Welsh 9 Breton 8 Mongolian 8 Oriya (macrolanguage) 8 Sinhala 8 Georgian 7 Icelandic 7 Macedonian 7 Maltese 7 Esperanto 6 Irish 6 Kurdish 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Belarusian 5 Burmese 5 Filipino 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Bosnian 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Malay (individual language) 4 Nepali (macrolanguage) 4 Norwegian Nynorsk 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Tibetan 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Faroese 3 Fon 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Odia 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Uighur 3 Western Panjabi 3 Yiddish 3 Asturian 2 Avaric 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Norwegian Bokmål 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Piemontese 2 Portuguse 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Saidi Arabic 0 Santali 0

49 dataset results for Hindi