Datasets

10,034 machine learning datasets
Filter by Task
Language Modelling 4 Cross-Lingual Transfer 3 Language Identification 3 Machine Translation 3 Named Entity Recognition (NER) 3 Question Answering 3 Speech Recognition 3 Text Generation 3 Instruction Following 2 Max-Shot Cross-Lingual Visual Reasoning 2 Misinformation 2 Reading Comprehension 2 Sentiment Analysis 2 Text Classification 2 Visual Reasoning 2 Word Embeddings 2 Zero-Shot Cross-Lingual Transfer 2 Zero-Shot Cross-Lingual Visual Reasoning 2 Abstractive Text Summarization 1 Chinese Reading Comprehension 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual NER 1 Cross-Lingual POS Tagging 1 Cross-lingual zero-shot dependency parsing 1 Data-to-Text Generation 1 Dependency Parsing 1 Document Translation 1 Fact Verification 1 Fake News Detection 1 Few-Shot Audio Classification 1 Hope Speech Detection 1 Hope Speech Detection for English 1 Hope Speech Detection for Malayalam 1 Hope Speech Detection for Tamil 1 Image Classification 1 KG-to-Text Generation 1 Keyword Spotting 1 LABELED_DEPENDENCIES 1 LEMMA 1 Low-Resource Neural Machine Translation 1 MORPH 1 Machine Reading Comprehension 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Meme Classification 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual NLP 1 Multilingual text classification 1 Multiple Choice Question Answering (MCQA) 1 Multiple-choice 1 Multlingual Neural Machine Translation 1 Natural Language Inference 1 Natural Language Understanding 1 Natural Questions 1 News Classification 1 News Recommendation 1 Node Classification 1 Open-Domain Question Answering 1 POS 1 Part-Of-Speech Tagging 1 Pretrained Multilingual Language Models 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Recommendation Systems 1 SENTS 1 Sentence Embeddings 1 Speech Denoising 1 Speech Synthesis - Assamese 1 Speech Synthesis - Bengali 1 Speech Synthesis - Bodo 1 Speech Synthesis - Gujarati 1 Speech Synthesis - Hindi 1 Speech Synthesis - Kannada 1 Speech Synthesis - Malayalam 1 Speech Synthesis - Manipuri 1 Speech Synthesis - Marathi 1 Speech Synthesis - Rajasthani 1 Speech Synthesis - Tamil 1 Speech Synthesis - Telugu 1 Speech-to-Speech Translation 1 Speech-to-Text Translation 1 Spoken language identification 1 TAG 1 Text Summarization 1 Text-To-Speech Synthesis 1 Token Classification 1 Translation 1 Translation deu-eng 1 Translation eng-deu 1 Transliteration 1 UNLABELED_DEPENDENCIES 1 Unsupervised Extractive Summarization 1 Unsupervised Machine Translation 1 Urdu Speech Recognition 1 Vietnamese Machine Reading Comprehension 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-shot Cross-lingual Fact-checking 1 de-en 1 es-en 1 fr-en 1
Filter by Language (clear)
Tamil English 2995 Chinese 334 German 170 French 151 Spanish 127 Russian 111 Japanese 86 Arabic 84 Italian 81 Portuguese 78 Hindi 69 Vietnamese 60 Korean 54 Turkish 50 Bengali 47 Persian 44 Dutch 42 Polish 38 Czech 35 Indonesian 34 Danish 32 Finnish 31 Romanian 30 Telugu 30 Multilingual 29 Urdu 27 Marathi 25 Hungarian 24 Swedish 24 Thai 24 Greek 22 Malayalam 22 Mandarin Chinese 22 Gujarati 21 Estonian 20 Hebrew 20 Basque 19 Bulgarian 19 Kannada 18 Punjabi 17 Slovak 17 Ukrainian 16 Croatian 15 Slovenian 15 Swahili 15 Catalan 14 Latvian 14 Norwegian 14 Lithuanian 13 Assamese 12 Iranian Persian 12 Kazakh 12 Serbian 12 Amharic 11 Irish 10 Kurdish 10 Maltese 10 Albanian 9 Armenian 9 Oriya (macrolanguage) 9 Welsh 9 American Sign Language 8 Georgian 8 Mongolian 8 Sanskrit 8 Sinhala 8 Tagalog 8 Yoruba 8 Azerbaijani 7 Breton 7 Burmese 7 Hausa 7 Icelandic 7 Igbo 7 Macedonian 7 Serbo-Croatian 7 Somali 7 Uzbek 7 Afrikaans 6 Central Kurdish 6 Esperanto 6 Galician 6 Guarani 6 Haitian 6 Odia 6 Oromo 6 Sindhi 6 Bambara 5 Belarusian 5 Egyptian Arabic 5 Filipino 5 Javanese 5 Latin 5 Malagasy 5 Malay (individual language) 5 Nepali (macrolanguage) 5 Norwegian Bokmål 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Standard Arabic 5 Sundanese 5 Tibetan 5 Tigrinya 5 Wolof 5 Bangala 4 Cebuano 4 Central Khmer 4 Chechen 4 Dhivehi 4 Fulah 4 Ganda 4 Iloko 4 Kabyle 4 Kinyarwanda 4 Kirghiz 4 Lao 4 Lingala 4 Nigerian Pidgin 4 Nyanja 4 South Azerbaijani 4 Tatar 4 Upper Sorbian 4 Western Panjabi 4 Aragonese 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Bosnian 3 Chuvash 3 Erzya 3 Faroese 3 Fon 3 Goan Konkani 3 Maithili 3 Malay (macrolanguage) 3 Romansh 3 Russia Buriat 3 Swati 3 Swiss German 3 Tajik 3 Tsonga 3 Tswana 3 Twi 3 Uighur 3 Waray (Philippines) 3 Xhosa 3 Yiddish 3 Argentine Sign Language 2 Asturian 2 Avaric 2 Aymara 2 Bangladeshi Sign Language 2 Bhojpuri 2 Central Bikol 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Ewe 2 German Sign Language 2 Gothic 2 Gulf Arabic 2 Ido 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luo (Kenya and Tanzania) 2 Luxembourgish 2 Manipuri 2 Manx 2 Maori 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Moroccan Arabic 2 Naxi 2 Neapolitan 2 Nepali (individual language) 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Pedi 2 Piemontese 2 Pushto 2 Rundi 2 Sardinian 2 Shona 2 Sichuan Yi 2 Sicilian 2 Southern Sotho 2 Swiss-German Sign Language 2 Tai 2 Tosk Albanian 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Udmurt 2 Venetian 2 Volapük 2 Walloon 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Zulu 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 French Sign Language 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kabuverdianu 1 Kachin 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Marshallese 1 Mbyá Guaraní 1 Mesopotamian Arabic 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Mundurukú 1 Najdi Arabic 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nigerian Fulfulde 1 North Azerbaijani 1 North Levantine Arabic 1 Northern Uzbek 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Plateau Malagasy 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shan 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Pashto 1 Sranan Tongo 1 Standard Latvian 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 West Central Oromo 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Northern Huishui Hmong 0 Portuguse 0 Saidi Arabic 0 Santali 0

41 dataset results for Tamil