Datasets

6,539 machine learning datasets
Filter by Task
Machine Translation 5 Speech Recognition 5 Language Modelling 4 Question Answering 4 Cross-Lingual Transfer 3 Language Identification 3 Machine Reading Comprehension 3 Reading Comprehension 3 Abstractive Text Summarization 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Named Entity Recognition 2 Natural Language Inference 2 Part-Of-Speech Tagging 2 Text Summarization 2 Token Classification 2 Aspect-Based Sentiment Analysis 1 Automatic Speech Recognition 1 Chinese Sentence Pair Classification 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Data Augmentation 1 Dependency Parsing 1 Dialogue Generation 1 Domain Adaptation 1 Few-shot NER 1 French Text Diacritization 1 Handwriting Recognition 1 Hungarian Text Diacritization 1 Image Captioning 1 Intent Detection 1 Irish Text Diacritization 1 Latvian Text Diacritization 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Multilingual NLP 1 Named Entity Recognition In Vietnamese 1 Natural Language Understanding 1 Open-Domain Question Answering 1 Opinion Mining 1 Optical Character Recognition 1 Paraphrase Generation 1 Polish Text Diacritization 1 Romanian Text Diacritization 1 Semantic Parsing 1 Sentiment Analysis (Product + User) 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speaker Recognition 1 Speech-to-Text Translation 1 Spelling Correction 1 Text Classification 1 Text-To-Sql 1 Toxic Comment Classification 1 Translation 1 Turkish Text Diacritization 1 Vietnamese Machine Reading Comprehension 1 Vietnamese Text Diacritization 1 Visual Reasoning 1 Word Embeddings 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Transfer 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1
Filter by Language (clear)
Vietnamese English 1653 Chinese 258 German 128 French 103 Spanish 89 Russian 83 Japanese 69 Italian 65 Arabic 57 Portuguese 54 Korean 50 Hindi 49 Turkish 43 Dutch 33 Tamil 30 Persian 29 Bengali 28 Indonesian 28 Polish 28 Czech 27 Danish 24 Finnish 24 Romanian 24 Telugu 22 Malayalam 21 Multilingual 21 Urdu 21 Thai 20 Mandarin Chinese 18 Marathi 18 Estonian 16 Swedish 16 Basque 15 Gujarati 15 Hebrew 15 Hungarian 15 Bulgarian 14 Kannada 14 Greek 13 Punjabi 13 Kazakh 12 Norwegian 12 Ukrainian 12 Catalan 11 Slovak 11 Slovenian 11 Croatian 10 Latvian 10 Serbian 10 Swahili 10 Albanian 9 Amharic 9 Armenian 9 Assamese 9 Lithuanian 9 Welsh 9 Breton 8 Irish 8 Mongolian 8 Oriya (macrolanguage) 8 Sinhala 8 Georgian 7 Icelandic 7 Macedonian 7 Maltese 7 Esperanto 6 Kurdish 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Belarusian 5 Burmese 5 Filipino 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Bosnian 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Malay (individual language) 4 Nepali (macrolanguage) 4 Norwegian Nynorsk 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Tibetan 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Faroese 3 Fon 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Odia 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Uighur 3 Western Panjabi 3 Yiddish 3 Asturian 2 Avaric 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Norwegian Bokmål 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Piemontese 2 Portuguse 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Saidi Arabic 0 Santali 0

34 dataset results for Vietnamese