Datasets

13,352 machine learning datasets

51 dataset results for Dutch