Datasets

5,420 machine learning datasets
Filter by Task
Machine Translation 20 Question Answering 9 Named Entity Recognition 8 Automatic Post-Editing 7 Cross-Lingual Transfer 7 Speech Recognition 7 Data Augmentation 6 Language Modelling 6 Handwriting Recognition 5 Information Retrieval 5 Word Embeddings 5 Handwriting generation 4 Reading Comprehension 4 Token Classification 4 Abstractive Text Summarization 3 Cross-Lingual NER 3 Domain Adaptation 3 Language Identification 3 Relation Extraction 3 Text Summarization 3 Coreference Resolution 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Discourse Parsing 2 Discourse Segmentation 2 Entity Linking 2 Image Captioning 2 Multimodal Machine Translation 2 Natural Language Inference 2 Open-Domain Question Answering 2 Part-Of-Speech Tagging 2 Relation Classification 2 Sentence Embeddings 2 Sentiment Analysis 2 Sequence-to-sequence Language Modeling 2 Slot Filling 2 Speech-to-Text Translation 2 Text Categorization 2 Text Simplification 2 Unsupervised Machine Translation 2 3D Object Detection 1 Accented Speech Recognition 1 Audio Source Separation 1 Chinese Sentence Pair Classification 1 Chunking 1 Classification of toxic, engaging, fact-claiming comments 1 Constituency Parsing 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Entity Linking 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Sentiment Classification 1 Cross-Modal Retrieval 1 Cross-lingual zero-shot dependency parsing 1 Dependency Parsing 1 Dialogue Generation 1 Document Classification 1 Emotion Classification 1 Emotion Recognition 1 Entity Embeddings 1 Fact Verification 1 Fault Detection 1 Image Retrieval 1 Implicit Discourse Relation Classification 1 Intent Classification 1 Interpretable Machine Learning 1 Keyword Spotting 1 Knowledge Graphs 1 Language Acquisition 1 Low Resource Named Entity Recognition 1 Misinformation 1 Morphological Analysis 1 Multi-Task Learning 1 Multilingual Named Entity Recognition 1 Multilingual text classification 1 Multimodal Lexical Translation 1 Multimodal Text Prediction 1 Natural Language Understanding 1 Open Information Extraction 1 Outlier Detection 1 Paraphrase Generation 1 Paraphrase Identification 1 Passage Retrieval 1 Semantic Role Labeling 1 Sentence Embedding 1 Sign Language Recognition 1 Speech Enhancement 1 Speech Quality 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken Language Understanding 1 Spoken language identification 1 Text Generation 1 Time Series 1 Time Series Regression 1 Translation 1 Unfairness Detection 1 Weakly-Supervised Named Entity Recognition 1 Word Alignment 1 Zero-Shot Cross-Lingual Transfer 1 connective detection 1
Filter by Language (clear)
German English 1245 Chinese 167 French 84 Russian 76 Spanish 74 Japanese 56 Arabic 53 Italian 53 Portuguese 48 Turkish 41 Korean 36 Hindi 35 Dutch 31 Vietnamese 29 Persian 27 Polish 25 Czech 24 Finnish 24 Tamil 24 Romanian 23 Bengali 21 Indonesian 21 Multilingual 19 Telugu 19 Urdu 18 Thai 17 Basque 15 Estonian 15 Malayalam 15 Mandarin Chinese 15 Marathi 15 Swedish 15 Hungarian 14 Bulgarian 13 Kannada 13 Danish 12 Gujarati 12 Catalan 11 Hebrew 11 Norwegian 11 Ukrainian 11 Greek 10 Latvian 10 Punjabi 10 Slovak 10 Slovenian 10 Amharic 9 Croatian 9 Kazakh 9 Serbian 9 Swahili 9 Welsh 9 Albanian 8 Armenian 8 Assamese 8 Breton 8 Lithuanian 8 Sinhala 8 Georgian 7 Mongolian 7 Esperanto 6 Icelandic 6 Irish 6 Kurdish 6 Macedonian 6 Maltese 6 Oriya (macrolanguage) 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Belarusian 4 Bosnian 4 Burmese 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Nepali (macrolanguage) 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Filipino 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Malay (individual language) 3 Norwegian Nynorsk 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Tibetan 3 Uighur 3 Yiddish 3 Asturian 2 Avaric 2 Bashkir 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Faroese 2 Fon 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Odia 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Western Panjabi 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Malay (macrolanguage) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Norwegian Bokmål 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Portuguse 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Santali 0

115 dataset results for German