Datasets

9,772 machine learning datasets
Filter by Task (clear)
Claim Extraction with Stance Classification (CESC) Machine Translation 20 Question Answering 13 Named Entity Recognition (NER) 11 Speech Recognition 10 Automatic Post-Editing 7 Cross-Lingual Transfer 7 Language Modelling 7 Token Classification 7 Data Augmentation 6 Information Retrieval 6 Text Classification 6 Text Simplification 6 Automatic Speech Recognition (ASR) 5 Cross-Lingual NER 5 Handwriting Recognition 5 Reading Comprehension 5 Relation Extraction 5 Text Summarization 5 Word Embeddings 5 Entity Linking 4 Handwriting generation 4 Language Identification 4 Abstractive Text Summarization 3 Classification 3 Cross-Lingual Question Answering 3 Misinformation 3 Multilingual Named Entity Recognition 3 Part-Of-Speech Tagging 3 Relation Classification 3 Sentiment Analysis 3 Slot Filling 3 Text-To-Speech Synthesis 3 Translation 3 Automatic Speech Recognition 2 Coreference Resolution 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Paraphrase Identification 2 Discourse Parsing 2 Discourse Segmentation 2 Document Summarization 2 Domain Adaptation 2 FLUE 2 Multilingual text classification 2 Named Entity Recognition 2 Natural Language Inference 2 Natural Language Understanding 2 Open-Domain Question Answering 2 POS 2 Paraphrase Identification 2 Semantic Role Labeling 2 Semantic Similarity 2 Sentence Embeddings 2 Sequence-to-sequence Language Modeling 2 Sign Language Recognition 2 Speech Enhancement 2 Speech Synthesis 2 Speech-to-Speech Translation 2 Speech-to-Text Translation 2 Text Categorization 2 Text Generation 2 Unsupervised Machine Translation 2 Zero-Shot Cross-Lingual Transfer 2 2D Object Detection 1 3D Object Detection 1 Accented Speech Recognition 1 Arithmetic Reasoning 1 Audio Signal Processing 1 Audio Source Separation 1 Audio Synthesis 1 Automatic Lyrics Transcription 1 Automatic Phoneme Recognition 1 Bias Detection 1 Causal Language Modeling 1 Chinese Reading Comprehension 1 Chinese Sentence Pair Classification 1 Chunking 1 Citation Recommendation 1 Classification of toxic, engaging, fact-claiming comments 1 Code Generation 1 Connective Detection 1 Constituency Parsing 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Entity Linking 1 Cross-Lingual Sentiment Classification 1 Cross-Modal Retrieval 1 Cross-lingual zero-shot dependency parsing 1 Dependency Parsing 1 Dialogue Evaluation 1 Dialogue Generation 1 Dialogue Understanding 1 Document Classification 1 Document Translation 1 Embeddings Evaluation 1 Emotion Classification 1 Emotion Recognition 1 Emotional Speech Synthesis 1 Entity Embeddings 1 Explainable Artificial Intelligence (XAI) 1 FG-1-PG-1 1 Fact Verification 1 Fake News Detection 1 Fault Detection 1 Few-Shot Audio Classification 1 Few-shot NER 1 Gloss-free Sign Language Translation 1 Hand Gesture Recognition 1 Handwritten Text Recognition 1 Hate Speech Detection 1 Image Captioning 1 Image Classification 1 Image Retrieval 1 Implicit Discourse Relation Classification 1 Intent Classification 1 Interpretable Machine Learning 1 Keyword Spotting 1 Knowledge Base Question Answering 1 Knowledge Graphs 1 LABELED_DEPENDENCIES 1 LEMMA 1 Language Acquisition 1 Lip Reading 1 Lip to Speech Synthesis 1 Lipreading 1 Low Resource Named Entity Recognition 1 MORPH 1 Machine Reading Comprehension 1 Masked Language Modeling 1 Math Word Problem Solving 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Morphological Analysis 1 Multi-Label Text Classification 1 Multi-Task Learning 1 Multi-task Language Understanding 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual NLP 1 Multimodal Lexical Translation 1 Multimodal Machine Translation 1 Multimodal Text Prediction 1 Multiple-choice 1 Multiview Clustering 1 NER 1 Natural Questions 1 Node Classification 1 Object Detection 1 Open Information Extraction 1 Outlier Detection 1 Paraphrase Generation 1 Passage Retrieval 1 Persona Dialogue in Story 1 Pretrained Multilingual Language Models 1 Propaganda detection 1 Question Generation 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Relation Linking 1 SENTS 1 Semantic Segmentation 1 Sentence Classification 1 Sentence Embedding 1 Sign Language Translation 1 Speaker Attribution in German Parliamentary Debates (GermEval 2023, subtask 1) 1 Speaker Attribution in German Parliamentary Debates (GermEval 2023, subtask 2) 1 Speech Denoising 1 Speech Emotion Recognition 1 Speech Separation 1 Spoken Language Understanding 1 Spoken language identification 1 TAG 1 Talking Face Generation 1 Temporal Tagging 1 Text Complexity Assessment (GermEval 2022) 1 Text Segmentation 1 Text-To-SQL 1 Time Series Analysis 1 Time Series Regression 1 Translation deu-eng 1 Translation eng-deu 1 UNLABELED_DEPENDENCIES 1 Unconstrained Lip-synchronization 1 Unfairness Detection 1 Urdu Speech Recognition 1 Variable Detection 1 Variable Disambiguation 1 Vietnamese Machine Reading Comprehension 1 Visual Reasoning 1 Visual Speech Recognition 1 Weakly-Supervised Named Entity Recognition 1 Word Alignment 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-shot Cross-lingual Fact-checking 1 audio-visual learning 1 speech-recognition 1 text annotation 1
Filter by Language (clear)
German English 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Afrikaans 0 Akan 0 Akkadian 0 Akuntsu 0 Albanian 0 American Sign Language 0 Amharic 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Arabic 0 Aragonese 0 Argentine Sign Language 0 Armenian 0 Arpitan 0 Assamese 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bambara 0 Bangala 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Basque 0 Bavarian 0 Belarusian 0 Bemba (Zambia) 0 Bengali 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Bulgarian 0 Burmese 0 Catalan 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Chinese 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Congo Swahili 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Croatian 0 Czech 0 Danish 0 Dhivehi 0 Dimli (individual language) 0 Dogri (individual language) 0 Dogri (macrolanguage) 0 Dutch 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 Erzya 0 Esperanto 0 Estonian 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Finnish 0 Fon 0 French 0 Friulian 0 Fulah 0 Gagauz 0 Galician 0 Gan Chinese 0 Ganda 0 Geez 0 Georgian 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek 0 Greek Sign Language 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Halh Mongolian 0 Hausa 0 Hawaiian 0 Hebrew 0 Herero 0 Hindi 0 Hiri Motu 0 Hungarian 0 Icelandic 0 Ido 0 Igbo 0 Iloko 0 Indonesian 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Irish 0 Italian 0 Jamaican Creole English 0 Japanese 0 Javanese 0 Jejueo 0 Kabardian 0 Kabuverdianu 0 Kabyle 0 Kachin 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Kazakh 0 Khunsari 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Korean 0 Krio 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Latin 0 Latvian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Literary Chinese 0 Lithuanian 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Lozi 0 Lunda 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Lushai 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Maltese 0 Mandarin Chinese 0 Manipuri 0 Manx 0 Maori 0 Marathi 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Mesopotamian Arabic 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Mongolian 0 Moroccan Arabic 0 Multilingual 0 Mundurukú 0 Najdi Arabic 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Fulfulde 0 Nigerian Pidgin 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Northern Uzbek 0 Norwegian 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Persian 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Plateau Malagasy 0 Polish 0 Pontic 0 Portuguese 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romanian 0 Romansh 0 Rundi 0 Russia Buriat 0 Russian 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Serbian 0 Serbo-Croatian 0 Shan 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Skolt Sami 0 Slovak 0 Slovenian 0 Soi 0 Somali 0 South Azerbaijani 0 South Levantine Arabic 0 Southern Pashto 0 Southern Sotho 0 Spanish 0 Sranan Tongo 0 Standard Arabic 0 Standard Latvian 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swedish 0 Swedish Sign Language 0 Swiss German 0 Swiss-German Sign Language 0 Tagalog 0 Tahitian 0 Tai 0 Tajik 0 Tamil 0 Tatar 0 Telugu 0 Tetum 0 Thai 0 Tibetan 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tonga (Zambia) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkish 0 Turkish Sign Language 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Ukrainian 0 Upper Sorbian 0 Urdu 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vietnamese 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 Warlpiri 0 Welsh 0 West Central Oromo 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wolof 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Yoruba 0 Yue Chinese 0 Zaza 0 Zeeuws 0 Zhuang 0 Zulu 0