Datasets

3,465 Machine Learning Datasets
Filter by Task
Question Answering 80 Language Modelling 44 Reading Comprehension 36 Machine Translation 29 Text Generation 24 Natural Language Inference 19 Named Entity Recognition 18 Text Summarization 18 Visual Question Answering 18 Sentiment Analysis 17 Speech Recognition 17 Information Retrieval 15 Text Classification 15 Word Embeddings 15 Common Sense Reasoning 13 Coreference Resolution 13 Abstractive Text Summarization 12 Data Augmentation 12 Machine Reading Comprehension 11 Natural Language Understanding 11 Relation Extraction 10 Domain Adaptation 9 Automatic Post-Editing 8 Data-to-Text Generation 8 Entity Linking 8 Image Classification 8 Open-Domain Question Answering 8 Semantic Textual Similarity 8 Cross-Lingual Transfer 7 Document Summarization 7 Paraphrase Identification 7 Semantic Parsing 7 Video Question Answering 7 Emotion Recognition 6 Sentence Embeddings 6 Video Captioning 6 Word Sense Disambiguation 6 Dialogue State Tracking 5 Emotion Classification 5 Emotion Recognition in Conversation 5 Goal-Oriented Dialog 5 Image Captioning 5 Language Identification 5 Link Prediction 5 Multi-Task Learning 5 Part-Of-Speech Tagging 5 Recommendation Systems 5 Scene Text Detection 5 Sentence Classification 5 Spoken Language Understanding 5 Stochastic Optimization 5 Video Retrieval 5 Chinese Reading Comprehension 4 Image Clustering 4 Intent Detection 4 Knowledge Graphs 4 Paraphrase Generation 4 Relation Classification 4 Scientific Results Extraction 4 Stance Detection 4 Visual Dialog 4 Visual Reasoning 4 Weakly-Supervised Named Entity Recognition 4 Abusive Language 3 Anomaly Detection 3 Audio Source Separation 3 Conversational Response Selection 3 Decision Making 3 Dependency Parsing 3 Dialogue Generation 3 Entity Typing 3 Fake News Detection 3 Joint Entity and Relation Extraction 3 Misinformation 3 Multi-Document Summarization 3 Multi-Label Classification 3 Multimodal Machine Translation 3 Nested Named Entity Recognition 3 Node Classification 3 Opinion Mining 3 Optical Character Recognition 3 Outlier Detection 3 Sarcasm Detection 3 Semantic Role Labeling 3 Slot Filling 3 Speech Enhancement 3 Task-Oriented Dialogue Systems 3 Word Alignment 3 2 2D Object Detection 2 Action Recognition 2 Answer Selection 2 Causal Inference 2 Change Point Detection 2 Chatbot 2 Citation Intent Classification 2 Click-Through Rate Prediction 2 Cross-Modal Retrieval 2 Dialog Relation Extraction 2 Dialogue Act Classification 2 Distant Speech Recognition 2 Document Classification 2 Document Ranking 2 Entity Disambiguation 2 Extractive Text Summarization 2 Fine-Grained Image Classification 2 Grammatical Error Correction 2 Handwriting Verification 2 Image Generation 2 Image Retrieval 2 Learning-To-Rank 2 Linguistic Acceptability 2 Math Word Problem Solving 2 Metric Learning 2 Multimodal Sentiment Analysis 2 Negation Detection 2 Nested Mention Recognition 2 Network Embedding 2 Object Recognition 2 Quantization 2 Question Generation 2 Referring Expression Segmentation 2 Scene Text Recognition 2 Self-Supervised Learning 2 Sentence Embedding 2 Speech Dereverberation 2 Spoken Dialogue Systems 2 Text Categorization 2 Text-To-Sql 2 Transliteration 2 Vision and Language Navigation 2 Vision-Language Navigation 2 Visual Navigation 2 Weakly Supervised Classification 2 Weakly-Supervised Object Localization 2 text-based games 2 2D Human Pose Estimation 1 3D Action Recognition 1 3D Object Classification 1 AMR Parsing 1 Accented Speech Recognition 1 Action Classification 1 Action Recognition In Videos 1 Active Learning 1 Activity Recognition 1 Adversarial Defense 1 Atari Games 1 Audio Super-Resolution 1 Automated Theorem Proving 1 Bias Detection 1 Binarization 1 Biologically-plausible Training 1 COVID-19 Diagnosis 1 Camouflaged Object Segmentation 1 Cardiac Segmentation 1 Causal Emotion Entailment 1 Chinese Named Entity Recognition 1 Chord Recognition 1 Chunking 1 Classification Consistency 1 Classification with Binary Weight Network 1 Color Image Denoising 1 Community Question Answering 1 Component Classification 1 Conditional Image Generation 1 Constituency Parsing 1 Continual Learning 1 Continuous Control 1 Core set discovery 1 Cross Document Coreference Resolution 1 Cross-Domain Named Entity Recognition 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Curved Text Detection 1 Definition Extraction 1 Dense Video Captioning 1 Density Estimation 1 Depth Estimation 1 Dialogue Management 1 Dialogue Understanding 1 Distractor Generation 1 Domain Generalization 1 Drone Pose Estimation 1 End-To-End Dialogue Modelling 1 End-To-End Speech Recognition 1 Entity Alignment 1 Entity Embeddings 1 Epidemiology 1 Event Coreference Resolution 1 Event Extraction 1 Extreme Multi-Label Classification 1 Extreme Summarization 1 Face Presentation Attack Detection 1 Face Sketch Synthesis 1 Fact Verification 1 Fairness 1 Feature Importance 1 Few-Shot Image Classification 1 Few-Shot Learning 1 Few-Shot Relation Classification 1 Fine-Grained Opinion Analysis 1 Generative Question Answering 1 Goal-Oriented Dialogue Systems 1 Grammatical Error Detection 1 Graph Classification 1 Graph Embedding 1 Graph Generation 1 Graph-to-Sequence 1 Handwriting Recognition 1 Handwritten Chinese Text Recognition 1 Hate Speech Detection 1 Human Pose Forecasting 1 Human robot interaction 1 Humor Detection 1 Hypernym Discovery 1 Image Forensics 1 Image Inpainting 1 Image Manipulation 1 Incremental Learning 1 Interpretable Machine Learning 1 K-complex detection 1 KB-to-Language Generation 1 Keyword Extraction 1 Knowledge Base Question Answering 1 Knowledge Graph Completion 1 Knowledge Graph Embeddings 1 Lexical Entailment 1 Lexical Simplification 1 Lip password classification 1 Lipreading 1 Logical Reasoning Question Answering 1 Long-tail Learning 1 Low Resource Named Entity Recognition 1 Low-Resource Neural Machine Translation 1 Medical Image Segmentation 1 Medical Named Entity Recognition 1 Meeting Summarization 1 Monocular Depth Estimation 1 Morphological Analysis 1 Motion Segmentation 1 Multi-Domain Sentiment Classification 1 Multi-domain Dialogue State Tracking 1 Multimodal Abstractive Text Summarization 1 Multiple Instance Learning 1 Multivariate Time Series Forecasting 1 Music Emotion Recognition 1 Music Information Retrieval 1 Network Pruning 1 Neural Architecture Search 1 Neural Network Compression 1 Non-Intrusive Load Monitoring 1 Novel View Synthesis 1 Object Detection 1 Offline RL 1 Parallel Corpus Mining 1 Partial Domain Adaptation 1 Passage Re-Ranking 1 Pedestrian Attribute Recognition 1 Person Re-Identification 1 Person Search 1 Phrase Grounding 1 Poem meters classification 1 Pose Estimation 1 Program Synthesis 1 Prosody Prediction 1 Recipe Generation 1 Recognizing Emotion Cause in Conversations 1 Referring Expression Comprehension 1 Robust classification 1 SQL Parsing 1 Scene Text 1 Scene Understanding 1 Scene-Aware Dialogue 1 Self-Supervised Image Classification 1 Semantic Segmentation 1 Semantic Similarity 1 Semi-Supervised Image Classification 1 Semi-Supervised Text Classification 1 Sentence Compression 1 Sign Language Recognition 1 Sign Language Translation 1 Skeleton Based Action Recognition 1 Sleep Stage Detection 1 Sparse Learning 1 Speaker Diarization 1 Speech Emotion Recognition 1 Speech Synthesis 1 Speech-to-Gesture Translation 1 Spindle Detection 1 Spoken language identification 1 Stereo Matching 1 Style Transfer 1 Super-Resolution 1 Surgical Gesture Recognition 1 Surgical tool detection 1 Synthetic Data Generation 1 Systematic Generalization 1 Table Detection 1 Table-to-Text Generation 1 Temporal Information Extraction 1 Text Effects Transfer 1 Text Matching 1 Text Simplification 1 Text Style Transfer 1 Text-Image Retrieval 1 Text-To-Speech Synthesis 1 Text-to-Image Generation 1 Time Series Classification 1 Time Series Forecasting 1 Timex normalization 1 Tokenization 1 Topic Models 1 Tweet-Reply Sentiment Analysis 1 Twitter Sentiment Analysis 1 Univariate Time Series Forecasting 1 Unsupervised Image Classification 1 Unsupervised KG-to-text 1 Unsupervised Machine Translation 1 Unsupervised semantic parsing 1 Variational Inference 1 Video Description 1 Video Emotion Recognition 1 Video Story QA 1 Video Understanding 1 Visual Commonsense Reasoning 1 Visual Speech Recognition 1 Visual Storytelling 1 Voice Anti-spoofing 1 Weakly Supervised Object Detection 1 Zero-Shot Learning 1 Zero-Shot Object Detection 1 Zero-Shot Transfer Image Classification 1
Filter by Language (clear)
English Chinese 97 German 68 French 54 Spanish 47 Arabic 38 Japanese 38 Russian 37 Italian 32 Portuguese 28 Korean 25 Turkish 23 Hindi 22 Vietnamese 21 Finnish 20 Czech 19 Dutch 19 Persian 16 Polish 16 Romanian 16 Multilingual 15 Tamil 15 Telugu 15 Thai 14 Urdu 13 Basque 11 Bengali 11 Estonian 11 Indonesian 11 Malayalam 11 Kannada 10 Marathi 10 Swedish 10 Bulgarian 9 Catalan 9 Hungarian 9 Norwegian 9 Armenian 8 Breton 8 Danish 8 Greek 8 Gujarati 8 Hebrew 8 Ukrainian 8 Assamese 7 Lithuanian 7 Mandarin Chinese 7 Punjabi 7 Slovak 7 Albanian 6 Amharic 6 Croatian 6 Esperanto 6 Kurdish 6 Latvian 6 Serbian 6 Sinhala 6 Slovenian 6 Swahili 6 Welsh 6 Afrikaans 5 Galician 5 Georgian 5 Icelandic 5 Irish 5 Kazakh 5 Macedonian 5 Maltese 5 Oriya (macrolanguage) 5 Tagalog 5 Yoruba 5 Belarusian 4 Bosnian 4 Haitian 4 Igbo 4 Latin 4 Malagasy 4 Mongolian 4 Sanskrit 4 Scottish Gaelic 4 Sindhi 4 Standard Arabic 4 Tatar 4 Wolof 4 Aragonese 3 Azerbaijani 3 Bavarian 3 Bishnupriya 3 Burmese 3 Central Khmer 3 Chechen 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Filipino 3 Guarani 3 Hausa 3 Javanese 3 Kinyarwanda 3 Lao 3 Malay (individual language) 3 Norwegian Nynorsk 3 Quechua 3 Romansh 3 Russia Buriat 3 Serbo-Croatian 3 Somali 3 South Azerbaijani 3 Sundanese 3 Uighur 3 Upper Sorbian 3 Uzbek 3 Yiddish 3 Asturian 2 Avaric 2 Bambara 2 Bashkir 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Faroese 2 Fon 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Kirghiz 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Neapolitan 2 Nepali (macrolanguage) 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Oromo 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Sardinian 2 Sicilian 2 Swati 2 Swiss German 2 Tajik 2 Tibetan 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Western Panjabi 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 American Sign Language 1 Ancient Greek 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Buginese 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inuktitut 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Malay (macrolanguage) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Norwegian Bokmål 1 Novial 1 Nyanja 1 Odia 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Portuguse 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Sichuan Yi 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tetum 1 Tigrinya 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1

591 dataset results for English