Datasets

7,429 machine learning datasets
Filter by Task
Multimodal Deep Learning
Filter by Language
English 2 Multilingual 1

8 dataset results for Multimodal Deep Learning