Datasets

6,712 machine learning datasets
Filter by Task
Action Recognition 58 Temporal Action Localization 27 Object Detection 25 Video Understanding 25 Pose Estimation 23 Object Tracking 21 Video Captioning 20 Video Retrieval 19 Action Detection 18 Video Question Answering 18 Semantic Segmentation 17 Action Classification 16 Visual Question Answering 16 Multi-Object Tracking 15 Question Answering 15 Activity Recognition 13 Skeleton Based Action Recognition 12 Video Classification 12 Visual Object Tracking 12 Action Recognition In Videos 11 Speech Recognition 11 Video Prediction 11 Visual Tracking 11 Autonomous Driving 9 DeepFake Detection 9 Person Re-Identification 9 Sign Language Recognition 9 Trajectory Prediction 9 Video Generation 9 Video Summarization 9 3D Human Pose Estimation 8 Action Segmentation 8 Activity Detection 8 Facial Expression Recognition 8 Ad-hoc video search 7 Depth Estimation 7 Face Anti-Spoofing 7 Face Recognition 7 Human-Object Interaction Detection 7 Multi-Task Learning 7 Optical Flow Estimation 7 Video Segmentation 7 Emotion Recognition 6 Face Detection 6 Face Swapping 6 Hand Pose Estimation 6 Instance Segmentation 6 Lipreading 6 Multimodal Activity Recognition 6 Multiple Object Tracking 6 Self-Supervised Learning 6 Sign Language Translation 6 Unsupervised Video Object Segmentation 6 Video Object Detection 6 Video Object Segmentation 6 3D Action Recognition 5 Anomaly Detection 5 Decision Making 5 Dense Video Captioning 5 Emotion Recognition in Conversation 5 Face Verification 5 Hand Gesture Recognition 5 Human action generation 5 Image Generation 5 Online Multi-Object Tracking 5 Scene Understanding 5 Semi-Supervised Video Object Segmentation 5 Spatio-Temporal Action Localization 5 Video Frame Interpolation 5 Video Instance Segmentation 5 Video Quality Assessment 5 Video Recognition 5 Video Super-Resolution 5 2D Semantic Segmentation 4 3D Hand Pose Estimation 4 3D Object Detection 4 3D Pose Estimation 4 Action Anticipation 4 Action Quality Assessment 4 Action Triplet Recognition 4 Action Understanding 4 Audio Classification 4 Autonomous Vehicles 4 Crowd Counting 4 Emotion Classification 4 Human Detection 4 Image Inpainting 4 Lane Detection 4 Lip Reading 4 Motion Segmentation 4 Novel View Synthesis 4 Person Search 4 Pose Tracking 4 Real-Time Object Detection 4 Self-Supervised Action Recognition 4 Speaker Recognition 4 Temporal Action Proposal Generation 4 Video Description 4 Video Inpainting 4 Video Salient Object Detection 4 Visual Speech Recognition 4 Zero-Shot Action Recognition 4 3D Absolute Human Pose Estimation 3 3D Object Tracking 3 3D Reconstruction 3 Abnormal Event Detection In Video 3 Anomaly Detection In Surveillance Videos 3 Audio-Visual Speech Recognition 3 Camera shot boundary detection 3 Deblurring 3 Dialect Identification 3 Disentanglement 3 Domain Adaptation 3 Face Presentation Attack Detection 3 Facial Action Unit Detection 3 Few Shot Action Recognition 3 Gait Recognition 3 Gesture Recognition 3 Human Behavior Forecasting 3 Human Interaction Recognition 3 Human Pose Forecasting 3 Image Classification 3 Interactive Video Object Segmentation 3 Medical Image Segmentation 3 Moment Retrieval 3 Monocular Depth Estimation 3 Multimodal Sentiment Analysis 3 Natural Language Moment Retrieval 3 Object Localization 3 Object Recognition 3 Panoptic Segmentation 3 Pose Prediction 3 Real-Time Multi-Object Tracking 3 Referring Expression Segmentation 3 Steering Control 3 Supervised Video Summarization 3 Trajectory Forecasting 3 Unconstrained Lip-synchronization 3 Unsupervised Person Re-Identification 3 Unsupervised Video Summarization 3 Video Object Tracking 3 Video Reconstruction 3 Visual Keyword Spotting 3 Weakly Supervised Action Localization 3 motion prediction 3 2D Human Pose Estimation 2 2D object detection 2 Accented Speech Recognition 2 Action Localization 2 Action Parsing 2 Action Spotting 2 Action Unit Detection 2 Active Learning 2 Activity Prediction 2 Activity Recognition In Videos 2 Bayesian Inference 2 Boundary Detection 2 Class-agnostic Object Detection 2 Denoising 2 Driver Attention Monitoring 2 Egocentric Activity Recognition 2 Event Detection 2 Event Segmentation 2 Face Alignment 2 Face Identification 2 Facial Emotion Recognition 2 Facial Landmark Detection 2 Few Shot Temporal Action Localization 2 Fine-Grained Action Detection 2 Gaze Estimation 2 Generalized Zero Shot skeletal action recognition 2 Group Activity Recognition 2 Highlight Detection 2 Homography Estimation 2 Interactive Segmentation 2 Lip to Speech Synthesis 2 Metric Learning 2 Motion Forecasting 2 Multi-Label Classification 2 Multi-Label Learning 2 Multi-Person Pose Estimation 2 Multi-future Trajectory Prediction 2 Multimodal Emotion Recognition 2 Multiple Instance Learning 2 Multiple People Tracking 2 Multiview Learning 2 Music Information Retrieval 2 Natural Language Visual Grounding 2 One-Shot 3D Action Recognition 2 Online Action Detection 2 Open World Object Detection 2 Pedestrian Detection 2 Pose Retrieval 2 Real-Time Semantic Segmentation 2 Robot Navigation 2 Scene Change Detection 2 Scene Text Recognition 2 Self-Driving Cars 2 Self-supervised Video Retrieval 2 Semantic Object Interaction Classification 2 Semi-Supervised Action Detection 2 Sentiment Analysis 2 Sign Language Production 2 Skills Assessment 2 Skills Evaluation 2 Small Object Detection 2 Speech Emotion Recognition 2 Speech Enhancement 2 Surgical Gesture Recognition 2 Surgical tool detection 2 Talking Face Generation 2 Text-to-Image Generation 2 Text-to-video search 2 Unsupervised 3D Human Pose Estimation 2 Unsupervised Skeleton Based Action Recognition 2 Vehicle Re-Identification 2 Video Alignment 2 Video Compression 2 Video Denoising 2 Video Emotion Recognition 2 Video Grounding 2 Video Matting 2 Video Polyp Segmentation 2 Video Semantic Segmentation 2 Video Visual Relation Detection 2 Video Visual Relation Tagging 2 Video-Based Person Re-Identification 2 Video-Text Retrieval 2 Visual Reasoning 2 Weakly Supervised Action Segmentation (Transcript) 2 Weakly Supervised Object Detection 2 Weakly Supervised Temporal Action Localization 2 Weakly-supervised 3D Human Pose Estimation 2 Weather Forecasting 2 Zero Shot Skeletal Action Recognition 2 Zero-Shot Action Detection 2 Zero-Shot Learning 2 audio-visual learning 2 2D Semantic Segmentation task 1 (8 classes) 1 2D Semantic Segmentation task 3 (25 classes) 1 3D Anomaly Detection 1 3D Car Instance Understanding 1 3D Classification 1 3D Depth Estimation 1 3D Feature Matching 1 3D Human Dynamics 1 3D Human Pose Tracking 1 3D Human Reconstruction 1 3D Instance Segmentation 1 3D Lane Detection 1 3D Multi-Person Pose Estimation 1 3D Object Detection From Stereo Images 1 3D Object Recognition 1 3D Object Reconstruction 1 3D Object Retrieval 1 3D Point Cloud Matching 1 3D Point Cloud Reconstruction 1 3D Scene Reconstruction 1 3D Shape Reconstruction 1 3D Shape Representation 1 6D Pose Estimation 1 6D Pose Estimation using RGB 1 6D Pose Estimation using RGBD 1 Accident Anticipation 1 Action Recognition In Videos 1 Active Object Detection 1 Active Speaker Localization 1 Activeness Detection 1 Aesthetics Quality Assessment 1 Age Estimation 1 Amodal Instance Segmentation 1 Amodal Panoptic Segmentation 1 Animal Action Recognition 1 Animal Pose Estimation 1 Anxiety Detection 1 Arousal Estimation 1 Atari Games 1 Audio Emotion Recognition 1 Audio Generation 1 Audio Source Separation 1 Audio-Visual Active Speaker Detection 1 Audio-Visual Synchronization 1 Automatic Speech Recognition 1 Behavioral Malware Detection 1 Binarization 1 Breast Cancer Detection 1 Breast Tumour Classification 1 Camera Auto-Calibration 1 Camera shot segmentation 1 Classification 1 Clinical Concept Extraction 1 Colorectal Gland Segmentation: 1 Colorectal Polyps Characterization 1 Colorization 1 Conditional Image Generation 1 Continual Learning 1 Contrastive Learning 1 Conversational Response Generation 1 Copy Detection 1 Cross-Modal Retrieval 1 Deep Attention 1 Depression Detection 1 Dialog Act Classification 1 Dialogue Act Classification 1 Dialogue Generation 1 Dimensionality Reduction 1 Domain Generalization 1 Emotional Dialogue Acts 1 English Conversational Speech Recognition 1 Face Clustering 1 Face Generation 1 Few-Shot Image Classification 1 Few-Shot Learning 1 Few-Shot Object Detection 1 Fine-Grained Vehicle Classification 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Fire Detection 1 Future prediction 1 Gait Identification 1 Gaze Prediction 1 Gender Prediction 1 General Action Video Anomaly Detection 1 General Classification 1 Generalizable Person Re-identification 1 Generalized Zero-Shot Object Detection 1 Genre classification 1 Graph Matching 1 HDR Reconstruction 1 Hand Joint Reconstruction 1 Hand Segmentation 1 Hand-Gesture Recognition 1 Head Pose Estimation 1 Heart Rate Variability 1 Heart rate estimation 1 Home Activity Monitoring 1 Human Dynamics 1 Human Instance Segmentation 1 Human Part Segmentation 1 Human fMRI response prediction 1 Human motion prediction 1 Human-Object-interaction motion tracking 1 Image Captioning 1 Image Deblurring 1 Image Denoising 1 Image Generation from Scene Graphs 1 Image Manipulation 1 Image Registration 1 Image Relighting 1 Image Restoration 1 Image Retrieval 1 Image Super-Resolution 1 Image-level Supervised Instance Segmentation 1 Imitation Learning 1 Imputation 1 Indoor Localization 1 Instrument Recognition 1 Inverse-Tone-Mapping 1 Keypoint Detection 1 Kinematic Based Workflow Recognition 1 Knowledge Distillation 1 Language Modelling 1 Layout-to-Image Generation 1 Lesion Detection 1 License Plate Detection 1 License Plate Recognition 1 Lip password classification 1 Localization In Video Forgery 1 Logical Reasoning Question Answering 1 Long-tail Learning 1 Low resource - Speech Emotion Recognition 1 Low-Light Image Enhancement 1 MULTI-VIEW LEARNING 1 Medical Diagnosis 1 Medical Image Registration 1 Metaheuristic Optimization 1 Micro-Expression Spotting 1 Misinformation 1 Mistake Detection 1 Moment Queries 1 Monocular 3D Human Pose Estimation 1 Monocular 3D Object Detection 1 Monocular Cross-View Road Scene Parsing(Road) 1 Monocular Cross-View Road Scene Parsing(Vehicle) 1 Motion Estimation 1 Multi Future Trajectory Prediction 1 Multi-Frame Super-Resolution 1 Multi-Hypotheses 3D Human Pose Estimation 1 Multi-Instance Retrieval 1 Multi-Object Tracking and Segmentation 1 Multi-Person Pose Estimation and Tracking 1 Multi-agent Reinforcement Learning 1 Multi-object discovery 1 Multi-task Audio Source Seperation 1 Multimodal Abstractive Text Summarization 1 Multimodal Deep Learning 1 Multimodal GIF Dialog 1 Multiple Object Track and Segmentation 1 Multiview Detection 1 Multiview Gait Recognition 1 Music Emotion Recognition 1 Music Generation 1 Natural Language Inference 1 Natural Language Processing 1 Natural Language Queries 1 Natural Language Understanding 1 Object Counting 1 Object Detection In Indoor Scenes 1 Object Discovery 1 Object Proposal Generation 1 Object State Change Classification 1 Occluded 3D Object Symmetry Detection 1 Offline RL 1 One-Shot Instance Segmentation 1 One-Shot Object Detection 1 One-shot visual object segmentation 1 Open Vocabulary Object Detection 1 Open-Domain Dialog 1 Organ Detection 1 Paraphrase Generation 1 Partial Label Learning 1 Partial Point Cloud Matching 1 Pedestrian Trajectory Prediction 1 Person Identification 1 Person Recognition 1 Personality Recognition in Conversation 1 Personality Trait Recognition 1 Personalized and Emotional Conversation 1 Point Cloud Registration 1 Point-Supervised Instance Segmentation 1 Portrait Segmentation 1 Privacy Preserving Deep Learning 1 Prosody Prediction 1 Question Generation 1 Reading Comprehension 1 Real-time Instance Segmentation 1 Recognizing And Localizing Human Actions 1 Recognizing Emotion Cause in Conversations 1 Region Proposal 1 Replay Grounding 1 Robust Object Detection 1 Root Joint Localization 1 Sarcasm Detection 1 Scene Classification 1 Scene Flow Estimation 1 Scene Graph Detection 1 Scene Graph Generation 1 Scene-Aware Dialogue 1 Segmentation Based Workflow Recognition 1 Semantic SLAM 1 Semi Supervised Learning for Image Captioning 1 Semi-Supervised Instance Segmentation 1 Semi-Supervised Video Classification 1 Semi-supervised Anomaly Detection 1 Sentence Embedding 1 Simultaneous Localization and Mapping 1 Single-Image Portrait Relighting 1 Single-object discovery 1 Small Data Image Classification 1 Smile Recognition 1 Speaker Diarization 1 Speaker Separation 1 Speaker Verification 1 Speech Emotion Recognition - 5-Fold 1 Speech Extraction 1 Speech Separation 1 Speech Synthesis 1 Speech-to-Gesture Translation 1 Stereo Matching 1 Super-Resolution 1 Surgical Skills Evaluation 1 Symmetry Detection 1 Synthetic-to-Real Translation 1 TFLM sequence generation 1 TGIF-Action 1 TGIF-Frame 1 TGIF-Transition 1 Talking Head Generation 1 Temporal Localization 1 Temporal/Casual QA 1 Text Spotting 1 Text Summarization 1 Text to Video Retrieval 1 Text-Image Retrieval 1 Text-to-Video Generation 1 Texture Synthesis 1 Thermal Infrared Object Tracking 1 Traffic Accident Detection 1 Traffic Object Detection 1 Transfer Learning 1 Unconditional Video Generation 1 Unsupervised Anomaly Detection 1 Unsupervised Domain Adaptation 1 Unsupervised Semantic Segmentation 1 drone-based object tracking 1 eXtreme-Video-Frame-Interpolation 1 object-detection 1
Filter by Language
English 137 Chinese 9 German 4 Hindi 4 Spanish 4 French 3 Korean 3 Portuguese 3 American Sign Language 2 Arabic 2 Japanese 2 Multilingual 2 Russian 2 Turkish Sign Language 2 Bengali 1 Greek Sign Language 1 Italian 1 Kazakh 1 Mongolian 1 Swiss German 1 Swiss-German Sign Language 1 Telugu 1 Tibetan 1 Turkish 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Afrikaans 0 Akan 0 Akkadian 0 Akuntsu 0 Albanian 0 Amharic 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Aragonese 0 Argentine Sign Language 0 Armenian 0 Arpitan 0 Assamese 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bambara 0 Bangala 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Basque 0 Bavarian 0 Belarusian 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Bulgarian 0 Burmese 0 Catalan 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Congo Swahili 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Croatian 0 Czech 0 Danish 0 Dhivehi 0 Dimli (individual language) 0 Dutch 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 Erzya 0 Esperanto 0 Estonian 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Finnish 0 Fon 0 Friulian 0 Fulah 0 Gagauz 0 Galician 0 Gan Chinese 0 Ganda 0 Geez 0 Georgian 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Hausa 0 Hawaiian 0 Hebrew 0 Herero 0 Hiri Motu 0 Hungarian 0 Icelandic 0 Ido 0 Igbo 0 Iloko 0 Indonesian 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Irish 0 Jamaican Creole English 0 Javanese 0 Jejueo 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Khunsari 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Latin 0 Latvian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Literary Chinese 0 Lithuanian 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Maltese 0 Mandarin Chinese 0 Manipuri 0 Manx 0 Maori 0 Marathi 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Moroccan Arabic 0 Mundurukú 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Pidgin 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Norwegian 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Persian 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Polish 0 Pontic 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romanian 0 Romansh 0 Rundi 0 Russia Buriat 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Serbian 0 Serbo-Croatian 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Skolt Sami 0 Slovak 0 Slovenian 0 Soi 0 Somali 0 South Azerbaijani 0 South Levantine Arabic 0 Southern Sotho 0 Sranan Tongo 0 Standard Arabic 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swedish 0 Swedish Sign Language 0 Tagalog 0 Tahitian 0 Tai 0 Tajik 0 Tamil 0 Tatar 0 Tetum 0 Thai 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Ukrainian 0 Upper Sorbian 0 Urdu 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vietnamese 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 Warlpiri 0 Welsh 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wolof 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Yoruba 0 Yue Chinese 0 Zeeuws 0 Zhuang 0 Zulu 0

643 dataset results for Videos