Datasets

6,203 machine learning datasets
Filter by Task
Action Recognition 58 Temporal Action Localization 27 Video Understanding 25 Object Detection 23 Pose Estimation 23 Object Tracking 21 Video Captioning 19 Video Retrieval 18 Action Detection 17 Video Question Answering 17 Action Classification 15 Question Answering 15 Semantic Segmentation 15 Visual Question Answering 15 Multi-Object Tracking 14 Activity Recognition 13 Skeleton Based Action Recognition 12 Video Classification 12 Visual Object Tracking 12 Video Prediction 11 Visual Tracking 11 Action Recognition In Videos 10 Autonomous Driving 9 DeepFake Detection 9 Person Re-Identification 9 Sign Language Recognition 9 Trajectory Prediction 9 Video Generation 9 3D Human Pose Estimation 8 Activity Detection 8 Facial Expression Recognition 8 Video Summarization 8 Action Segmentation 7 Ad-hoc video search 7 Depth Estimation 7 Face Anti-Spoofing 7 Human-Object Interaction Detection 7 Optical Flow Estimation 7 Face Recognition 6 Face Swapping 6 Instance Segmentation 6 Multi-Task Learning 6 Multimodal Activity Recognition 6 Multiple Object Tracking 6 Self-Supervised Learning 6 Sign Language Translation 6 Unsupervised Video Object Segmentation 6 Video Object Segmentation 6 Video Segmentation 6 Decision Making 5 Dense Video Captioning 5 Face Verification 5 Hand Gesture Recognition 5 Hand Pose Estimation 5 Human action generation 5 Image Generation 5 Lipreading 5 Online Multi-Object Tracking 5 Scene Understanding 5 Semi-Supervised Video Object Segmentation 5 Spatio-Temporal Action Localization 5 Speech Recognition 5 Video Frame Interpolation 5 Video Object Detection 5 Video Quality Assessment 5 Video Recognition 5 2D Semantic Segmentation 4 3D Action Recognition 4 3D Hand Pose Estimation 4 Action Quality Assessment 4 Action Triplet Recognition 4 Action Understanding 4 Anomaly Detection 4 Audio Classification 4 Autonomous Vehicles 4 Crowd Counting 4 Emotion Recognition 4 Emotion Recognition in Conversation 4 Face Detection 4 Human Detection 4 Image Inpainting 4 Lane Detection 4 Motion Segmentation 4 Person Search 4 Real-Time Object Detection 4 Self-Supervised Action Recognition 4 Temporal Action Proposal Generation 4 Video Description 4 Video Inpainting 4 Video Instance Segmentation 4 Video Salient Object Detection 4 Video Super-Resolution 4 Visual Speech Recognition 4 Zero-Shot Action Recognition 4 3D Absolute Human Pose Estimation 3 3D Object Detection 3 3D Pose Estimation 3 3D Reconstruction 3 Action Anticipation 3 Anomaly Detection In Surveillance Videos 3 Deblurring 3 Disentanglement 3 Domain Adaptation 3 Emotion Classification 3 Face Presentation Attack Detection 3 Facial Action Unit Detection 3 Few Shot Action Recognition 3 Gait Recognition 3 Human Interaction Recognition 3 Human Pose Forecasting 3 Interactive Video Object Segmentation 3 Lip Reading 3 Medical Image Segmentation 3 Moment Retrieval 3 Monocular Depth Estimation 3 Multimodal Sentiment Analysis 3 Natural Language Moment Retrieval 3 Object Localization 3 Object Recognition 3 Panoptic Segmentation 3 Pose Prediction 3 Pose Tracking 3 Real-Time Multi-Object Tracking 3 Referring Expression Segmentation 3 Steering Control 3 Trajectory Forecasting 3 Unconstrained Lip-synchronization 3 Unsupervised Person Re-Identification 3 Video Object Tracking 3 Video Reconstruction 3 Visual Keyword Spotting 3 Weakly Supervised Action Localization 3 motion prediction 3 3D Object Tracking 2 Abnormal Event Detection In Video 2 Action Localization 2 Action Parsing 2 Action Spotting 2 Action Unit Detection 2 Active Learning 2 Activity Prediction 2 Activity Recognition In Videos 2 Audio-Visual Speech Recognition 2 Bayesian Inference 2 Boundary Detection 2 Camera shot boundary detection 2 Class-agnostic Object Detection 2 Denoising 2 Egocentric Activity Recognition 2 Face Alignment 2 Face Identification 2 Facial Emotion Recognition 2 Facial Landmark Detection 2 Few Shot Temporal Action Localization 2 Fine-Grained Action Detection 2 Frame 2 Generalized Zero Shot skeletal action recognition 2 Gesture Recognition 2 Group Activity Recognition 2 Homography Estimation 2 Image Classification 2 Interactive Segmentation 2 Lip to Speech Synthesis 2 Metric Learning 2 Motion Forecasting 2 Multi-Label Classification 2 Multi-Person Pose Estimation 2 Multi-future Trajectory Prediction 2 Multiple Instance Learning 2 Multiple People Tracking 2 Multiview Learning 2 Music Information Retrieval 2 Natural Language Visual Grounding 2 Novel View Synthesis 2 One-Shot 3D Action Recognition 2 Open World Object Detection 2 Pedestrian Detection 2 Pose Retrieval 2 Real-Time Semantic Segmentation 2 Robot Navigation 2 Scene Change Detection 2 Scene Text Recognition 2 Self-Driving Cars 2 Self-supervised Video Retrieval 2 Semantic Object Interaction Classification 2 Sentiment Analysis 2 Sign Language Production 2 Skills Assessment 2 Skills Evaluation 2 Small Object Detection 2 Speech Emotion Recognition 2 Supervised Video Summarization 2 Surgical Gesture Recognition 2 Surgical tool detection 2 Talking Face Generation 2 Text-to-Image Generation 2 Text-to-video search 2 Unsupervised Skeleton Based Action Recognition 2 Unsupervised Video Summarization 2 Vehicle Re-Identification 2 Video Alignment 2 Video Compression 2 Video Denoising 2 Video Emotion Recognition 2 Video Matting 2 Video Polyp Segmentation 2 Video Semantic Segmentation 2 Video Visual Relation Detection 2 Video Visual Relation Tagging 2 Video-Based Person Re-Identification 2 Video-Text Retrieval 2 Visual Reasoning 2 Weakly Supervised Object Detection 2 Weakly Supervised Temporal Action Localization 2 Weather Forecasting 2 Zero Shot Skeletal Action Recognition 2 Zero-Shot Learning 2 audio-visual learning 2 2D Human Pose Estimation 1 3D Car Instance Understanding 1 3D Classification 1 3D Face Reconstruction 1 3D Feature Matching 1 3D Human Dynamics 1 3D Human Pose Tracking 1 3D Human Reconstruction 1 3D Instance Segmentation 1 3D Lane Detection 1 3D Multi-Person Pose Estimation 1 3D Object Recognition 1 3D Object Reconstruction 1 3D Object Retrieval 1 3D Point Cloud Matching 1 3D Shape Reconstruction 1 3D Shape Representation 1 3D human pose and shape estimation 1 6D Pose Estimation 1 6D Pose Estimation using RGB 1 6D Pose Estimation using RGBD 1 Accident Anticipation 1 Action Recognition In Videos 1 Active Object Detection 1 Activeness Detection 1 Aesthetics Quality Assessment 1 Age Estimation 1 Amodal Panoptic Segmentation 1 Anxiety Detection 1 Arousal Estimation 1 Atari Games 1 Audio Generation 1 Audio Source Separation 1 Audio-Visual Active Speaker Detection 1 Audio-Visual Synchronization 1 Binarization 1 Camera Auto-Calibration 1 Camera shot segmentation 1 Classification 1 Clinical Concept Extraction 1 Colorectal Gland Segmentation: 1 Colorectal Polyps Characterization 1 Colorization 1 Conditional Image Generation 1 Continual Learning 1 Contrastive Learning 1 Copy Detection 1 Cross-Modal Retrieval 1 Deep Attention 1 Depression Detection 1 Dimensionality Reduction 1 Domain Generalization 1 Driver Attention Monitoring 1 English Conversational Speech Recognition 1 Event Detection 1 Event Segmentation 1 Face Generation 1 Few-Shot Image Classification 1 Few-Shot Learning 1 Few-Shot Object Detection 1 Fine-Grained Vehicle Classification 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Future prediction 1 Gait Identification 1 Gaze Estimation 1 Gaze Prediction 1 Gender Prediction 1 General Action Video Anomaly Detection 1 General Classification 1 Generalizable Person Re-identification 1 Generalized Zero-Shot Object Detection 1 Genre classification 1 Graph Matching 1 Hand Joint Reconstruction 1 Hand Segmentation 1 Hand-Gesture Recognition 1 Head Pose Estimation 1 Heart Rate Variability 1 Heart rate estimation 1 Highlight Detection 1 Home Activity Monitoring 1 Human Dynamics 1 Human Instance Segmentation 1 Human Part Segmentation 1 Human fMRI response prediction 1 Human motion prediction 1 Human-Object-interaction motion tracking 1 Image Captioning 1 Image Deblurring 1 Image Denoising 1 Image Generation from Scene Graphs 1 Image Manipulation 1 Image Registration 1 Image Relighting 1 Image Restoration 1 Image Retrieval 1 Image Super-Resolution 1 Image-level Supervised Instance Segmentation 1 Imitation Learning 1 Imputation 1 Indoor Localization 1 Instrument Recognition 1 Keypoint Detection 1 Kinematic Based Workflow Recognition 1 Knowledge Distillation 1 Language Modelling 1 Layout-to-Image Generation 1 License Plate Detection 1 License Plate Recognition 1 Lip password classification 1 Localization In Video Forgery 1 Logical Reasoning Question Answering 1 Low resource - Speech Emotion Recognition 1 Low-Light Image Enhancement 1 MULTI-VIEW LEARNING 1 Medical Diagnosis 1 Medical Image Registration 1 Metaheuristic Optimization 1 Micro-Expression Spotting 1 Misinformation 1 Mistake Detection 1 Monocular 3D Human Pose Estimation 1 Monocular 3D Object Detection 1 Monocular Cross-View Road Scene Parsing(Road) 1 Monocular Cross-View Road Scene Parsing(Vehicle) 1 Motion Estimation 1 Multi Future Trajectory Prediction 1 Multi-Frame Super-Resolution 1 Multi-Hypotheses 3D Human Pose Estimation 1 Multi-Label Learning 1 Multi-Object Tracking and Segmentation 1 Multi-Person Pose Estimation and Tracking 1 Multi-agent Reinforcement Learning 1 Multi-object discovery 1 Multi-task Audio Source Seperation 1 Multimodal Abstractive Text Summarization 1 Multimodal Deep Learning 1 Multimodal Emotion Recognition 1 Multimodal GIF Dialog 1 Multiple Object Track and Segmentation 1 Multiview Detection 1 Multiview Gait Recognition 1 Music Emotion Recognition 1 Music Generation 1 Natural Language Inference 1 Natural Language Understanding 1 Object Counting 1 Object Detection In Indoor Scenes 1 Object Discovery 1 Object Proposal Generation 1 Occluded 3D Object Symmetry Detection 1 Offline RL 1 One-Shot Instance Segmentation 1 One-Shot Object Detection 1 One-shot visual object segmentation 1 Organ Detection 1 Paraphrase Generation 1 Partial Label Learning 1 Partial Point Cloud Matching 1 Pedestrian Trajectory Prediction 1 Person Identification 1 Person Recognition 1 Point Cloud Registration 1 Point-Supervised Instance Segmentation 1 Portrait Segmentation 1 Privacy Preserving Deep Learning 1 Prosody Prediction 1 Question Generation 1 Reading Comprehension 1 Real-time Instance Segmentation 1 Recognizing And Localizing Human Actions 1 Recognizing Emotion Cause in Conversations 1 Region Proposal 1 Replay Grounding 1 Robust Object Detection 1 Root Joint Localization 1 Sarcasm Detection 1 Scene Classification 1 Scene Flow Estimation 1 Scene Graph Detection 1 Scene-Aware Dialogue 1 Segmentation Based Workflow Recognition 1 Semantic SLAM 1 Semi Supervised Learning for Image Captioning 1 Semi-Supervised Instance Segmentation 1 Semi-Supervised Video Classification 1 Semi-supervised Anomaly Detection 1 Sentence Embedding 1 Simultaneous Localization and Mapping 1 Single-Image Portrait Relighting 1 Single-object discovery 1 Small Data Image Classification 1 Smile Recognition 1 Speaker Diarization 1 Speaker Recognition 1 Speaker Separation 1 Speaker Verification 1 Speech Emotion Recognition - 5-Fold 1 Speech Enhancement 1 Speech Extraction 1 Speech Separation 1 Speech Synthesis 1 Speech-to-Gesture Translation 1 Stereo Matching 1 Super-Resolution 1 Surgical Skills Evaluation 1 Symmetry Detection 1 Synthetic-to-Real Translation 1 TFLM sequence generation 1 TGIF-Action 1 TGIF-Frame 1 TGIF-Transition 1 Talking Head Generation 1 Temporal Localization 1 Temporal/Casual QA 1 Text Spotting 1 Text Summarization 1 Text-Image Retrieval 1 Text-to-Video Generation 1 Texture Synthesis 1 Thermal Infrared Object Tracking 1 Traffic Accident Detection 1 Traffic Object Detection 1 Transfer Learning 1 Unsupervised Anomaly Detection 1 Unsupervised Domain Adaptation 1 Unsupervised Semantic Segmentation 1 VQA 1 Valence Estimation 1 Vehicle Pose Estimation 1 Vehicle Speed Estimation 1 Video & Kinematic Base Workflow Recognition 1 Video Based Workflow Recognition 1 Video Deinterlacing 1 Video Enhancement 1 Video Forensics 1 Video Harmonization 1 Video Restoration 1 Video Saliency Detection 1 Video Saliency Prediction 1 Video Story QA 1 Video Synchronization 1 Video, Kinematic & Segmentation Base Workflow Recognition 1 Video-to-Shop 1 Video-to-Video Synthesis 1 Vision and Language Navigation 1 Visual Crowd Analysis 1 Visual Odometry 1 Weakly Supervised Action Segmentation (Transcript) 1 Weakly Supervised Classification 1 Weakly-supervised 3D Human Pose Estimation 1 Weakly-supervised Temporal Action Localization 1 Weakly-supervised instance segmentation 1 Wikipedia Summarization 1 Word Embeddings 1 Zero-Shot Cross-Modal Retrieval 1 Zero-Shot Object Detection 1 Zero-Shot Text-to-Image Generation 1 Zero-shot Image Retrieval 1 Zero-shot Text Retrieval 1 drone-based object tracking 1 eXtreme-Video-Frame-Interpolation 1
Filter by Language
English 125 Chinese 7 German 4 Hindi 4 French 3 Korean 3 Portuguese 3 Spanish 3 American Sign Language 2 Arabic 2 Japanese 2 Multilingual 2 Russian 2 Turkish Sign Language 2 Bengali 1 Greek Sign Language 1 Italian 1 Mongolian 1 Swiss German 1 Swiss-German Sign Language 1 Telugu 1 Turkish 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Afrikaans 0 Akan 0 Akkadian 0 Akuntsu 0 Albanian 0 Amharic 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Aragonese 0 Argentine Sign Language 0 Armenian 0 Arpitan 0 Assamese 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bambara 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Basque 0 Bavarian 0 Belarusian 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Bulgarian 0 Burmese 0 Catalan 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Croatian 0 Czech 0 Danish 0 Dhivehi 0 Dimli (individual language) 0 Dutch 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 Erzya 0 Esperanto 0 Estonian 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Finnish 0 Fon 0 Friulian 0 Fulah 0 Gagauz 0 Galician 0 Gan Chinese 0 Ganda 0 Geez 0 Georgian 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Hausa 0 Hawaiian 0 Hebrew 0 Herero 0 Hiri Motu 0 Hungarian 0 Icelandic 0 Ido 0 Igbo 0 Iloko 0 Indonesian 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Irish 0 Jamaican Creole English 0 Javanese 0 Jejueo 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Kazakh 0 Khunsari 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Latin 0 Latvian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Literary Chinese 0 Lithuanian 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Maltese 0 Mandarin Chinese 0 Manipuri 0 Manx 0 Maori 0 Marathi 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Moroccan Arabic 0 Mundurukú 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Pidgin 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Norwegian 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Persian 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Polish 0 Pontic 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romanian 0 Romansh 0 Rundi 0 Russia Buriat 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Serbian 0 Serbo-Croatian 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Skolt Sami 0 Slovak 0 Slovenian 0 Soi 0 Somali 0 South Azerbaijani 0 South Levantine Arabic 0 Southern Sotho 0 Sranan Tongo 0 Standard Arabic 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swedish 0 Swedish Sign Language 0 Tagalog 0 Tahitian 0 Tai 0 Tajik 0 Tamil 0 Tatar 0 Tetum 0 Thai 0 Tibetan 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Ukrainian 0 Upper Sorbian 0 Urdu 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vietnamese 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 Warlpiri 0 Welsh 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wolof 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Yoruba 0 Yue Chinese 0 Zeeuws 0 Zhuang 0 Zulu 0

606 dataset results for Videos