Datasets

5,437 machine learning datasets
Filter by Task
Action Recognition 54 Temporal Action Localization 26 Video Understanding 20 Object Detection 19 Object Tracking 19 Pose Estimation 18 Video Captioning 18 Video Retrieval 18 Video Question Answering 17 Action Classification 14 Action Detection 14 Question Answering 14 Semantic Segmentation 13 Visual Question Answering 13 Skeleton Based Action Recognition 12 Video Classification 12 Visual Object Tracking 12 Multi-Object Tracking 11 Visual Tracking 11 Activity Recognition 10 Video Prediction 10 Action Recognition In Videos 9 Person Re-Identification 9 Trajectory Prediction 9 Video Summarization 9 Autonomous Driving 8 DeepFake Detection 8 Facial Expression Recognition 8 Sign Language Recognition 8 Video Generation 8 Depth Estimation 7 Face Anti-Spoofing 7 Optical Flow Estimation 7 3D Human Pose Estimation 6 Activity Detection 6 Human-Object Interaction Detection 6 Multi-Task Learning 6 Multiple Object Tracking 6 Self-Supervised Learning 6 Unsupervised Video Object Segmentation 6 Video Object Detection 6 Action Segmentation 5 Dense Video Captioning 5 Face Recognition 5 Face Swapping 5 Face Verification 5 Hand Gesture Recognition 5 Hand Pose Estimation 5 Human action generation 5 Instance Segmentation 5 Multimodal Activity Recognition 5 Online Multi-Object Tracking 5 Sign Language Translation 5 Spatio-Temporal Action Localization 5 Video Object Segmentation 5 Video Quality Assessment 5 Video Segmentation 5 3D Hand Pose Estimation 4 Anomaly Detection 4 Audio Classification 4 Crowd Counting 4 Decision Making 4 Emotion Recognition 4 Human Detection 4 Image Generation 4 Lipreading 4 Person Search 4 Scene Understanding 4 Self-Supervised Action Recognition 4 Semi-Supervised Video Object Segmentation 4 Speech Recognition 4 Temporal Action Proposal Generation 4 Video Frame Interpolation 4 Video Inpainting 4 Video Instance Segmentation 4 Video Salient Object Detection 4 Video Super-Resolution 4 3D Action Recognition 3 3D Object Detection 3 3D Reconstruction 3 Action Quality Assessment 3 Anomaly Detection In Surveillance Videos 3 Autonomous Vehicles 3 Deblurring 3 Domain Adaptation 3 Emotion Recognition in Conversation 3 Face Detection 3 Face Presentation Attack Detection 3 Gait Recognition 3 Human Interaction Recognition 3 Human Pose Forecasting 3 Image Inpainting 3 Interactive Video Object Segmentation 3 Lane Detection 3 Medical Image Segmentation 3 Monocular Depth Estimation 3 Motion Segmentation 3 Multimodal Sentiment Analysis 3 Natural Language Moment Retrieval 3 Object Recognition 3 Referring Expression Segmentation 3 Steering Control 3 Supervised Video Summarization 3 Trajectory Forecasting 3 Unsupervised Video Summarization 3 Video Description 3 Video Object Tracking 3 Video Recognition 3 Video Reconstruction 3 Visual Keyword Spotting 3 Visual Speech Recognition 3 Weakly Supervised Action Localization 3 Zero-Shot Action Recognition 3 motion prediction 3 3D Absolute Human Pose Estimation 2 3D Object Tracking 2 3D Pose Estimation 2 Abnormal Event Detection In Video 2 Action Anticipation 2 Action Localization 2 Action Parsing 2 Action Spotting 2 Action Understanding 2 Action Unit Detection 2 Active Learning 2 Audio-Visual Speech Recognition 2 Bayesian Inference 2 Boundary Detection 2 Camera shot boundary detection 2 Curriculum Learning 2 Denoising 2 Egocentric Activity Recognition 2 Emotion Classification 2 Face Alignment 2 Face Identification 2 Facial Landmark Detection 2 Few Shot Temporal Action Localization 2 Fine-Grained Action Detection 2 Generalized Zero Shot skeletal action recognition 2 Gesture Recognition 2 Group Activity Recognition 2 Image Classification 2 Lip Reading 2 Metric Learning 2 Multi-future Trajectory Prediction 2 Multiple Instance Learning 2 Multiple People Tracking 2 Multiview Learning 2 Music Information Retrieval 2 Natural Language Visual Grounding 2 Novel View Synthesis 2 Object Localization 2 One-Shot 3D Action Recognition 2 Pedestrian Detection 2 Pose Prediction 2 Pose Retrieval 2 Real-Time Multi-Object Tracking 2 Real-Time Object Detection 2 Real-Time Semantic Segmentation 2 Robot Navigation 2 Scene Change Detection 2 Scene Text Recognition 2 Self-Driving Cars 2 Self-supervised Video Retrieval 2 Semantic Object Interaction Classification 2 Sentiment Analysis 2 Skills Assessment 2 Skills Evaluation 2 Speech Emotion Recognition 2 Surgical tool detection 2 Text-to-video search 2 Unconstrained Lip-synchronization 2 Unsupervised Person Re-Identification 2 Vehicle Re-Identification 2 Video Alignment 2 Video Compression 2 Video Denoising 2 Video Matting 2 Video Semantic Segmentation 2 Video Visual Relation Detection 2 Video Visual Relation Tagging 2 Video-Based Person Re-Identification 2 Weakly Supervised Temporal Action Localization 2 Weather Forecasting 2 Zero Shot Skeletal Action Recognition 2 Zero-Shot Learning 2 2D Semantic Segmentation 1 3D Car Instance Understanding 1 3D Face Reconstruction 1 3D Feature Matching 1 3D Human Reconstruction 1 3D Multi-Person Pose Estimation 1 3D Object Recognition 1 3D Object Reconstruction 1 3D Object Retrieval 1 3D Point Cloud Matching 1 3D Shape Reconstruction 1 3D Shape Representation 1 6D Pose Estimation 1 6D Pose Estimation using RGB 1 6D Pose Estimation using RGBD 1 Accident Anticipation 1 Action Recognition In Videos 1 Action Triplet Recognition 1 Activity Prediction 1 Activity Recognition In Videos 1 Aesthetics Quality Assessment 1 Age Estimation 1 Anxiety Detection 1 Atari Games 1 Audio Generation 1 Audio Source Separation 1 Audio-Visual Active Speaker Detection 1 Audio-Visual Synchronization 1 Binarization 1 Camera Auto-Calibration 1 Camera shot segmentation 1 Class-agnostic Object Detection 1 Clinical Concept Extraction 1 Colorectal Gland Segmentation: 1 Colorectal Polyps Characterization 1 Colorization 1 Contrastive Learning 1 Deep Attention 1 Depression Detection 1 Dimensionality Reduction 1 Domain Generalization 1 Driver Attention Monitoring 1 Event Segmentation 1 Facial Action Unit Detection 1 Facial Emotion Recognition 1 Few-Shot Image Classification 1 Few-Shot Learning 1 Fine-Grained Vehicle Classification 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Future prediction 1 Gait Identification 1 Gaze Estimation 1 Gender Prediction 1 General Action Video Anomaly Detection 1 General Classification 1 Generalizable Person Re-identification 1 Genre classification 1 Graph Matching 1 Hand Joint Reconstruction 1 Hand Segmentation 1 Hand-Gesture Recognition 1 Head Pose Estimation 1 Heart Rate Variability 1 Heart rate estimation 1 Home Activity Monitoring 1 Homography Estimation 1 Human Dynamics 1 Human Instance Segmentation 1 Human Part Segmentation 1 Human fMRI response prediction 1 Human motion prediction 1 Human-Object-interaction motion tracking 1 Image Denoising 1 Image Manipulation 1 Image Registration 1 Image Restoration 1 Image Retrieval 1 Image Super-Resolution 1 Imitation Learning 1 Imputation 1 Indoor Localization 1 Instrument Recognition 1 Interactive Segmentation 1 Language Modelling 1 License Plate Detection 1 License Plate Recognition 1 Lip password classification 1 Lip to Speech Synthesis 1 Localization In Video Forgery 1 Logical Reasoning Question Answering 1 Low-Light Image Enhancement 1 MULTI-VIEW LEARNING 1 Metaheuristic Optimization 1 Micro-Expression Spotting 1 Misinformation 1 Moment Retrieval 1 Monocular 3D Human Pose Estimation 1 Monocular 3D Object Detection 1 Monocular Cross-View Road Scene Parsing(Road) 1 Monocular Cross-View Road Scene Parsing(Vehicle) 1 Motion Estimation 1 Motion Forecasting 1 Multi Future Trajectory Prediction 1 Multi-Frame Super-Resolution 1 Multi-Hypotheses 3D Human Pose Estimation 1 Multi-Label Classification 1 Multi-Label Learning 1 Multi-Object Tracking and Segmentation 1 Multi-Person Pose Estimation 1 Multi-Person Pose Estimation and Tracking 1 Multi-agent Reinforcement Learning 1 Multi-task Audio Source Seperation 1 Multimodal Abstractive Text Summarization 1 Multimodal Deep Learning 1 Multimodal Emotion Recognition 1 Multimodal GIF Dialog 1 Multiple Object Track and Segmentation 1 Multiview Detection 1 Multiview Gait Recognition 1 Music Emotion Recognition 1 Music Generation 1 Object Detection In Indoor Scenes 1 Object Discovery 1 Occluded 3D Object Symmetry Detection 1 Offline RL 1 One-shot visual object segmentation 1 Open World Object Detection 1 Organ Detection 1 Panoptic Segmentation 1 Partial Label Learning 1 Partial Point Cloud Matching 1 Pedestrian Trajectory Prediction 1 Person Identification 1 Person Recognition 1 Point Cloud Registration 1 Portrait Segmentation 1 Pose Tracking 1 Privacy Preserving Deep Learning 1 Reading Comprehension 1 Recognizing And Localizing Human Actions 1 Rectification 1 Replay Grounding 1 Root Joint Localization 1 Scene Classification 1 Scene Flow Estimation 1 Scene Graph Detection 1 Scene-Aware Dialogue 1 Semi-Supervised Instance Segmentation 1 Semi-Supervised Video Classification 1 Semi-supervised Anomaly Detection 1 Sentence Embedding 1 Sign Language Production 1 Simultaneous Localization and Mapping 1 Small Data Image Classification 1 Small Object Detection 1 Smile Recognition 1 Speaker Diarization 1 Speaker Recognition 1 Speaker Verification 1 Speech Enhancement 1 Speech Synthesis 1 Speech-to-Gesture Translation 1 Stereo Matching 1 Super-Resolution 1 Surgical Gesture Recognition 1 Surgical Skills Evaluation 1 Symmetry Detection 1 Synthetic-to-Real Translation 1 TFLM sequence generation 1 Talking Face Generation 1 Talking Head Generation 1 Temporal Localization 1 Temporal Logic 1 Text Spotting 1 Text Summarization 1 Text-to-Image Generation 1 Text-to-Video Generation 1 Texture Synthesis 1 Thermal Infrared Object Tracking 1 Traffic Accident Detection 1 Traffic Object Detection 1 Transfer Learning 1 Unsupervised Anomaly Detection 1 Unsupervised Domain Adaptation 1 Vehicle Pose Estimation 1 Vehicle Speed Estimation 1 Video Deinterlacing 1 Video Emotion Recognition 1 Video Enhancement 1 Video Forensics 1 Video Harmonization 1 Video Saliency Detection 1 Video Story QA 1 Video Synchronization 1 Video-Text Retrieval 1 Video-to-Shop 1 Video-to-Video Synthesis 1 Vision and Language Navigation 1 Visual Crowd Analysis 1 Visual Odometry 1 Visual Reasoning 1 Weakly Supervised Action Segmentation (Transcript) 1 Weakly Supervised Classification 1 Weakly Supervised Object Detection 1 Weakly-supervised 3D Human Pose Estimation 1 Weakly-supervised Temporal Action Localization 1 Word Embeddings 1 audio-visual learning 1 drone-based object tracking 1 eXtreme-Video-Frame-Interpolation 1
Filter by Language
English 88 Chinese 6 German 3 Hindi 3 Korean 3 Portuguese 3 Spanish 3 American Sign Language 2 French 2 Japanese 2 Multilingual 2 Russian 2 Turkish Sign Language 2 Arabic 1 Bengali 1 Greek Sign Language 1 Italian 1 Swiss German 1 Swiss-German Sign Language 1 Telugu 1 Turkish 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Afrikaans 0 Akan 0 Akkadian 0 Akuntsu 0 Albanian 0 Amharic 0 Ancient Greek 0 Apurinã 0 Aragonese 0 Argentine Sign Language 0 Armenian 0 Arpitan 0 Assamese 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bambara 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Basque 0 Bavarian 0 Belarusian 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Bulgarian 0 Burmese 0 Catalan 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Croatian 0 Czech 0 Danish 0 Dhivehi 0 Dimli (individual language) 0 Dutch 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 Erzya 0 Esperanto 0 Estonian 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Finnish 0 Fon 0 Friulian 0 Fulah 0 Gagauz 0 Galician 0 Gan Chinese 0 Ganda 0 Geez 0 Georgian 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Hausa 0 Hawaiian 0 Hebrew 0 Herero 0 Hiri Motu 0 Hungarian 0 Icelandic 0 Ido 0 Igbo 0 Iloko 0 Indonesian 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Irish 0 Jamaican Creole English 0 Javanese 0 Jejueo 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Kazakh 0 Khunsari 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Latin 0 Latvian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Literary Chinese 0 Lithuanian 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Maltese 0 Mandarin Chinese 0 Manipuri 0 Manx 0 Maori 0 Marathi 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Mongolian 0 Moroccan Arabic 0 Mundurukú 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Pidgin 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Norwegian 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Persian 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Polish 0 Pontic 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romanian 0 Romansh 0 Rundi 0 Russia Buriat 0 Rusyn 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Serbian 0 Serbo-Croatian 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Skolt Sami 0 Slovak 0 Slovenian 0 Soi 0 Somali 0 South Azerbaijani 0 South Levantine Arabic 0 Southern Sotho 0 Sranan Tongo 0 Standard Arabic 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swedish 0 Swedish Sign Language 0 Tagalog 0 Tahitian 0 Tai 0 Tajik 0 Tamil 0 Tatar 0 Tetum 0 Thai 0 Tibetan 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Ukrainian 0 Upper Sorbian 0 Urdu 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vietnamese 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 Warlpiri 0 Welsh 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wolof 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Yoruba 0 Yue Chinese 0 Zeeuws 0 Zhuang 0 Zulu 0

540 dataset results for Videos