Reinforcement Learning (RL)
Visual Question Answering (VQA)
Medical Image Segmentation
3D Object Super-Resolution
Optical Character Recognition (OCR)
Facial Recognition and Modelling
Multi-Label Classification
Video Object Segmentation
Out-of-Distribution Detection
Video Semantic Segmentation
Explainable artificial intelligence
Explainable Artificial Intelligence (XAI)
3D Point Cloud Classification
3D Shape Reconstruction from Videos
Visual Question Answering
3D Point Cloud Interpolation
Stereo Disparity Estimation
Human-Object Interaction Detection
Point Cloud Classification
Image-to-Image Translation
Multimodal Large Language Model
Document Text Classification
3D Point Cloud Reconstruction
Temporal Action Localization
3D Absolute Human Pose Estimation
Few-Shot Object Detection
Few-Shot Semantic Segmentation
Privacy Preserving Deep Learning
Few-Shot Transfer Learning for Saliency Prediction
Referring Expression Segmentation
Sign Language Recognition
Video Instance Segmentation
Vision-Language Navigation
Weakly supervised segmentation
Class Incremental Learning
LIDAR Semantic Segmentation
RGB Salient Object Detection
Open Vocabulary Semantic Segmentation
3D Multi-Person Pose Estimation
Zero-Shot Image Classification
Facial Landmark Detection
3D Character Animation From A Single Photo
Multi-Label Image Classification
Physics-informed machine learning
Decision Making Under Uncertainty
Handwritten Text Recognition
Vehicle Re-Identification
Sign Language Translation
satellite image super-resolution
Semi-Supervised Object Detection
Action Recognition In Videos
Facial Expression Recognition (FER)
Key Information Extraction
Infrared And Visible Image Fusion
Class-Incremental Semantic Segmentation
Single-Image-Based Hdr Reconstruction
Surgical phase recognition
Breast Cancer Histology Image Classification
Transparent Object Detection
Zero-Shot Action Recognition
Camouflaged Object Segmentation
Natural Language Transduction
Image Manipulation Detection
Abnormal Event Detection In Video
Action Quality Assessment
Multi-View 3D Reconstruction
Pedestrian Attribute Recognition
Temporal Action Segmentation
Weakly Supervised Action Localization
Probabilistic Deep Learning
Surface Normals Estimation
Zero-Shot Video Retrieval
Activity Recognition In Videos
Language Model Evaluation
Personalized Image Generation
Computer Vision Techniques Adopted in 3D Cryogenic Electron Microscopy
Simultaneous Localization and Mapping
Text based Person Retrieval
Universal Domain Adaptation
Few-Shot Image Classification
Zero-Shot Semantic Segmentation
Diffusion Personalization
Intrinsic Image Decomposition
Referring expression generation
Semi-Supervised Video Object Segmentation
Composed Image Retrieval (CoIR)
Document Image Classification
Multispectral Object Detection
Content-Based Image Retrieval
Multi-target Domain Adaptation
Unsupervised Object Segmentation
Human Interaction Recognition
Weakly-supervised instance segmentation
Precipitation Forecasting
Synthetic Image Detection
3D Point Cloud Linear Classification
Bird's-Eye View Semantic Segmentation
Space-time Video Super-resolution
Dense Pixel Correspondence Estimation
Image Manipulation Localization
Multi-Object Tracking and Segmentation
Temporal Sentence Grounding
Zero-Shot Transfer Image Classification
No-Reference Image Quality Assessment
Semi-Supervised Image Classification
Shape Representation Of 3D Point Clouds
Subject-driven Video Generation
3D Semantic Occupancy Prediction
Full reference image quality assessment
Text-based Person Retrieval
Source-Free Domain Adaptation
Handwritten Mathmatical Expression Recognition
Unsupervised Semantic Segmentation
3D Multi-Person Pose Estimation (absolute)
Anatomical Landmark Detection
Audio-Visual Synchronization
Open Vocabulary Object Detection
Open Vocabulary Panoptic Segmentation
Image/Document Clustering
Lung Nodule Classification
3D Multi-Person Pose Estimation (root-relative)
3D Semantic Scene Completion
Grounded Situation Recognition
Semi-Supervised Domain Generalization
Semi-Supervised Instance Segmentation
Single-View 3D Reconstruction
Unsupervised Image-To-Image Translation
Visual Relationship Detection
Point Cloud Super Resolution
Short-term Object Interaction Anticipation
Spatio-Temporal Video Grounding
Unified Image Restoration
3D Shape Reconstruction From A Single 2D Image
ENF (Electric Network Frequency) Extraction
3D Open-Vocabulary Instance Segmentation
Event data classification
Medical Image Enhancement
Multi-class Classification
Sketch-to-Image Translation
2D Semantic Segmentation task 3 (25 classes)
Birds Eye View Object Detection
Seeing Beyond the Visible
Training-free 3D Point Cloud Classification
Explanatory Visual Question Answering
Infrared image super-resolution
Instance Shadow Detection
continual anomaly detection
Conditional Image Generation
Facial Expression Recognition
Multimodal Machine Translation
Generalized Referring Expression Comprehension
Keypoint detection and image matching
Lifelike 3D Human Generation
Long Video Retrieval (Background Removed)
Online Vectorized HD Map Construction
Personality Trait Recognition
Personalized Segmentation
Repetitive Action Counting
Segmentation Of Remote Sensing Imagery
Text-based Person Retrieval with Noisy Correspondence
The Semantic Segmentation Of Remote Sensing Imagery
Unsupervised 3D Point Cloud Linear Evaluation
Visual Speech Recognition
Unsupervised Anomaly Detection
Unsupervised Panoptic Segmentation
Age and Gender Estimation
Audio-visual Question Answering
Generative 3D Object Classification
Synthetic Image Attribution
Traffic Accident Detection
Unsupervised Landmark Detection
Visual Social Relationship Recognition
Calving Front Delineation In Synthetic Aperture Radar Imagery
Continual Semantic Segmentation
Semi-supervised Anomaly Detection
Historical Color Image Dating
Image and Video Forgery Detection
Marine Animal Segmentation
Multi-modal image segmentation
Part-aware Panoptic Segmentation
Part-based Representation Learning
Prediction Of Occupancy Grid Maps
Reasoning Video Object Segmentation
Spatial Relation Recognition
Supervised Image Retrieval
Unsupervised Anomaly Detection with Specified Settings -- 0.1% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 1% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 10% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 20% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 30% anomaly
Zero-shot Text-to-Video Generation
Zero-shot skeleton-based action recognition
Video Frame Interpolation
Brain Visual Reconstruction
Micro-expression Generation
Few Shot Open Set Object Detection
Grounded Multimodal Named Entity Recognition
Hierarchical Text Segmentation
Human-Object Interaction Concept Discovery
Manufacturing Quality Control
Multi-Person Pose Estimation
Query focused video summarization
Retinal Vessel Segmentation
Semi-Supervised Video Classification
Single-Source Domain Generalization
Specular Reflection Mitigation
Training-free 3D Part Segmentation
Unsupervised Image Decomposition
Video Individual Counting
Vietnamese Multimodal Learning
Weakly Supervised 3D Point Cloud Segmentation
Weakly-supervised panoptic segmentation
drone-based object tracking
4D Spatio Temporal Semantic Segmentation
Animal Action Recognition
Clothing Attribute Recognition
Damaged Building Detection
Dynamic Texture Recognition
Fine-Grained Vehicle Classification
Flooded Building Segmentation
Generalized Zero-Shot Learning - Unseen
Human Instance Segmentation
Human fMRI response prediction
Human-Object Interaction Anticipation
Micro-gesture Recognition
Multi-Person Pose Estimation and Tracking
Open Vocabulary Action Detection
Partial Video Copy Detection
Perpetual View Generation
Physical Attribute Prediction
Prompt-driven Zero-shot Domain Adaptation
Pupil Diameter Estimation
Safety Perception Recognition
Single-shot HDR Reconstruction
Sketch-Based Image Retrieval
Unsupervised Instance Segmentation
Vehicle Key-Point and Orientation Estimation
Video-to-image Affordance Grounding
Visual Sentiment Prediction
human-scene contact detection
self-supervised scene text recognition
spatial-aware image editing
Action Quality Assessment Report Generation
Audio-Video Question Answering (AVQA)
Concept-based Classification
Constrained Diffeomorphic Image Registration
Continuous Affect Estimation
Document Image Skew Estimation
Fashion Compatibility Learning
Fine-Grained Image Classification
Generative Temporal Nursing
IFC Entity Classification
Image Similarity Detection
Image-To-Gps Verification
Image-based Automatic Meter Reading
Indoor Scene Reconstruction
Interactive 3D Instance Segmentation
Laminar-Turbulent Flow Localisation
Language-Based Temporal Localization
Linear Probing Object-Level 3D Awareness
MLLM Aesthetic Evaluation
MLLM Evaluation: Aesthetics
Mental Workload Estimation
Motion Expressions Guided Video Segmentation
Multi-Oriented Scene Text Detection
Multi-object colocalization
Multilingual Text-to-Image Generation
Multimodal Emotion Recognition
Occluded 3D Object Symmetry Detection
Open Set Video Captioning
Open-Vocabulary Semantic Segmentation
Partial Point Cloud Matching
Partially View-aligned Multi-view Learning
Personality Trait Recognition by Face
Point cloud classification dataset
Point- of-no-return (PNR) temporal localization
Pose Contrastive Learning
Procedure Step Recognition
Prostate Zones Segmentation
Pulmorary Vessel Segmentation
Reference Expression Generation
Referring Multi-Object Tracking
Semi-Supervised Image Regression
State Change Object Detection
Streaming video understanding
Surface Normals Estimation from Point Clouds
Transform A Video Into A Comics
Unsupervised Long Term Person Re-Identification
Unsupervised Zero-Shot Panoptic Segmentation
Video Correspondence Flow
Weakly Supervised Referring Expression Segmentation
Yield Mapping In Apple Orchards
eXtreme-Video-Frame-Interpolation
lidar absolute pose regression
video narration captioning
3D Canonical Hand Pose Estimation
Atomic action recognition
Calving Front Delineation From Synthetic Aperture Radar Imagery
Computer Vision Transduction
Crosslingual Text-to-Image Generation
Document To Image Conversion
Frame Duplication Detection
Hyperspectral Image Segmentation
Image Operation Chain Detection
Kinematic Based Workflow Recognition
Motion Detection In Non-Stationary Scenes
Multi Class Classification (Four-level Video Classification)
Satellite Orbit Determination
Segmentation Based Workflow Recognition
Sperm Morphology Classification
Temperature Prediction Using Specklegrams
Video & Kinematic Base Workflow Recognition
Video Based Workflow Recognition
Video, Kinematic & Segmentation Base Workflow Recognition
Weakly-supervised Temporal Action Localization