Optical Flow Estimation

650 papers with code • 10 benchmarks • 33 datasets

Optical Flow Estimation is a computer vision task that involves computing the motion of objects in an image or a video sequence. The goal of optical flow estimation is to determine the movement of pixels or features in the image, which can be used for various applications such as object tracking, motion analysis, and video compression.

Approaches for optical flow estimation include correlation-based, block-matching, feature tracking, energy-based, and more recently gradient-based.

Further readings:

Definition source: Devon: Deformable Volume Network for Learning Optical Flow

Image credit: Optical Flow Estimation

Libraries

Use these libraries to find Optical Flow Estimation models and implementations
9 papers
891
5 papers
128
5 papers
128

RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow Estimation

hmorimitsu/ptlflow IEEE International Conference on Robotics and Automation (ICRA) 2024

Extracting motion information from videos with optical flow estimation is vital in multiple practical robot applications.

197
01 May 2024

Moving Object Segmentation: All You Need Is SAM (and Flow)

Jyxarthur/flowsam 18 Apr 2024

The objective of this paper is motion segmentation -- discovering and segmenting the moving objects in a video.

125
18 Apr 2024

DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping

great-whu/dba-fusion 20 Mar 2024

Visual simultaneous localization and mapping (VSLAM) has broad applications, with state-of-the-art methods leveraging deep neural networks for better robustness and applicability.

27
20 Mar 2024

NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices

neufieldrobotics/neuflow 15 Mar 2024

Given the features of the input images extracted at different spatial resolutions, global matching is employed to estimate an initial optical flow on the 1/16 resolution, capturing large displacement, which is then refined on the 1/8 resolution with lightweight CNN layers for better accuracy.

41
15 Mar 2024

Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation

wpr1018001/rethinking-low-quality-optical-flow 15 Mar 2024

Video-based surgical instrument segmentation plays an important role in robot-assisted surgeries.

10
15 Mar 2024

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

bigai-nlco/lstp-chat 25 Feb 2024

Despite progress in video-language modeling, the computational challenge of interpreting long-form videos in response to task-specific linguistic queries persists, largely due to the complexity of high-dimensional video data and the misalignment between language and visual cues over space and time.

15
25 Feb 2024

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Yui010206/CREMA 8 Feb 2024

Furthermore, we propose a fusion module designed to compress multimodal queries, maintaining computational efficiency in the LLM while combining additional modalities.

18
08 Feb 2024

Motion-Aware Video Frame Interpolation

zdyshine/Video-Frame-Interpolation-Summary 5 Feb 2024

Subsequently, a cross-scale motion structure is presented to estimate and refine intermediate flow maps by the extracted features.

95
05 Feb 2024

Taylor Videos for Action Recognition

leiwangr/video-ar 5 Feb 2024

Addressing these challenges, we propose the Taylor video, a new video format that highlights the dominate motions (e. g., a waving hand) in each of its frames named the Taylor frame.

5
05 Feb 2024

Recurrent Partial Kernel Network for Efficient Optical Flow Estimation

hmorimitsu/ptlflow The 38th Annual AAAI Conference on Artificial Intelligence (AAAI) 2024

However, this impacts the widespread adoption of optical flow methods and makes it harder to train more general models since the optical flow data is hard to obtain.

197
01 Feb 2024