TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Optical Flow Estimation	KITTI 2015 (train)	VCN	F1-all	25.1	# 14
Optical Flow Estimation	KITTI 2015 (train)	VCN	EPE	8.36	# 12
Optical Flow Estimation	Sintel-final	VCN	Average End-Point Error	4.40	# 16

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/volumetric-correspondence-networks-for/optical-flow-estimation-on-kitti-2015-train)](https://paperswithcode.com/sota/optical-flow-estimation-on-kitti-2015-train?p=volumetric-correspondence-networks-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/volumetric-correspondence-networks-for/optical-flow-estimation-on-sintel-final)](https://paperswithcode.com/sota/optical-flow-estimation-on-sintel-final?p=volumetric-correspondence-networks-for)`

Volumetric Correspondence Networks for Optical Flow

NeurIPS 2019 · Gengshan Yang, Deva Ramanan

Many classic tasks in vision -- such as the estimation of optical flow or stereo disparities -- can be cast as dense correspondence matching. Well-known techniques for doing so make use of a cost volume, typically a 4D tensor of match costs between all pixels in a 2D image and their potential matches in a 2D search window. State-of-the-art (SOTA) deep networks for flow/stereo make use of such volumetric representations as internal layers. However, such layers require significant amounts of memory and compute, making them cumbersome to use in practice. As a result, SOTA networks also employ various heuristics designed to limit volumetric processing, leading to limited accuracy and overfitting. Instead, we introduce several simple modifications that dramatically simplify the use of volumetric layers - (1) volumetric encoder-decoder architectures that efficiently capture large receptive fields, (2) multi-channel cost volumes that capture multi-dimensional notions of pixel similarities, and finally, (3) separable volumetric filtering that significantly reduces computation and parameters while preserving accuracy. Our innovations dramatically improve accuracy over SOTA on standard benchmarks while being significantly easier to work with - training converges in 10X fewer iterations, and most importantly, our networks generalize across correspondence tasks. On-the-fly adaptation of search windows allows us to repurpose optical flow networks for stereo (and vice versa), and can also be used to implement adaptive networks that increase search window sizes on-demand.
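To make the abstract's terms concrete, here is a minimal pure-Python sketch of a 4D cost volume and of the weight savings from separable filtering. It is an illustration under stated assumptions, not the authors' implementation. The names `cost_volume` and `kernel_weights` are hypothetical. The match score is assumed to be a plain dot product of feature vectors, giving a single-channel volume rather than the multi-channel one the paper proposes. The separable filter is assumed to replace one k×k×k×k kernel with two k×k kernels, one over image position and one over displacement.

```python
def cost_volume(feat1, feat2, radius):
    """Build a 4D cost volume cost[y][x][v][u] from two feature maps.

    feat1, feat2: H x W x C nested lists of floats (hypothetical features).
    (v, u) index the displacement (dy, dx) = (v - radius, u - radius).
    Assumption: similarity is a dot product; out-of-bounds matches score 0.0.
    """
    height, width = len(feat1), len(feat1[0])
    size = 2 * radius + 1  # side length of the 2D search window
    volume = [[[[0.0] * size for _ in range(size)]
               for _ in range(width)] for _ in range(height)]
    for y in range(height):
        for x in range(width):
            for v in range(size):
                for u in range(size):
                    ty, tx = y + v - radius, x + u - radius
                    if 0 <= ty < height and 0 <= tx < width:
                        volume[y][x][v][u] = sum(
                            a * b for a, b in zip(feat1[y][x], feat2[ty][tx]))
    return volume


def kernel_weights(k, separable):
    """Weights per input/output channel pair for a 4D filter of width k.

    Assumption: the separable variant uses two k x k kernels in sequence
    instead of one full k x k x k x k kernel.
    """
    return 2 * k ** 2 if separable else k ** 4


# Toy example: feat2 is feat1 shifted one pixel to the right (zero padded).
feat1 = [[[1.0, 0.0], [0.0, 1.0]],
         [[1.0, 1.0], [0.0, 0.0]]]
feat2 = [[[0.0, 0.0], [1.0, 0.0]],
         [[0.0, 0.0], [1.0, 1.0]]]
volume = cost_volume(feat1, feat2, radius=1)

full_weights = kernel_weights(3, separable=False)      # 3**4 = 81
separable_weights = kernel_weights(3, separable=True)  # 2 * 3**2 = 18
```

In this toy example the top-left pixel scores highest at displacement (dy, dx) = (0, +1), which matches the one-pixel shift. The volume holds H × W × (2r + 1)² entries, which is why memory grows quickly with the search radius r.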

Code

gengshan-y/VCN official

152

neu-vig/ezflow

127

Tasks

Optical Flow Estimation

Datasets

KITTI

MPI Sintel

FlyingChairs

Results from the Paper

Ranked #14 on Optical Flow Estimation on KITTI 2015 (train)
