TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Frame Interpolation	Vid4 - 4x upscaling	Zooming Slow-Mo	PSNR	26.31	# 4
Video Frame Interpolation	Vid4 - 4x upscaling	Zooming Slow-Mo	SSIM	0.7976	# 4
Video Frame Interpolation	Vid4 - 4x upscaling	Zooming Slow-Mo	Parameters	11100000	# 1
Video Frame Interpolation	Vid4 - 4x upscaling	Zooming Slow-Mo	runtime (s)	0.0606	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/zooming-slow-mo-fast-and-accurate-one-stage/video-frame-interpolation-on-vid4-4x)](https://paperswithcode.com/sota/video-frame-interpolation-on-vid4-4x?p=zooming-slow-mo-fast-and-accurate-one-stage)`

Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution

CVPR 2020 · Xiaoyu Xiang, Yapeng Tian, Yulun Zhang, Yun Fu, Jan P. Allebach, Chenliang Xu ·

In this paper, we explore the space-time video super-resolution task, which aims to generate a high-resolution (HR) slow-motion video from a low frame rate (LFR), low-resolution (LR) video. A simple solution is to split it into two sub-tasks: video frame interpolation (VFI) and video super-resolution (VSR). However, temporal interpolation and spatial super-resolution are intra-related in this task. Two-stage methods cannot fully take advantage of the natural property. In addition, state-of-the-art VFI or VSR networks require a large frame-synthesis or reconstruction module for predicting high-quality video frames, which makes the two-stage methods have large model sizes and thus be time-consuming. To overcome the problems, we propose a one-stage space-time video super-resolution framework, which directly synthesizes an HR slow-motion video from an LFR, LR video. Rather than synthesizing missing LR video frames as VFI networks do, we firstly temporally interpolate LR frame features in missing LR video frames capturing local temporal contexts by the proposed feature temporal interpolation network. Then, we propose a deformable ConvLSTM to align and aggregate temporal information simultaneously for better leveraging global temporal contexts. Finally, a deep reconstruction network is adopted to predict HR slow-motion video frames. Extensive experiments on benchmark datasets demonstrate that the proposed method not only achieves better quantitative and qualitative performance but also is more than three times faster than recent two-stage state-of-the-art methods, e.g., DAIN+EDVR and DAIN+RBPN.

PDF Abstract CVPR 2020 PDF CVPR 2020 Abstract

Code

Add Remove Mark official

Mukosame/Zooming-Slow-Mo-CVPR-2020 official

896

YapengTian/TDAN_VSR

401

YapengTian/TDAN-VSR-CVPR-2020

401

Tasks

Add Remove

Space-time Video Super-resolution

Super-Resolution

Video Frame Interpolation

Video Super-Resolution

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Edit

Ranked #4 on Video Frame Interpolation on Vid4 - 4x upscaling

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Frame Interpolation	Vid4 - 4x upscaling	Zooming Slow-Mo	PSNR	26.31	# 4	Compare
			SSIM	0.7976	# 4	Compare
			Parameters	11100000	# 1	Compare
			runtime (s)	0.0606	# 1	Compare

Methods

Add Remove

ConvLSTM • Convolution • Sigmoid Activation • Tanh Activation

Edit Social Preview

Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove