TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Frame Interpolation	MSU Video Frame Interpolation	EMA-VFI	PSNR	29.89	# 1
Video Frame Interpolation	MSU Video Frame Interpolation	EMA-VFI	SSIM	0.953	# 1
Video Frame Interpolation	MSU Video Frame Interpolation	EMA-VFI	VMAF	71.71	# 2
Video Frame Interpolation	MSU Video Frame Interpolation	EMA-VFI	LPIPS	0.022	# 2
Video Frame Interpolation	MSU Video Frame Interpolation	EMA-VFI	MS-SSIM	0.965	# 1
Video Frame Interpolation	SNU-FILM (easy)	EMA-VFI	PSNR	39.98	# 6
Video Frame Interpolation	SNU-FILM (easy)	EMA-VFI	SSIM	0.9910	# 2
Video Frame Interpolation	SNU-FILM (extreme)	EMA-VFI	PSNR	25.69	# 3
Video Frame Interpolation	SNU-FILM (extreme)	EMA-VFI	SSIM	0.8661	# 2
Video Frame Interpolation	SNU-FILM (hard)	EMA-VFI	PSNR	30.94	# 3
Video Frame Interpolation	SNU-FILM (hard)	EMA-VFI	SSIM	0.9392	# 2
Video Frame Interpolation	SNU-FILM (medium)	EMA-VFI	PSNR	36.09	# 5
Video Frame Interpolation	SNU-FILM (medium)	EMA-VFI	SSIM	0.9801	# 2
Video Frame Interpolation	UCF101	EMA-VFI	PSNR	35.48	# 1
Video Frame Interpolation	UCF101	EMA-VFI	SSIM	0.9701	# 3
Video Frame Interpolation	Vimeo90K	EMA-VFI	PSNR	36.64	# 2
Video Frame Interpolation	Vimeo90K	EMA-VFI	SSIM	0.9819	# 1
Video Frame Interpolation	X4K1000FPS	EMA-VFI	PSNR	31.46	# 3
Video Frame Interpolation	X4K1000FPS-2K	EMA-VFI	PSNR	32.85	# 1
Video Frame Interpolation	Xiph-2K	EMA-VFI	PSNR	36.90	# 1
Video Frame Interpolation	Xiph-2K	EMA-VFI	SSIM	0.945	# 3
Video Frame Interpolation	Xiph-4k	EMA-VFI	PSNR	34.67	# 1
Video Frame Interpolation	Xiph-4k	EMA-VFI	SSIM	0.907	# 1
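
For readers unfamiliar with the PSNR column above, the following is a minimal sketch of the standard definition, PSNR = 10 · log10(peak² / MSE), reported in dB. It assumes 8-bit pixel values (peak of 255) passed as flat sequences. The function name `psnr` and its parameters are illustrative only, and individual benchmarks may differ in color space, cropping, or how per-frame scores are averaged.

```python
import math

def psnr(reference, predicted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Assumption: pixels are flat sequences of numbers in [0, peak].
    """
    if len(reference) != len(predicted) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error between the ground-truth and interpolated frame.
    mse = sum((r - p) ** 2 for r, p in zip(reference, predicted)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(peak ** 2 / mse)

# Hypothetical toy frames: every pixel is off by exactly one intensity level.
ground_truth = [100, 100, 100, 100]
interpolated = [101, 99, 101, 99]
score = psnr(ground_truth, interpolated)
```

Higher values indicate a closer match to the ground-truth frame, which is why the ranks in the table follow descending PSNR.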

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-msu-video-frame)](https://paperswithcode.com/sota/video-frame-interpolation-on-msu-video-frame?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-ucf101-1)](https://paperswithcode.com/sota/video-frame-interpolation-on-ucf101-1?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-x4k1000fps-2k)](https://paperswithcode.com/sota/video-frame-interpolation-on-x4k1000fps-2k?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-xiph-2k)](https://paperswithcode.com/sota/video-frame-interpolation-on-xiph-2k?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-xiph-4k-1)](https://paperswithcode.com/sota/video-frame-interpolation-on-xiph-4k-1?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-vimeo90k)](https://paperswithcode.com/sota/video-frame-interpolation-on-vimeo90k?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-snu-film-extreme)](https://paperswithcode.com/sota/video-frame-interpolation-on-snu-film-extreme?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-snu-film-hard)](https://paperswithcode.com/sota/video-frame-interpolation-on-snu-film-hard?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-x4k1000fps)](https://paperswithcode.com/sota/video-frame-interpolation-on-x4k1000fps?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-snu-film-medium)](https://paperswithcode.com/sota/video-frame-interpolation-on-snu-film-medium?p=extracting-motion-and-appearance-via-inter)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/extracting-motion-and-appearance-via-inter/video-frame-interpolation-on-snu-film-easy)](https://paperswithcode.com/sota/video-frame-interpolation-on-snu-film-easy?p=extracting-motion-and-appearance-via-inter)`

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

CVPR 2023 · Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang

Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or design separate modules for each type, which leads to representation ambiguity and low efficiency. In this paper, we propose a novel module that explicitly extracts motion and appearance information via a unifying operation. Specifically, we rethink the information processing in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module can be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline reduces the computational complexity of inter-frame attention while preserving detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach has a lower computational overhead than models with comparable performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.
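
The idea of reusing one attention map for both purposes can be illustrated with a small sketch. This is not the authors' implementation: it assumes global attention over a 1-D row of positions, no learned query/key/value projections, and it reads motion as the attention-weighted expected coordinate in the other frame minus the query's own coordinate. The names `inter_frame_attention`, `feat0`, `feat1`, and `coords` are hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def inter_frame_attention(feat0, feat1, coords):
    """Toy inter-frame attention (assumption: a simplified illustration).

    feat0, feat1: per-position feature vectors of frame 0 and frame 1.
    coords: per-position coordinate tuples shared by both frames.
    Returns (appearance, motion) for every position of frame 0.
    """
    dim = len(feat0[0])
    appearance, motion = [], []
    for i, query in enumerate(feat0):
        # Similarity of this frame-0 position to every frame-1 position.
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in feat1]
        attn = softmax(scores)  # the single attention map, used twice below
        # Use 1: appearance, the attention-weighted sum of frame-1 features.
        app = [sum(w * v[c] for w, v in zip(attn, feat1)) for c in range(dim)]
        # Use 2: motion, the expected matched coordinate minus own coordinate.
        expected = [sum(w * p[c] for w, p in zip(attn, coords))
                    for c in range(len(coords[0]))]
        mot = [e - o for e, o in zip(expected, coords[i])]
        appearance.append(app)
        motion.append(mot)
    return appearance, motion

# Hypothetical toy input: frame 1 is frame 0 shifted right by one position
# (with wrap-around), using strongly distinct one-hot-like features.
feat0 = [[20.0 if c == i else 0.0 for c in range(4)] for i in range(4)]
feat1 = [feat0[(j - 1) % 4] for j in range(4)]
coords = [(float(j),) for j in range(4)]
appearance, motion = inter_frame_attention(feat0, feat1, coords)
```

In this toy setup the attention for each non-wrapping position concentrates on its shifted counterpart, so the recovered motion is close to a one-position shift while the appearance output is the matched frame-1 feature. Both results come from the same attention weights, with no separate motion module.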

Code

mcg-nju/ema-vfi (official), 307 stars

Tasks

Video Frame Interpolation

Datasets

UCF101

Middlebury

Vimeo90K

X4K1000FPS

MSU Video Frame Interpolation

Results from the Paper

Ranked #1 on Video Frame Interpolation on MSU Video Frame Interpolation (PSNR metric)

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Temporal attention • Transformer
