TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Reconstruction	MGif	FOMM	L1	0.0223	# 2
Video Reconstruction	MGif	Siarohin et al.	L1	0.0206	# 1
Video Reconstruction	Tai-Chi-HD (256)	FOMM	L1	0.056	# 2
Video Reconstruction	Tai-Chi-HD (256)	FOMM	AED	0.172	# 2
Video Reconstruction	Tai-Chi-HD (256)	FOMM	AKD	6.53	# 2
Video Reconstruction	Tai-Chi-HD (256)	FOMM	MKR	0.033	# 2
Video Reconstruction	Tai-Chi-HD (256)	Siarohin et al.	L1	0.047	# 1
Video Reconstruction	Tai-Chi-HD (256)	Siarohin et al.	AED	0.152	# 1
Video Reconstruction	Tai-Chi-HD (256)	Siarohin et al.	AKD	5.58	# 1
Video Reconstruction	Tai-Chi-HD (256)	Siarohin et al.	MKR	0.027	# 1
Video Reconstruction	Tai-Chi-HD (512)	FOMM	L1	0.075	# 2
Video Reconstruction	Tai-Chi-HD (512)	FOMM	AKD	17.12	# 2
Video Reconstruction	Tai-Chi-HD (512)	FOMM	MKR	0.066	# 2
Video Reconstruction	Tai-Chi-HD (512)	FOMM	AED	0.203	# 2
Video Reconstruction	Tai-Chi-HD (512)	Siarohin et al.	L1	0.064	# 1
Video Reconstruction	Tai-Chi-HD (512)	Siarohin et al.	AKD	13.86	# 1
Video Reconstruction	Tai-Chi-HD (512)	Siarohin et al.	MKR	0.043	# 1
Video Reconstruction	Tai-Chi-HD (512)	Siarohin et al.	AED	0.172	# 1
Video Reconstruction	TED-talks	Siarohin et al.	L1	0.026	# 1
Video Reconstruction	TED-talks	Siarohin et al.	AKD	3.75	# 2
Video Reconstruction	TED-talks	Siarohin et al.	MKR	0.007	# 1
Video Reconstruction	TED-talks	Siarohin et al.	AED	0.114	# 1
Video Reconstruction	TED-talks	FOMM	L1	0.033	# 2
Video Reconstruction	TED-talks	FOMM	AKD	7.07	# 1
Video Reconstruction	TED-talks	FOMM	MKR	0.014	# 2
Video Reconstruction	TED-talks	FOMM	AED	0.163	# 2
Video Reconstruction	VoxCeleb	Siarohin et al.	L1	0.040	# 1
Video Reconstruction	VoxCeleb	Siarohin et al.	AKD	1.28	# 2
Video Reconstruction	VoxCeleb	Siarohin et al.	AED	0.133	# 1
Video Reconstruction	VoxCeleb	FOMM	L1	0.041	# 2
Video Reconstruction	VoxCeleb	FOMM	AKD	1.27	# 1
Video Reconstruction	VoxCeleb	FOMM	AED	0.134	# 2
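
To make the gaps in the table above easier to read, the following sketch computes the relative reduction in L1 error of Siarohin et al. over FOMM on each dataset. The values are copied from the table; the helper name `relative_reduction` is hypothetical, and the calculation assumes that lower L1 is better, as is usual for a reconstruction error.

```python
# L1 values from the results table: (FOMM, Siarohin et al.) per dataset.
l1 = {
    "MGif": (0.0223, 0.0206),
    "Tai-Chi-HD (256)": (0.056, 0.047),
    "Tai-Chi-HD (512)": (0.075, 0.064),
    "TED-talks": (0.033, 0.026),
    "VoxCeleb": (0.041, 0.040),
}


def relative_reduction(baseline: float, new: float) -> float:
    """Fraction by which `new` is lower than `baseline` (hypothetical helper)."""
    return (baseline - new) / baseline


reductions = {name: relative_reduction(b, n) for name, (b, n) in l1.items()}

for name, r in reductions.items():
    print(f"{name}: L1 lower by {100 * r:.1f}%")
```

The same helper can be applied to the AED, AKD, and MKR rows, which are also error-style metrics.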

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-representations-for-articulated-1/video-reconstruction-on-mgif)](https://paperswithcode.com/sota/video-reconstruction-on-mgif?p=motion-representations-for-articulated-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-representations-for-articulated-1/video-reconstruction-on-tai-chi-hd-256)](https://paperswithcode.com/sota/video-reconstruction-on-tai-chi-hd-256?p=motion-representations-for-articulated-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-representations-for-articulated-1/video-reconstruction-on-tai-chi-hd-512)](https://paperswithcode.com/sota/video-reconstruction-on-tai-chi-hd-512?p=motion-representations-for-articulated-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-representations-for-articulated-1/video-reconstruction-on-ted-talks)](https://paperswithcode.com/sota/video-reconstruction-on-ted-talks?p=motion-representations-for-articulated-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-representations-for-articulated-1/video-reconstruction-on-voxceleb)](https://paperswithcode.com/sota/video-reconstruction-on-voxceleb?p=motion-representations-for-articulated-1)`

Motion Representations for Articulated Animation

CVPR 2021 · Aliaksandr Siarohin, Oliver J. Woodford, Jian Ren, Menglei Chai, Sergey Tulyakov

We propose novel motion representations for animating articulated objects consisting of distinct parts. In a completely unsupervised manner, our method identifies object parts, tracks them in a driving video, and infers their motions by considering their principal axes. In contrast to previous keypoint-based work, our method extracts meaningful and consistent regions that describe location, shape, and pose. The regions correspond to semantically relevant and distinct object parts that are more easily detected in frames of the driving video. To force decoupling of foreground from background, we model non-object-related global motion with an additional affine transformation. To facilitate animation and prevent leakage of the shape of the driving object, we disentangle the shape and pose of objects in the region space. Our model can animate a variety of objects, surpassing previous methods by a large margin on existing benchmarks. We present a challenging new benchmark with high-resolution videos and show that the improvement is particularly pronounced when articulated objects are considered, reaching 96.6% user preference vs. the state of the art.
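
The abstract says that part motion is inferred from each region's principal axes. As a rough illustration only, and not the authors' implementation, the sketch below treats a region as a non-negative 2-D heatmap, computes its weighted mean and covariance, and takes the covariance eigenvectors as principal axes. The function name `region_principal_axes` and the toy heatmap are assumptions made for this example.

```python
import numpy as np


def region_principal_axes(heatmap: np.ndarray):
    """Weighted mean, principal axes, and axis scales of a 2-D heatmap.

    Hypothetical helper: a generic PCA over pixel coordinates weighted by
    the heatmap, not code taken from the paper's repository.
    """
    h, w = heatmap.shape
    p = (heatmap / heatmap.sum()).ravel()          # weights summing to 1
    ys, xs = np.mgrid[0:h, 0:w]
    coords = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(float)

    mean = p @ coords                              # (x, y) centre of the region
    centered = coords - mean
    cov = (centered * p[:, None]).T @ centered     # 2x2 weighted covariance

    eigvals, eigvecs = np.linalg.eigh(cov)         # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]              # largest variance first
    scales = np.sqrt(np.clip(eigvals[order], 0.0, None))
    axes = eigvecs[:, order]                       # columns are unit axes
    return mean, axes, scales


# Toy region: a horizontal bar, so the first axis should lie along x.
heatmap = np.zeros((32, 64))
heatmap[14:18, 10:50] = 1.0
mean, axes, scales = region_principal_axes(heatmap)
print("centre:", mean)
print("first axis:", axes[:, 0], "scales:", scales)
```

Comparing the mean, axes, and scales of the same region between a source frame and a driving frame gives a translation, rotation, and scaling for that part, which is the kind of per-region motion the abstract describes.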


Code


snap-research/articulated-animation (official) · 1,183 GitHub stars

AliaksandrSiarohin/first-order-model · 14,197 GitHub stars

Tasks


Object

Video Reconstruction

Datasets

Introduced in the Paper:

TED-talks

Used in the Paper:

VoxCeleb1

MGif

Tai-Chi-HD

Results from the Paper


Ranked #1 on Video Reconstruction on Tai-Chi-HD (512)

