TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Self-Supervised Action Recognition	UCF101	VideoGan (C3D)	3-fold Accuracy	52.1	# 51
Self-Supervised Action Recognition	UCF101	VideoGan (C3D)	Pre-Training Dataset	UCF101	# 1
Self-Supervised Action Recognition	UCF101	VideoGan (C3D)	Frozen	false	# 1
Video Generation	UCF-101 16 frames, 64x64, Unconditional	VGAN	Inception Score	8.18	# 7
Video Generation	UCF-101 16 frames, Unconditional, Single GPU	VGAN	Inception Score	8.18	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generating-videos-with-scene-dynamics/video-generation-on-ucf-101-16-frames-64x64)](https://paperswithcode.com/sota/video-generation-on-ucf-101-16-frames-64x64?p=generating-videos-with-scene-dynamics)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generating-videos-with-scene-dynamics/video-generation-on-ucf-101-16-frames)](https://paperswithcode.com/sota/video-generation-on-ucf-101-16-frames?p=generating-videos-with-scene-dynamics)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generating-videos-with-scene-dynamics/self-supervised-action-recognition-on-ucf101)](https://paperswithcode.com/sota/self-supervised-action-recognition-on-ucf101?p=generating-videos-with-scene-dynamics)`

Generating Videos with Scene Dynamics

NeurIPS 2016 · Carl Vondrick, Hamed Pirsiavash, Antonio Torralba ·

We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g. action classification) and video generation tasks (e.g. future prediction). We propose a generative adversarial network for video with a spatio-temporal convolutional architecture that untangles the scene's foreground from the background. Experiments suggest this model can generate tiny videos up to a second at full frame rate better than simple baselines, and we show its utility at predicting plausible futures of static images. Moreover, experiments and visualizations show the model internally learns useful features for recognizing actions with minimal supervision, suggesting scene dynamics are a promising signal for representation learning. We believe generative video models can impact many applications in video understanding and simulation.

PDF Abstract NeurIPS 2016 PDF NeurIPS 2016 Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Classification

Future prediction

General Classification

Generative Adversarial Network

Representation Learning

Self-Supervised Action Recognition

Video Generation

Video Recognition

Video Understanding

Datasets

UCF101

Results from the Paper

Edit

Ranked #7 on Video Generation on UCF-101 16 frames, Unconditional, Single GPU

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Self-Supervised Action Recognition	UCF101	VideoGan (C3D)	3-fold Accuracy	52.1	# 51	Compare
			Pre-Training Dataset	UCF101	# 1	Compare
			Frozen	false	# 1	Compare
Video Generation	UCF-101 16 frames, 64x64, Unconditional	VGAN	Inception Score	8.18	# 7	Compare
Video Generation	UCF-101 16 frames, Unconditional, Single GPU	VGAN	Inception Score	8.18	# 7	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Generating Videos with Scene Dynamics

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove