REGIS: Refining Generated Videos via Iterative Stylistic Redesigning

Research Square Preprint 2023 · Jason Abohwo

In recent years, generative models have made impressive advances toward realistic output; in particular, models working in the text and audio modalities have reached a level of quality at which generated text or audio cannot be easily distinguished from real text or audio. Despite these revolutionary advancements, the synthesis of realistic and temporally consistent videos is still in its infancy. In this paper, we introduce a novel approach to the creation of realistic videos that focuses on improving a generated video in the latter steps of the video generation process. Specifically, we propose a framework for the iterative refinement of generated videos through repeated passes through a neural network trained to model the spatio-temporal dependencies found in real videos. Through our experiments, we demonstrate that our proposed approach significantly improves upon the generations of text-to-video models and achieves state-of-the-art results on the UCF-101 benchmark, removing the spatio-temporal artifacts and noise that make synthetic videos distinguishable from real videos. In addition, we discuss ways in which this framework might be augmented to achieve better performance.
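
Below is a minimal sketch of the iterative-refinement loop the abstract describes: a generated clip is passed repeatedly through a network that models spatio-temporal dependencies, with each pass applying a predicted correction. The class name, residual formulation, layer sizes, and number of passes here are illustrative assumptions, not the paper's actual architecture.

```python
# Sketch only: a stand-in refiner and the repeated-pass loop described in the
# abstract. All names, shapes, and hyperparameters are hypothetical.
import torch
import torch.nn as nn


class RefinementNet(nn.Module):
    """Placeholder refiner: predicts a residual correction for a video clip
    shaped (batch, channels, frames, height, width)."""

    def __init__(self, channels: int = 3):
        super().__init__()
        # A small 3D-conv stack stands in for the spatio-temporal model.
        self.net = nn.Sequential(
            nn.Conv3d(channels, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv3d(32, channels, kernel_size=3, padding=1),
        )

    def forward(self, video: torch.Tensor) -> torch.Tensor:
        return self.net(video)


@torch.no_grad()
def refine_video(video: torch.Tensor, refiner: nn.Module, num_passes: int = 4) -> torch.Tensor:
    """Repeatedly pass a generated clip through the refiner, applying its
    predicted residual at each step to suppress spatio-temporal artifacts."""
    for _ in range(num_passes):
        video = video + refiner(video)
    return video


if __name__ == "__main__":
    clip = torch.rand(1, 3, 16, 128, 128)  # one 16-frame 128x128 RGB clip
    refined = refine_video(clip, RefinementNet())
    print(refined.shape)  # torch.Size([1, 3, 16, 128, 128])
```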

Datasets

UCF-101

Results from the Paper


Task                     | Dataset | Model                                              | Metric Name | Metric Value | Global Rank
Text-to-Video Generation | UCF-101 | REGIS-Fuse (Finetuning, 128x128)                   | FVD16       | 141          | #1
Video Generation         | UCF-101 | REGIS-Fuse (Finetuning, 128x128, text-conditional) | FVD16       | 141          | #5

Methods


No methods listed for this paper.