( Image credit: No Metrics Are Perfect )
Previous storytelling approaches have mostly focused on optimizing traditional metrics such as BLEU, ROUGE, and CIDEr.
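To illustrate the kind of n-gram overlap metric these approaches optimize, here is a minimal sentence-level BLEU sketch: clipped n-gram precision, a geometric mean over orders 1..4, and a brevity penalty. The function names (`ngrams`, `sentence_bleu`) and the epsilon smoothing are assumptions for this sketch; real evaluations typically use multi-reference corpus BLEU from an established toolkit rather than this single-reference version.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def sentence_bleu(reference, candidate, max_n=4):
    """Single-reference sentence BLEU: geometric mean of clipped
    n-gram precisions, scaled by a brevity penalty."""
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        # clip each candidate n-gram count by its count in the reference
        overlap = sum(min(c, ref[g]) for g, c in cand.items())
        total = max(sum(cand.values()), 1)
        # tiny epsilon so log() is defined when an order has no matches
        precisions.append(max(overlap, 1e-9) / total)
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / max_n)
    # brevity penalty punishes candidates shorter than the reference
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / max(len(candidate), 1))
    return bp * geo_mean

ref = ["a", "man", "rides", "a", "bike", "down", "the", "street"]
print(sentence_bleu(ref, ref))  # identical sentences score 1.0
```

A perfect match scores 1.0, while a short or divergent candidate scores near zero, which is why optimizing such metrics tends to reward surface overlap rather than narrative coherence.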
The visual storytelling (VST) task aims at generating a reasonable and coherent paragraph-level story with the image stream as input.
We present a neural model for generating short stories from image sequences, which extends the image description model of Vinyals et al. (2015).
The task of multi-image cued story generation, as in the Visual Storytelling dataset (VIST) challenge, is to compose multiple coherent sentences from a given sequence of images.
Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams remains largely untapped.
SOTA for Visual Storytelling on VIST