Visual Storytelling

25 papers with code • 1 benchmark • 4 datasets

(Image credit: No Metrics Are Perfect)

Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas

jiaweisii/gorgeous 22 Apr 2024

Contemporary makeup transfer methods primarily focus on replicating makeup from one face to another, considerably limiting their use in creating diverse and creative character makeup essential for visual storytelling.

6

inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE

rossiyareich/inknhue 3 Nov 2023

Yet, the desire to experience manga in vibrant colors has sparked the pursuit of manga colorization, a task of paramount significance for artists.

13

GROOViST: A Metric for Grounding Objects in Visual Storytelling

akskuchi/groovist 26 Oct 2023

A proper evaluation of stories generated for a sequence of images -- the task commonly referred to as visual storytelling -- must consider multiple aspects, such as coherence, grammatical correctness, and visual grounding.

1

Envisioning Narrative Intelligence: A Creative Visual Storytelling Anthology

USArmyResearchLab/ARL-Creative-Visual-Storytelling 6 Oct 2023

In this paper, we collect an anthology of 100 visual stories from authors who participated in our systematic creative process of improvised story-building based on image sequences.

1

TouchStone: Evaluating Vision-Language Models by Language Models

ofa-sys/touchstone 31 Aug 2023

Large vision-language models (LVLMs) have recently witnessed rapid advancements, exhibiting a remarkable capacity for perceiving, understanding, and processing visual information by connecting visual receptor with large language models (LLMs).

71

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

videocrafter/animate-a-story 13 Jul 2023

For the first module, we leverage an off-the-shelf video retrieval system and extract video depths as motion structure.

234

Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models

haoningwu3639/StoryGen 1 Jun 2023

Generative models have recently exhibited exceptional capabilities in text-to-image generation, but still struggle to generate image sequences coherently.

146

Detecting and Grounding Important Characters in Visual Stories

iz2late/VIST-Character 30 Mar 2023

Characters are essential to the plot of any story.

4

Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models

IIT-PAVIS/Positional_Diffusion 20 Mar 2023

We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning.

15

Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning

jaleedkhan/neusire European Semantic Web Conference (ESWC) 2022

These results depict the effectiveness of commonsense knowledge infusion in improving the performance and expressiveness of scene graph generation for visual understanding and reasoning tasks.

7
31 May 2022