Search Results for author: Daechul Ahn

Found 3 papers, 2 papers with code

Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

no code implementations6 Feb 2024 Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi

Recent advancements in large language models have influenced the development of video large multimodal models (VLMMs).

Story Visualization by Online Text Augmentation with Context Memory

1 code implementation ICCV 2023 Daechul Ahn, Daneul Kim, Gwangmo Song, Seung Hwan Kim, Honglak Lee, Dongyeop Kang, Jonghyun Choi

Story visualization (SV) is a challenging text-to-image generation task for the difficulty of not only rendering visual details from the text descriptions but also encoding a long-term context across multiple sentences.

Sentence Story Visualization +2

Zero-shot Natural Language Video Localization

1 code implementation ICCV 2021 Jinwoo Nam, Daechul Ahn, Dongyeop Kang, Seong Jong Ha, Jonghyun Choi

Understanding videos to localize moments with natural language often requires large expensive annotated video regions paired with language queries.

Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.