no code implementations • 18 Mar 2025 • Susung Hong, Ira Kemelmacher-Shlizerman, Brian Curless, Steven M. Seitz
We introduce MusicInfuser, an approach for generating high-quality dance videos that are synchronized to a specified music track.
no code implementations • 6 Dec 2024 • Susung Hong, Johanna Karras, Ricardo Martin-Brualla, Ira Kemelmacher-Shlizerman
Combined with the generative process, we impose identity-preserving gradients to refine the edited NeRF.
no code implementations • 27 Nov 2024 • Junha Hyung, Kinam Kim, Susung Hong, Min-Jung Kim, Jaegul Choo
Our contributions include: (1) introducing STG as an efficient, high-performing guidance technique for video diffusion models, (2) eliminating the need for auxiliary models by simulating a weak model through layer skipping, and (3) ensuring quality-enhanced guidance without compromising sample diversity or dynamics unlike CFG.
1 code implementation • 1 Aug 2024 • Susung Hong
Recent attempts to extend guidance to unconditional models have relied on heuristic techniques, resulting in suboptimal generation quality and unintended effects.
no code implementations • 17 Jun 2024 • Junha Hyung, Susung Hong, Sungwon Hwang, Jaeseong Lee, Jaegul Choo, Jin-Hwa Kim
3D reconstruction from multi-view images is one of the fundamental challenges in computer vision and graphics.
1 code implementation • 5 Feb 2024 • Junyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim
We leverage the retrieved asset to incorporate its geometric prior in the variational objective and adapt the diffusion model's 2D prior toward view consistency, achieving drastic improvements in both geometry and fidelity of generated scenes.
1 code implementation • 23 May 2023 • Susung Hong, Junyoung Seo, Heeseong Shin, Sunghwan Hong, Seungryong Kim
In the paradigm of AI-generated content (AIGC), there has been increasing attention to transferring knowledge from pre-trained text-to-image (T2I) models to text-to-video (T2V) generation.
1 code implementation • NeurIPS 2023 • Susung Hong, Donghoon Ahn, Seungryong Kim
In this work, we explore existing frameworks for score-distilling text-to-3D generation and identify the main causes of the view inconsistency problem -- the embedded bias of 2D diffusion models.
1 code implementation • 17 Dec 2022 • Gyeongnyeon Kim, Wooseok Jang, Gyuseong Lee, Susung Hong, Junyoung Seo, Seungryong Kim
Generative models have recently undergone significant advancement due to the diffusion models.
1 code implementation • 6 Oct 2022 • Sunghwan Hong, Jisu Nam, Seokju Cho, Susung Hong, Sangryul Jeon, Dongbo Min, Seungryong Kim
Existing pipelines of semantic correspondence commonly include extracting high-level semantic features for the invariance against intra-class variations and background clutters.
5 code implementations • ICCV 2023 • Susung Hong, Gyuseong Lee, Wooseok Jang, Seungryong Kim
Denoising diffusion models (DDMs) have attracted attention for their exceptional generation quality and diversity.