Elevating Flow-Guided Video Inpainting with Reference Generation

12 Dec 2024 · Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee

Video inpainting (VI) is a challenging task that requires effective propagation of observable content across frames while simultaneously generating new content not present in the original video. In this study, we propose a robust and practical VI framework that leverages a large generative model for reference generation in combination with an advanced pixel propagation algorithm. Powered by a strong generative model, our method not only significantly enhances frame-level quality for object removal but also synthesizes new content in the missing areas based on user-provided text prompts. For pixel propagation, we introduce a one-shot pixel pulling method that effectively avoids error accumulation from repeated sampling while maintaining sub-pixel precision. To evaluate various VI methods in realistic scenarios, we also propose a high-quality VI benchmark, HQVI, comprising carefully generated videos using alpha matte composition. On public benchmarks and the HQVI dataset, our method demonstrates significantly higher visual quality and metric scores compared to existing solutions. Furthermore, it can process high-resolution videos exceeding 2K resolution with ease, underscoring its superiority for real-world applications.
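
The abstract mentions that HQVI videos are generated with alpha matte composition but does not give the pipeline. As a minimal sketch of the standard compositing rule only (out = alpha * foreground + (1 - alpha) * background), and not the authors' actual benchmark code, the following blends one row of grayscale pixels. The function name `composite` and the sample values are hypothetical and chosen purely for illustration.

```python
def composite(fg, bg, alpha):
    """Blend foreground over background per pixel: out = a*fg + (1-a)*bg.

    All three inputs are equal-length lists of floats in [0, 1].
    This is the generic alpha compositing rule, assumed here for
    illustration; the HQVI generation pipeline may differ in detail.
    """
    if not (len(fg) == len(bg) == len(alpha)):
        raise ValueError("fg, bg, and alpha must have the same length")
    return [a * f + (1.0 - a) * b for f, b, a in zip(fg, bg, alpha)]


# Hypothetical one-row grayscale example.
foreground = [1.0, 1.0, 1.0]   # object to paste in
background = [0.0, 0.5, 0.2]   # clean background frame
matte = [0.0, 0.5, 1.0]        # 0 = background only, 1 = foreground only

result = composite(foreground, background, matte)
print(result)
```

Under this reading, the clean background would serve as ground truth for the removed object, and a soft matte produces the partially blended boundary pixels (the middle value above) that hard binary masks do not model.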



Results from the Paper


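| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Video Inpainting | HQVI (240p) | RGVI | PSNR | 30.66 | # 2 |
| Video Inpainting | HQVI (240p) | RGVI | SSIM | 0.9527 | # 2 |
| Video Inpainting | HQVI (240p) | RGVI | LPIPS | 0.0335 | # 1 |
| Video Inpainting | HQVI (240p) | RGVI | VFID | 0.1825 | # 1 |
| Video Inpainting | HQVI (240p) | RGVI w/o Ref. | PSNR | 31.60 | # 1 |
| Video Inpainting | HQVI (240p) | RGVI w/o Ref. | SSIM | 0.9559 | # 1 |
| Video Inpainting | HQVI (240p) | RGVI w/o Ref. | LPIPS | 0.0390 | # 3 |
| Video Inpainting | HQVI (240p) | RGVI w/o Ref. | VFID | 0.1868 | # 2 |
| Video Inpainting | HQVI (2K) | RGVI | PSNR | 30.10 | # 1 |
| Video Inpainting | HQVI (2K) | RGVI | LPIPS | 0.0357 | # 1 |
| Video Inpainting | HQVI (2K) | RGVI | VFID | 0.0058 | # 1 |
| Video Inpainting | HQVI (2K) | RGVI | SSIM | 0.9489 | # 2 |
| Video Inpainting | HQVI (2K) | RGVI w/o Ref. | PSNR | 29.81 | # 2 |
| Video Inpainting | HQVI (2K) | RGVI w/o Ref. | LPIPS | 0.0403 | # 2 |
| Video Inpainting | HQVI (2K) | RGVI w/o Ref. | VFID | 0.0101 | # 2 |
| Video Inpainting | HQVI (2K) | RGVI w/o Ref. | SSIM | 0.9501 | # 1 |
| Video Inpainting | HQVI (480p) | RGVI | PSNR | 30.90 | # 2 |
| Video Inpainting | HQVI (480p) | RGVI | LPIPS | 0.0342 | # 1 |
| Video Inpainting | HQVI (480p) | RGVI | VFID | 0.0311 | # 1 |
| Video Inpainting | HQVI (480p) | RGVI | SSIM | 0.9513 | # 2 |
| Video Inpainting | HQVI (480p) | RGVI w/o Ref. | PSNR | 31.19 | # 1 |
| Video Inpainting | HQVI (480p) | RGVI w/o Ref. | LPIPS | 0.0403 | # 3 |
| Video Inpainting | HQVI (480p) | RGVI w/o Ref. | VFID | 0.0404 | # 2 |
| Video Inpainting | HQVI (480p) | RGVI w/o Ref. | SSIM | 0.9534 | # 1 |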

Methods