no code implementations • 23 Nov 2024 • Ryugo Morita, Stanislav Frolov, Brian Bernhard Moser, Takahiro Shirakawa, Ko Watanabe, Andreas Dengel, Jinjia Zhou
To address this limitation, we present a novel Training-Free Chroma Key Content Generation Diffusion Model (TKG-DM), which optimizes the initial random noise to produce images with foreground objects on a specifiable color background.
no code implementations • 15 Nov 2024 • Kosuke Iwama, Ryugo Morita, Jinjia Zhou
To further reduce the sampled data while keeping the video quality, this paper explores the temporal redundancy in video CS and proposes a block based adaptive compressive sensing framework with a sampling rate (SR) control strategy.
no code implementations • 15 Nov 2024 • Mizuki Miyamoto, Ryugo Morita, Jinjia Zhou
After that, we use Visual Question Answering(VQA) to measure the relevance of the generated images to the input text, which allows for a more detailed evaluation of the alignment compared to existing methods.
no code implementations • 17 Sep 2024 • Ryugo Morita, Hitoshi Nishimura, Ko Watanabe, Andreas Dengel, Jinjia Zhou
In recent years, deep learning-based image compression, particularly through generative models, has emerged as a pivotal area of research.
no code implementations • 11 Aug 2023 • Ryugo Morita, Zhiqiang Zhang, Jinjia Zhou
We proposed a Background-Aware Text to Image synthesis and manipulation Network (BATINet), which contains two key components: Position Detect Network (PDN) and Harmonize Network (HN).
no code implementations • 25 Nov 2022 • Ryugo Morita, Zhiqiang Zhang, Man M. Ho, Jinjia Zhou
To solve these problems, we propose a novel image manipulation method that interactively edits an image using complex text instructions.