Search Results for author: Yongsheng Yu

Found 11 papers, 7 papers with code

PromptFix: You Prompt and We Fix the Photo

1 code implementation27 May 2024 Yongsheng Yu, Ziyun Zeng, Hang Hua, Jianlong Fu, Jiebo Luo

To address these limitations, we propose PromptFix, a comprehensive framework that enables diffusion models to follow human instructions to perform a wide variety of image-processing tasks.

Denoising Image Generation +1

Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models

no code implementations24 May 2024 Yongsheng Yu, Jiebo Luo

Conventional demographic inference methods have predominantly operated under the supervision of accurately labeled data, yet struggle to adapt to shifting social landscapes and diverse cultural contexts, leading to narrow specialization and limited accuracy in applications.

Zero-Shot Learning

Flow-Guided Diffusion for Video Inpainting

1 code implementation26 Nov 2023 Bohai Gu, Yongsheng Yu, Heng Fan, Libo Zhang

Video inpainting has been challenged by complex scenarios like large movements and low-light conditions.

Denoising Image Generation +2

GPT-4V(ision) as A Social Media Analysis Engine

1 code implementation13 Nov 2023 Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

Our investigation begins with a preliminary quantitative analysis for each task using existing benchmark datasets, followed by a careful review of the results and a selection of qualitative samples that illustrate GPT-4V's potential in understanding multimodal social media content.

Hallucination Hate Speech Detection +1

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text

no code implementations31 Jul 2023 Junchen Zhu, Huan Yang, Wenjing Wang, Huiguo He, Zixi Tuo, Yongsheng Yu, Wen-Huang Cheng, Lianli Gao, Jingkuan Song, Jianlong Fu, Jiebo Luo

In the basic generation, we take advantage of the pretrained image diffusion model, and adapt it to a high-quality open-domain vertical video generator for mobile devices.

Video Generation

Deficiency-Aware Masked Transformer for Video Inpainting

1 code implementation17 Jul 2023 Yongsheng Yu, Heng Fan, Libo Zhang

Firstly, we pretrain a image inpainting model DMT_img serve as a prior for distilling the video model DMT_vid, thereby benefiting the hallucination of deficiency cases.

Hallucination Image Inpainting +2

MaGIC: Multi-modality Guided Image Completion

no code implementations19 May 2023 Yongsheng Yu, Hao Wang, Tiejian Luo, Heng Fan, Libo Zhang

In this paper, we propose a novel, simple yet effective method for Multi-modal Guided Image Completion, dubbed MaGIC, which not only supports a wide range of single modality as the guidance (e. g., text, canny edge, sketch, segmentation, depth, and pose), but also adapts to arbitrarily customized combination of these modalities (i. e., arbitrary multi-modality) for image completion.

Unbiased Multi-Modality Guidance for Image Inpainting

1 code implementation25 Aug 2022 Yongsheng Yu, Dawei Du, Libo Zhang, Tiejian Luo

Image inpainting is an ill-posed problem to recover missing or damaged image content based on incomplete images with masks.

Image Inpainting Semantic Segmentation

High-Fidelity Image Inpainting with GAN Inversion

no code implementations25 Aug 2022 Yongsheng Yu, Libo Zhang, Heng Fan, Tiejian Luo

Addressing this problem, in this paper, we devise a novel GAN inversion model for image inpainting, dubbed InvertFill, mainly consisting of an encoder with a pre-modulation module and a GAN generator with F&W+ latent space.

Image Inpainting Vocal Bursts Intensity Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.