Search Results for author: Hu Ye

Found 6 papers, 6 papers with code

DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction

1 code implementation27 May 2025 Yiheng Liu, Liao Qu, Huichao Zhang, Xu Wang, Yi Jiang, Yiming Gao, Hu Ye, Xian Li, Shuai Wang, Daniel K. Du, Shu Cheng, Zehuan Yuan, Xinglong Wu

Moreover, due to the significantly reduced token count and parallel inference mechanism, our method runs nearly 2x faster inference speed compared to VAR and FlexVAR.

Image Generation

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

2 code implementations CVPR 2025 Liao Qu, Huichao Zhang, Yiheng Liu, Xu Wang, Yi Jiang, Yiming Gao, Hu Ye, Daniel K. Du, Zehuan Yuan, Xinglong Wu

This design enables direct access to both high-level semantic representations crucial for understanding tasks and fine-grained visual features essential for generation through shared indices.

Image Generation Image Reconstruction +1

IMAGDressing-v1: Customizable Virtual Dressing

1 code implementation17 Jul 2024 Fei Shen, Xin Jiang, Xin He, Hu Ye, Cong Wang, Xiaoyu Du, Zechao Li, Jinhui Tang

Latest advances have achieved realistic virtual try-on (VTON) through localized garment inpainting using latent diffusion models, significantly enhancing consumers' online shopping experience.

Denoising Image Generation +1

Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models

1 code implementation2 Jul 2024 Fei Shen, Hu Ye, Sibo Liu, Jun Zhang, Cong Wang, Xiao Han, Wei Yang

Moreover, RCDMs can generate consistent stories with a single forward inference compared to autoregressive models.

Story Visualization

Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models

1 code implementation10 Oct 2023 Fei Shen, Hu Ye, Jun Zhang, Cong Wang, Xiao Han, Wei Yang

Specifically, in the first stage, we design a simple prior conditional diffusion model that predicts the global features of the target image by mining the global alignment relationship between pose coordinates and image appearance.

Image Generation

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

4 code implementations13 Aug 2023 Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, Wei Yang

Despite the simplicity of our method, an IP-Adapter with only 22M parameters can achieve comparable or even better performance to a fully fine-tuned image prompt model.

Diffusion Personalization Tuning Free Image Generation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.