Search Results for author: Lunhao Duan

Found 4 papers, 3 papers with code

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

1 code implementation5 May 2025 Xinjie Zhang, Jintao Guo, Shanshan Zhao, Minghao Fu, Lunhao Duan, Guo-Hua Wang, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang

Despite their respective successes, these two domains have evolved independently, leading to distinct architectural paradigms: While autoregressive-based architectures have dominated multimodal understanding, diffusion-based models have become the cornerstone of image generation.

Survey Text-to-Image Generation

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

no code implementations25 Dec 2024 Lunhao Duan, Shanshan Zhao, Wenjun Yan, Yinglun Li, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Mingming Gong, Gui-Song Xia

Recently, text-to-image generation models have achieved remarkable advancements, particularly with diffusion models facilitating high-quality image synthesis from textual descriptions.

Text-to-Image Generation

Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis

1 code implementation CVPR 2024 Yiyang Chen, Lunhao Duan, Shanshan Zhao, Changxing Ding, DaCheng Tao

Equipped with LCRF and RPR, our LocoTrans is capable of learning local-consistent transformation and preserving local geometry, which benefits rotation invariance learning.

Cannot find the paper you are looking for? You can Submit a new open access paper.