no code implementations • 29 Jan 2024 • Yizheng Chen, Rengan Xie, Qi Ye, Sen yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo
Specifically, we first leverage to decouple the shading information from the generated images to reduce the impact of inconsistent lighting; then, we introduce mono prior with view-dependent transient encoding to enhance the reconstructed normal; and finally, we design a view augmentation fusion strategy that minimizes pixel-level loss in generated sparse views and semantic loss in augmented random views, resulting in view-consistent geometry and detailed textures.
no code implementations • 4 Jun 2023 • Jintao Rong, Hao Chen, Tianxiao Chen, Linlin Ou, Xinyi Yu, Yifan Liu
Prompt learning has become a popular approach for adapting large vision-language models, such as CLIP, to downstream tasks.