1 code implementation • 27 Nov 2023 • Bin Xia, Shiyin Wang, Yingfan Tao, Yitong Wang, Jiaya Jia
In the first stage, we train the MLLM to grasp the properties of image generation and editing, enabling it to generate detailed prompts.
no code implementations • ICCV 2023 • Yingfan Tao, Jingna Sun, Hao Yang, Li Chen, Xu Wang, Wenming Yang, Daniel Du, Min Zheng
LGLA consists of two core components: a Class-aware Logit Adjustment (CLA) strategy and an Adaptive Angular Weighted (AAW) loss.