no code implementations • 21 Jul 2024 • Yu Li, Yifan Chen, Gongye Liu, Jie Wu, Yujiu Yang
We find that these methods overly focus on content information and lack constraints on layout spatial structure, resulting in an imbalance of learning content-aware and graphic-aware features.
1 code implementation • 14 Jun 2024 • Chufan Shi, Cheng Yang, Yaxin Liu, Bo Shui, Junjie Wang, Mohan Jing, Linran Xu, Xinyu Zhu, Siheng Li, Yuxiang Zhang, Gongye Liu, Xiaomei Nie, Deng Cai, Yujiu Yang
We introduce a new benchmark, ChartMimic, aimed at assessing the visually-grounded code generation capabilities of large multimodal models (LMMs).
2 code implementations • 1 Dec 2023 • Gongye Liu, Menghan Xia, Yong Zhang, Haoxin Chen, Jinbo Xing, Yibo Wang, Xintao Wang, Yujiu Yang, Ying Shan
To address these challenges, we introduce StyleCrafter, a generic method that enhances pre-trained T2V models with a style control adapter, enabling video generation in any style by providing a reference image.
1 code implementation • 26 May 2023 • Gongye Liu, Haoze Sun, Jiayi Li, Fei Yin, Yujiu Yang
To derive the transitional state during the forward process, we introduce Distortion Adaptive Inversion.