no code implementations • 19 Apr 2024 • Longfei Huang, Shupeng Zhong, Xiangyu Wu, Ruoxuan Li, QingGuo Chen, Yang Yang
Subsequently, we propose a caption-level strategy for the high-quality caption data generated by image caption models and integrate them with the retrieval augmentation strategy into the template, compelling the model to generate higher-quality, better-matching, and semantically enriched captions based on the retrieval augmentation prompts.
no code implementations • 26 Mar 2024 • Dian Chao, Xin Song, Shupeng Zhong, Boyuan Wang, Xiangyu Wu, Chen Zhu, Yang Yang
In this paper, we propose a solution for improving the quality of captions generated for figures in papers.