no code implementations • 23 Nov 2023 • Shiyu Qin, Yimin Zhou, Jinpeng Wang, Bin Chen, Baoyi An, Tao Dai, Shu-Tao Xia
In this paper, we propose a progressive learning paradigm for transformer-based variable-rate image compression.
no code implementations • 25 Jan 2019 • Yimin Zhou, Yiwei Sun, Vasant Honavar
We explore the use of knowledge graphs that capture general or commonsense knowledge to augment the information extracted from images by state-of-the-art image captioning methods.