1 code implementation • 28 Nov 2022 • Xian Zhong, Zipeng Li, Shuqin Chen, Kui Jiang, Chen Chen, Mang Ye
In this paper, we introduce a novel Refined Semantic enhancement method towards Frequency Diffusion (RSFD), a captioning model that constantly perceives the linguistic representation of the infrequent tokens.
no code implementations • 16 Oct 2021 • Zhixin Sun, Xian Zhong, Shuqin Chen, Lin Li, Luo Zhong
Video captioning is a challenging task that captures different visual parts and describes them in sentences, for it requires visual and linguistic coherence.
1 code implementation • 21 Jul 2020 • Xian Zhong, Cheng Gu, Wenxin Huang, Lin Li, Shuqin Chen, Chia-Wen Lin
As a result, a meta-learner cannot be trained well in a high-dimensional parameter space to generalize to new tasks.
Ranked #17 on Few-Shot Image Classification on FC100 5-way (5-shot)