Search Results for author: Yuanen Zhou

Found 4 papers, 3 papers with code

Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning

no code implementations • 19 Jul 2023 • Zijie Song, Zhenzhen Hu, Yuanen Zhou, Ye Zhao, Richang Hong, Meng Wang

The crucial issue in this task is to model the global and the local matching between the image and different languages.

Paper
Add Code

Compact Bidirectional Transformer for Image Captioning

1 code implementation • 6 Jan 2022 • Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Huixia Ben, Meng Wang

In this paper, we introduce a Compact Bidirectional Transformer model for image captioning that can leverage bidirectional context implicitly and explicitly while the decoder can be executed parallelly.

Image Captioning Sentence

Paper
Code

Semi-Autoregressive Transformer for Image Captioning

1 code implementation • 17 Jun 2021 • Yuanen Zhou, Yong Zhang, Zhenzhen Hu, Meng Wang

To tackle this issue, non-autoregressive image captioning models have recently been proposed to significantly accelerate the speed of inference by generating all words in parallel.

Image Captioning

Paper
Code

More Grounded Image Captioning by Distilling Image-Text Matching Model

1 code implementation • CVPR 2020 • Yuanen Zhou, Meng Wang, Daqing Liu, Zhenzhen Hu, Hanwang Zhang

To improve the grounding accuracy while retaining the captioning quality, it is expensive to collect the word-region alignment as strong supervision.

Image Captioning Image-text matching +4

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.