Search Results for author: Feiqi Cao

Found 4 papers, 1 papers with code

SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering

no code implementations16 Dec 2022 Feiqi Cao, Siwen Luo, Felipe Nunez, Zean Wen, Josiah Poon, Caren Han

To make explicit teaching of the relations between the two modalities, we proposed and integrated two attention modules, namely a scene graph-based semantic relation-aware attention and a positional relation-aware attention.

Optical Character Recognition Optical Character Recognition (OCR) +2

Understanding Attention for Vision-and-Language Tasks

1 code implementation COLING 2022 Feiqi Cao, Soyeon Caren Han, Siqu Long, Changwei Xu, Josiah Poon

Attention mechanism has been used as an important component across Vision-and-Language(VL) tasks in order to bridge the semantic gap between visual and textual features.

Image Retrieval Question Answering +2

Vision-and-Language Pretrained Models: A Survey

no code implementations15 Apr 2022 Siqu Long, Feiqi Cao, Soyeon Caren Han, Haiqin Yang

Pretrained models have produced great success in both Computer Vision (CV) and Natural Language Processing (NLP).

Cannot find the paper you are looking for? You can Submit a new open access paper.