28 Sep 2022 • Xiaohan Zou, Changqiao Wu, Lele Cheng, Zhongyuan Wang
Most existing methods in vision-language retrieval match the two modalities in one of three ways: by comparing their global feature vectors, which discards fine-grained information and lacks interpretability; by detecting objects in images or videos and aligning the text with these fine-grained features, which relies on complicated model designs; or by modeling fine-grained interaction via cross-attention over visual and textual tokens, which suffers from poor efficiency.
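The efficiency gap between the first and third paradigms can be sketched numerically: global-vector matching precomputes one embedding per item and scores all pairs with a single matrix product, while token-level cross-attention requires a separate joint pass for every (image, text) pair. The shapes and the scoring function below are illustrative assumptions, not the model from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 64            # embedding dimension (assumed)
n_img, n_txt = 20, 20
n_tok = 32        # tokens per image / per text (assumed)

# Paradigm 1: global-vector matching.
# One vector per item; all n_img * n_txt scores come from a single matmul.
img_global = rng.standard_normal((n_img, d))
txt_global = rng.standard_normal((n_txt, d))
global_scores = img_global @ txt_global.T            # (n_img, n_txt)

# Paradigm 3: token-level cross-attention.
# Every (image, text) pair needs its own attention computation.
img_tokens = rng.standard_normal((n_img, n_tok, d))
txt_tokens = rng.standard_normal((n_txt, n_tok, d))

def cross_attention_score(v_tok, t_tok):
    # Each text token attends over all visual tokens (scaled dot-product
    # softmax); the pair score is the mean agreement between text tokens
    # and their attended visual context. Illustrative scoring rule only.
    attn = t_tok @ v_tok.T / np.sqrt(d)              # (n_tok, n_tok)
    attn = np.exp(attn - attn.max(axis=1, keepdims=True))
    attn /= attn.sum(axis=1, keepdims=True)
    context = attn @ v_tok                           # (n_tok, d)
    return float((t_tok * context).sum(axis=1).mean())

# Pairwise loop: O(n_img * n_txt) attention passes, versus one matmul above.
pair_scores = np.array([[cross_attention_score(img_tokens[i], txt_tokens[j])
                         for j in range(n_txt)]
                        for i in range(n_img)])

print(global_scores.shape, pair_scores.shape)        # both (20, 20)
```

In a retrieval setting the global embeddings can additionally be indexed offline, whereas the cross-attention scores cannot be precomputed for unseen queries, which is the efficiency drawback the abstract refers to.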