2 code implementations • 15 Nov 2017 • Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, Mingliang Xu, Yi-Dong Shen
In this paper, we propose a new system to discriminatively embed the image and text to a shared visual-textual space.
Ranked #1 on Cross-Modal Retrieval on CUHK-PEDES