Multitask Text-to-Visual Embedding with Titles and Clickthrough Data

30 May 2019Pranav AggarwalZhe LinBaldo FaietaSaeid Motiian

Text-visual (or called semantic-visual) embedding is a central problem in vision-language research. It typically involves mapping of an image and a text description to a common feature space through a CNN image encoder and a RNN language encoder... (read more)

PDF Abstract


No code implementations yet. Submit your code now

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.