Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval

23 Aug 2018
Niluthpol Chowdhury Mithun, Rameswar Panda, Evangelos E. Papalexakis, Amit K. Roy-Chowdhury

Cross-modal retrieval between visual data and natural language description remains a long-standing challenge in multimedia. While recent image-text retrieval methods offer great promise by learning deep representations aligned across modalities, most of these methods are plagued by the issue of training with small-scale datasets covering a limited number of images with ground-truth sentences...
