1 code implementation • 2 Dec 2020 • Ricardo Guerrero, Hai Xuan Pham, Vladimir Pavlovic
A key to making computational food analysis (CFA) possible is multi-modal shared representation learning, which aims to create a joint representation of the multiple views (text and image) of the data.
Ranked #5 on Cross-Modal Retrieval on Recipe1M
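
As a rough illustration of what a joint representation makes possible, the sketch below maps a text vector and several image vectors into one shared space and ranks the images by cosine similarity to the text. It is not the authors' model. The linear projections `W_TEXT` and `W_IMAGE`, the toy feature vectors, and all function names are hypothetical and chosen only to keep the example small. In practice the projections would be learned encoders.

```python
import math

def l2_normalize(vec):
    # Scale a vector to unit length so that dot products equal cosine similarity.
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def embed(matrix, vec):
    # Linear projection into the shared space, followed by L2 normalization.
    projected = [sum(w * x for w, x in zip(row, vec)) for row in matrix]
    return l2_normalize(projected)

def cosine(a, b):
    # Inputs are assumed to be unit-length already.
    return sum(x * y for x, y in zip(a, b))

def rank_images(text_emb, image_embs):
    # Return image indices ordered from most to least similar to the text.
    scores = [cosine(text_emb, e) for e in image_embs]
    return sorted(range(len(image_embs)), key=lambda i: scores[i], reverse=True)

# Hypothetical, hand-picked projections (assumption: a real system learns these).
W_TEXT = [[1.0, 0.0, 0.0],
          [0.0, 1.0, 0.0]]   # 3-d text features -> 2-d shared space
W_IMAGE = [[0.0, 1.0],
           [1.0, 0.0]]       # 2-d image features -> 2-d shared space

# Toy features standing in for one recipe text and three candidate images.
recipe_text = [1.0, 0.0, 5.0]
images = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

text_emb = embed(W_TEXT, recipe_text)
image_embs = [embed(W_IMAGE, img) for img in images]
ranking = rank_images(text_emb, image_embs)
```

Because both modalities end up as unit vectors in the same space, a single similarity function serves for text-to-image and image-to-text retrieval alike, which is the setting that cross-modal retrieval benchmarks evaluate.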