no code implementations • 27 Oct 2019 • Gal Sadeh Kenigsfield, Ran El-Yaniv
Our model relies on a shared text--image representation of subject-verb-object relationships appearing in the text, and object interactions in images.
Graph Generation Relationship Detection +2