Search Results for author: Alon Mendelson

Found 1 papers, 0 papers with code

Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs

no code implementations • 10 May 2023 • Roei Herzig, Alon Mendelson, Leonid Karlinsky, Assaf Arbelle, Rogerio Feris, Trevor Darrell, Amir Globerson

For the visual side, we incorporate a special "SG Component" in the image transformer trained to predict SG information, while for the textual side, we utilize SGs to generate fine-grained captions that highlight different compositional aspects of the scene.

Ranked #24 on Visual Reasoning on Winoground

Scene Understanding Visual Reasoning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.