Scene Graph Generation

71 papers with code • 4 benchmarks • 5 datasets

A scene graph is a structured representation of an image: nodes correspond to object bounding boxes with their object categories, and edges correspond to the pairwise relationships between those objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing
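As a rough illustration of the definition above, a scene graph can be held in a small container of labeled boxes and (subject, predicate, object) edges. This is a minimal sketch, not the representation used by any of the listed repositories: the class names `SceneObject` and `SceneGraph`, the method names, the box convention (x_min, y_min, x_max, y_max), and the example coordinates are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class SceneObject:
    """A node: one object with a category label and a bounding box."""
    category: str
    # Assumed convention: (x_min, y_min, x_max, y_max) in pixels.
    box: Tuple[float, float, float, float]


@dataclass
class SceneGraph:
    """Nodes are objects; directed edges are pairwise relationships."""
    objects: List[SceneObject] = field(default_factory=list)
    # Each edge is (subject index, predicate label, object index).
    relations: List[Tuple[int, str, int]] = field(default_factory=list)

    def add_object(self, category: str,
                   box: Tuple[float, float, float, float]) -> int:
        """Add a node and return its index."""
        self.objects.append(SceneObject(category, box))
        return len(self.objects) - 1

    def add_relation(self, subject: int, predicate: str, obj: int) -> None:
        """Add a directed edge between two existing nodes."""
        n = len(self.objects)
        if not (0 <= subject < n and 0 <= obj < n):
            raise IndexError("relation refers to an unknown object index")
        self.relations.append((subject, predicate, obj))

    def triplets(self) -> List[Tuple[str, str, str]]:
        """Return edges as (subject category, predicate, object category)."""
        return [
            (self.objects[s].category, p, self.objects[o].category)
            for s, p, o in self.relations
        ]


# Hypothetical example image: a person sitting on a beach.
graph = SceneGraph()
person = graph.add_object("person", (40.0, 30.0, 120.0, 260.0))
beach = graph.add_object("beach", (0.0, 200.0, 640.0, 480.0))
graph.add_relation(person, "sit on", beach)
print(graph.triplets())
```

Storing edges as index triples keeps each relationship tied to a specific box, so two instances of the same category remain distinct nodes.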



Most implemented papers

Learning to Compose Dynamic Tree Structures for Visual Contexts

KaihuaTang/Scene-Graph-Benchmark.pytorch CVPR 2019

We propose to compose dynamic tree structures that place the objects in an image into a visual context, helping visual reasoning tasks such as scene graph generation and visual Q&A.

Unbiased Scene Graph Generation from Biased Training

KaihuaTang/Scene-Graph-Benchmark.pytorch CVPR 2020

Today's scene graph generation (SGG) task is still far from practical, mainly due to the severe training bias, e.g., collapsing diverse "human walk on / sit on / lay on beach" into "human on beach".

Scene Graph Generation by Iterative Message Passing

microsoft/scene_graph_benchmark CVPR 2017

In this work, we explicitly model the objects and their relationships using scene graphs, a visually-grounded graphical structure of an image.

Graph R-CNN for Scene Graph Generation

jwyang/graph-rcnn.pytorch ECCV 2018

We propose a novel scene graph generation model called Graph R-CNN that is both effective and efficient at detecting objects and their relations in images.

Graphical Contrastive Losses for Scene Graph Parsing

dmlc/dgl CVPR 2019

The first, Entity Instance Confusion, occurs when the model confuses multiple instances of the same type of entity (e.g., multiple cups).

Knowledge-Embedded Routing Network for Scene Graph Generation

yuweihao/KERN CVPR 2019

More specifically, we show that the statistical correlations between objects appearing in images and their relationships can be explicitly represented by a structured knowledge graph, and a routing mechanism is learned to propagate messages through the graph to explore their interactions.

Relation Transformer Network

rajatkoner08/rtn 13 Apr 2020

In this work, we propose a novel transformer formulation for scene graph generation and relation prediction.

Learning Visual Commonsense for Robust Scene Graph Generation

ZhecanJamesWang/GLAT_SGG ECCV 2020

Scene graph generation models understand the scene through object and predicate recognition, but are prone to mistakes due to the challenges of perception in the wild.

Learning and Reasoning with the Graph Structure Representation in Robotic Surgery

mobarakol/Surgical_SceneGraph_Generation 7 Jul 2020

Learning to infer graph representations and perform spatial reasoning in a complex surgical environment can play a vital role in surgical scene understanding in robotic surgery.