Scene Graph Generation

110 papers with code • 5 benchmarks • 7 datasets

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Libraries

Use these libraries to find Scene Graph Generation models and implementations

EGTR: Extracting Graph from Transformer for Scene Graph Generation

naver-ai/egtr 2 Apr 2024

We propose a lightweight one-stage SGG model that extracts the relation graph from the various relationships learned in the multi-head self-attention layers of the DETR decoder.

12
02 Apr 2024

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

mlvlab/speaq 26 Mar 2024

Groupwise Query Specialization trains a specialized query by dividing queries and relations into disjoint groups and directing a query in a specific query group solely toward relations in the corresponding relation group.

8
26 Mar 2024

HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation

zhangce01/HiKER-SGG 18 Mar 2024

Being able to understand visual scenes is a precursor for many downstream tasks, including autonomous driving, robotics, and other vision-based approaches.

27
18 Mar 2024

SGTR+: End-to-end Scene Graph Generation with Transformer

scarecrow0/sgtr 23 Jan 2024

Moreover, we design a graph assembling module to infer the connectivity of the bipartite scene graph based on our entity-aware structure, enabling us to generate the scene graph in an end-to-end manner.

64
23 Jan 2024

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

rlqja1107/torch-st-sgg 18 Jan 2024

To this end, we introduce a Self-Training framework for SGG (ST-SGG) that assigns pseudo-labels for unannotated triplets based on which the SGG models are trained.

14
18 Jan 2024

Panoptic Video Scene Graph Generation

lilydaytoy/openpvsg CVPR 2023

PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.

51
28 Nov 2023

VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation

franciszzj/VLPrompt 27 Nov 2023

Panoptic Scene Graph Generation (PSG) aims at achieving a comprehensive image understanding by simultaneously segmenting objects and predicting relations among objects.

9
27 Nov 2023

Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge

bowen-upenn/scene_graph_commonsense 21 Nov 2023

This work presents an enhanced approach to generating scene graphs by incorporating a relationship hierarchy and commonsense knowledge.

11
21 Nov 2023

NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment

jaleedkhan/neusire Semantic Web 2023

We present a loosely-coupled neuro-symbolic visual understanding and reasoning framework that employs a DNN-based pipeline for object detection and multi-modal pairwise relationship prediction for scene graph generation and leverages common sense knowledge in heterogenous knowledge graphs to enrich scene graphs for improved downstream reasoning.

7
05 Nov 2023

LLM4SGG: Large Language Model for Weakly Supervised Scene Graph Generation

rlqja1107/torch-LLM4SGG 16 Oct 2023

Weakly-Supervised Scene Graph Generation (WSSGG) research has recently emerged as an alternative to the fully-supervised approach that heavily relies on costly annotations.

55
16 Oct 2023