Search Results for author: Rongjie Li

Found 7 papers, 4 papers with code

Pose-aware Multi-level Feature Network for Human Object Interaction Detection

1 code implementation ICCV 2019 Bo Wan, Desen Zhou, Yongfei Liu, Rongjie Li, Xuming He

Reasoning human object interactions is a core problem in human-centric scene understanding and detecting such relations poses a unique challenge to vision systems due to large variations in human-object configurations, multiple co-occurring relation instances and subtle visual difference between relation categories.

Human-Object Interaction Detection Object +2

SGTR: End-to-end Scene Graph Generation with Transformer

1 code implementation CVPR 2022 Rongjie Li, Songyang Zhang, Xuming He

Scene Graph Generation (SGG) remains a challenging visual understanding task due to its compositional property.

graph construction Graph Generation +1

SGTR+: End-to-end Scene Graph Generation with Transformer

1 code implementation23 Jan 2024 Rongjie Li, Songyang Zhang, Xuming He

Moreover, we design a graph assembling module to infer the connectivity of the bipartite scene graph based on our entity-aware structure, enabling us to generate the scene graph in an end-to-end manner.

graph construction Graph Generation +1

Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning

no code implementations1 Apr 2024 Rongjie Li, Yu Wu, Xuming He

Generative vision-language models (VLMs) have shown impressive performance in zero-shot vision-language tasks like image captioning and visual question answering.

Image Captioning Instruction Following +5

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

no code implementations1 Apr 2024 Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He

Scene graph generation (SGG) aims to parse a visual scene into an intermediate graph representation for downstream reasoning tasks.

Graph Generation Relation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.