Search Results for author: Danyang Tu

Found 7 papers, 2 papers with code

Joint Gaze-Location and Gaze-Object Detection

no code implementations • 26 Aug 2023 • Danyang Tu, Wei Shen, Wei Sun, Xiongkuo Min, Guangtao Zhai

In contrast, we reframe the gaze-following detection task as simultaneously detecting human head locations and their gaze followings, aiming to jointly detect the human gaze location and gaze object in a unified, single-stage pipeline.

Object object-detection +1

Agglomerative Transformer for Human-Object Interaction Detection

no code implementations • ICCV 2023 • Danyang Tu, Wei Sun, Guangtao Zhai, Wei Shen

We propose an agglomerative Transformer (AGER) that enables Transformer-based human-object interaction (HOI) detectors to flexibly exploit extra instance-level cues in a single-stage and end-to-end manner for the first time.

Clustering Human-Object Interaction Detection +1

Masked Autoencoders as Image Processors

1 code implementation • 30 Mar 2023 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Ranked #4 on Image Defocus Deblurring on DPD (Dual-view)

Deblurring Image Defocus Deblurring +2

Video-based Human-Object Interaction Detection from Tubelet Tokens

no code implementations • 4 Jun 2022 • Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen

We present a novel vision Transformer, named TUTOR, which learns tubelet tokens that serve as highly abstracted spatiotemporal representations for video-based human-object interaction (V-HOI) detection.

Human-Object Interaction Detection

Saliency in Augmented Reality

1 code implementation • 18 Apr 2022 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Saliency Prediction

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations • 20 Mar 2022 • Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer that progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4

End-to-End Human-Gaze-Target Detection with Transformers

no code implementations • CVPR 2022 • Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

In contrast, we redefine the HGT detection task as simultaneously detecting human head locations and their gaze targets.

Gaze Prediction object-detection +2
