Search Results for author: Aixi Zhang

Found 5 papers, 3 papers with code

TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding

no code implementations • 5 Aug 2021 • Dailan He, Yusheng Zhao, Junyu Luo, Tianrui Hui, Shaofei Huang, Aixi Zhang, Si Liu

Existing works usually adopt dynamic graph networks to indirectly model the intra/inter-modal interactions, making the model difficult to distinguish the referred object from distractors due to the monolithic representations of visual and linguistic contents.

Relation Sentence +1

Paper
Add Code

Mining the Benefits of Two-stage and One-stage HOI Detection

1 code implementation • NeurIPS 2021 • Aixi Zhang, Yue Liao, Si Liu, Miao Lu, Yongliang Wang, Chen Gao, Xiaobo Li

To this end, we propose a novel one-stage framework with disentangling human-object detection and interaction classification in a cascade manner.

Ranked #7 on Human-Object Interaction Detection on V-COCO

Classification Human-Object Interaction Detection +5

Paper
Code

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection

1 code implementation • CVPR 2022 • Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu

In this paper, we reveal and address the disadvantages of the conventional query-driven HOI detectors from the two aspects.

Ranked #12 on Human-Object Interaction Detection on HICO-DET

Human-Object Interaction Detection Position +1

Paper
Code

Video Background Music Generation: Dataset, Method and Evaluation

1 code implementation • ICCV 2023 • Le Zhuo, Zhaokai Wang, Baisen Wang, Yue Liao, Chenxi Bao, Stanley Peng, Songhao Han, Aixi Zhang, Fei Fang, Si Liu

We believe our dataset, benchmark model, and evaluation metric will boost the development of video background music generation.

Music Generation Representation Learning +1

Paper
Code

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

no code implementations • 5 Aug 2023 • Qiaosong Qi, Le Zhuo, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan

To address these limitations, we present a novel cascaded motion diffusion model, DiffDance, designed for high-resolution, long-form dance generation.

Representation Learning Super-Resolution

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.