Search Results for author: Qiong Cao

Found 25 papers, 11 papers with code

Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution

1 code implementation20 Dec 2024 Wentao Tan, Qiong Cao, Yibing Zhan, Chao Xue, Changxing Ding

To address these issues, we propose a novel multimodal self-evolution framework that enables the model to autonomously generate high-quality questions and answers using only unannotated images.

Answer Generation Image Captioning

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

no code implementations17 Oct 2024 Changcheng Xiao, Qiong Cao, Yujie Zhong, Xiang Zhang, Tao Wang, Canqun Yang, Long Lan

In addition, we introduce a novel task called Referring Multi-Object Tracking and Segmentation (RMOTS) and construct a new dataset named Ref-KITTI Segmentation.

Multi-Object Tracking and Segmentation Referring Multi-Object Tracking +3

Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models

no code implementations26 Jun 2024 Xiaolin Hong, Hongwei Yi, Fazhi He, Qiong Cao

To address this limitation, we explore the potential of diffusion models that simultaneously consider all input humans and the floor plan to generate plausible 3D scenes.

Collision Avoidance Human-Object Interaction Detection +2

PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions

no code implementations20 Jun 2024 Sihan Ma, Jing Zhang, Qiong Cao, DaCheng Tao

We evaluated 60 representative models, including top-down, bottom-up, heatmap-based, regression-based, and classification-based methods, across three datasets for human and animal pose estimation.

Animal Pose Estimation Autonomous Driving +1

Towards Variable and Coordinated Holistic Co-Speech Motion Generation

no code implementations CVPR 2024 Yifei Liu, Qiong Cao, Yandong Wen, Huaiguang Jiang, Changxing Ding

This paper addresses the problem of generating lifelike holistic co-speech motions for 3D avatars, focusing on two key aspects: variability and coordination.

Motion Generation Quantization

GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction

1 code implementation29 Jun 2023 Sihan Ma, Qiong Cao, Hongwei Yi, Jing Zhang, DaCheng Tao

Demystifying complex human-ground interactions is essential for accurate and realistic 3D human motion reconstruction from RGB videos, as it ensures consistency between the humans and the ground plane.

MotionTrack: Learning Motion Predictor for Multiple Object Tracking

no code implementations5 Jun 2023 Changcheng Xiao, Qiong Cao, Yujie Zhong, Long Lan, Xiang Zhang, Zhigang Luo, DaCheng Tao

This challenge arises from two main factors: the insufficient discriminability of ReID features and the predominant utilization of linear motion models in MOT.

motion prediction Multi-Object Tracking +2

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

1 code implementation13 Jan 2023 Jie Gui, Tuo Chen, Jing Zhang, Qiong Cao, Zhenan Sun, Hao Luo, DaCheng Tao

Deep supervised learning algorithms typically require a large volume of labeled data to achieve satisfactory performance.

Self-Supervised Learning

Learning Sequence Representations by Non-local Recurrent Neural Memory

1 code implementation20 Jul 2022 Wenjie Pei, Xin Feng, Canmiao Fu, Qiong Cao, Guangming Lu, Yu-Wing Tai

The key challenge of sequence representation learning is to capture the long-range temporal dependencies.

Representation Learning

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers

no code implementations CVPR 2022 Xianing Chen, Qiong Cao, Yujie Zhong, Jing Zhang, Shenghua Gao, DaCheng Tao

Our DearKD is a two-stage framework that first distills the inductive biases from the early intermediate layers of a CNN and then gives the transformer full play by training without distillation.

Knowledge Distillation

MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames

1 code implementation24 Apr 2022 Xiangyu Zhu, Tingting Liao, Jiangjing Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei

In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions.

Camera Calibration

FEATURE-AUGMENTED HYPERGRAPH NEURAL NETWORKS

no code implementations29 Sep 2021 Xueqi Ma, Pan Li, Qiong Cao, James Bailey, Yue Gao

In FAHGNN, we explore the influence of node features for the expressive power of GNNs and augment features by introducing common features and personal features to model information.

Node Classification Representation Learning

VGGFace2: A dataset for recognising faces across pose and age

24 code implementations23 Oct 2017 Qiong Cao, Li Shen, Weidi Xie, Omkar M. Parkhi, Andrew Zisserman

The dataset was collected with three goals in mind: (i) to have both a large number of identities and also a large number of images for each identity; (ii) to cover a large range of pose, age and ethnicity; and (iii) to minimize the label noise.

 Ranked #1 on Face Verification on IJB-C (training dataset metric)

Face Recognition Face Verification +1

Template Adaptation for Face Verification and Identification

no code implementations12 Mar 2016 Nate Crosswhite, Jeffrey Byrne, Omkar M. Parkhi, Chris Stauffer, Qiong Cao, Andrew Zisserman

Face recognition performance evaluation has traditionally focused on one-to-one verification, popularized by the Labeled Faces in the Wild dataset for imagery and the YouTubeFaces dataset for videos.

Face Identification Face Recognition +3

Cannot find the paper you are looking for? You can Submit a new open access paper.