Search Results for author: Guan Huang

Found 33 papers, 12 papers with code

WebFace260M: A Benchmark for Million-Scale Deep Face Recognition

no code implementations21 Apr 2022 Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Dalong Du, Jiwen Lu, Jie zhou

For a comprehensive evaluation of face matchers, three recognition tasks are performed under standard, masked and unbiased settings, respectively.

Face Recognition

MVSTER: Epipolar Transformer for Efficient Multi-View Stereo

1 code implementation15 Apr 2022 XiaoFeng Wang, Zheng Zhu, Fangbo Qin, Yun Ye, Guan Huang, Xu Chi, Yijia He, Xingang Wang

Therefore, we present MVSTER, which leverages the proposed epipolar Transformer to learn both 2D semantics and 3D spatial associations efficiently.

HFT: Lifting Perspective Representations via Hybrid Feature Transformation

1 code implementation11 Apr 2022 Jiayu Zou, Junrui Xiao, Zheng Zhu, JunJie Huang, Guan Huang, Dalong Du, Xingang Wang

In order to reap the benefits and avoid the drawbacks of CBFT and CFFT, we propose a novel framework with a Hybrid Feature Transformation module (HFT).

Autonomous Driving Decision Making +1

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation

1 code implementation7 Apr 2022 Yi Wei, Linqing Zhao, Wenzhao Zheng, Zheng Zhu, Yongming Rao, Guan Huang, Jiwen Lu, Jie zhou

In this paper, we propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.

Autonomous Driving Monocular Depth Estimation

BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection

1 code implementation31 Mar 2022 JunJie Huang, Guan Huang

Single frame data contains finite information which limits the performance of the existing vision-based multi-camera 3D object detection paradigms.

3D Object Detection Frame

CAFE: Learning to Condense Dataset by Aligning Features

1 code implementation3 Mar 2022 Kai Wang, Bo Zhao, Xiangyu Peng, Zheng Zhu, Shuo Yang, Shuo Wang, Guan Huang, Hakan Bilen, Xinchao Wang, Yang You

Dataset condensation aims at reducing the network training effort through condensing a cumbersome training set into a compact synthetic one.

Joint Demand Prediction for Multimodal Systems: A Multi-task Multi-relational Spatiotemporal Graph Neural Network Approach

no code implementations15 Dec 2021 Yuebing Liang, Guan Huang, Zhan Zhao

Despite some recent efforts, existing approaches to multimodal demand prediction are generally not flexible enough to account for multiplex networks with diverse spatial units and heterogeneous spatiotemporal correlations across different modes.

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

1 code implementation2 Dec 2021 Yongming Rao, Wenliang Zhao, Guangyi Chen, Yansong Tang, Zheng Zhu, Guan Huang, Jie zhou, Jiwen Lu

In this work, we present a new framework for dense prediction by implicitly and explicitly leveraging the pre-trained knowledge from CLIP.

Instance Segmentation Language Modelling +4

Hand gesture detection in tests performed by older adults

no code implementations27 Oct 2021 Guan Huang, Son N. Tran, Quan Bai, Jane Alty

We have implemented a hand gesture detector to detect the gestures in the hand movement tests and our detection mAP is 0. 782 which is better than the state-of-the-art.

Face-NMS: A Core-set Selection Approach for Efficient Face Recognition

no code implementations10 Sep 2021 Yunze Chen, JunJie Huang, Jiagang Zhu, Zheng Zhu, Tian Yang, Guan Huang, Dalong Du

The current research on this problem mainly focuses on designing an efficient Fully-connected layer (FC) to reduce GPU memory consumption caused by a large number of identities.

Face Recognition Object Detection

Structure-Aware Face Clustering on a Large-Scale Graph With 107 Nodes

1 code implementation CVPR 2021 Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie zhou

To address the dilemma of large-scale training and efficient inference, we propose the STructure-AwaRe Face Clustering (STAR-FC) method.

Face Clustering Graph Clustering

SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

no code implementations6 Apr 2021 Jiabin Zhang, Zheng Zhu, Jiwen Lu, JunJie Huang, Guan Huang, Jie zhou

To make a better trade-off between accuracy and efficiency, we propose a novel multi-person pose estimation framework, SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation (SIMPLE).

Human Detection Multi-Person Pose Estimation

Structure-Aware Face Clustering on a Large-Scale Graph with $\bf{10^{7}}$ Nodes

no code implementations24 Mar 2021 Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie zhou

To address the dilemma of large-scale training and efficient inference, we propose the STructure-AwaRe Face Clustering (STAR-FC) method.

Face Clustering Graph Clustering

WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

no code implementations CVPR 2021 Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie zhou

In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol.

 Ranked #1 on Face Verification on IJB-C (dataset metric)

Face Recognition Face Verification

AID: Pushing the Performance Boundary of Human Pose Estimation with Information Dropping Augmentation

2 code implementations17 Aug 2020 Junjie Huang, Zheng Zhu, Guan Huang, Dalong Du

As AID successfully pushes the performance boundary of human pose estimation problem by considerable margin and sets a new state-of-the-art, we hope AID to be a regular configuration for training human pose estimators.

Multi-Person Pose Estimation

The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation

2 code implementations CVPR 2020 Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang, Dalong Du

Specifically, by investigating the standard data processing in state-of-the-art approaches mainly including coordinate system transformation and keypoint format transformation (i. e., encoding and decoding), we find that the results obtained by common flipping strategy are unaligned with the original ones in inference.

Pose Estimation

Multi-Stage HRNet: Multiple Stage High-Resolution Network for Human Pose Estimation

no code implementations14 Oct 2019 Junjie Huang, Zheng Zhu, Guan Huang

Human pose estimation are of importance for visual understanding tasks such as action recognition and human-computer interaction.

Action Recognition Multi-Person Pose Estimation

High Performance Visual Object Tracking with Unified Convolutional Networks

no code implementations26 Aug 2019 Zheng Zhu, Wei Zou, Guan Huang, Dalong Du, Chang Huang

In this paper, we propose an end-to-end framework to learn the convolutional features and perform the tracking process simultaneously, namely, a unified convolutional tracker (UCT).

Visual Object Tracking

FastPose: Towards Real-time Pose Estimation and Tracking via Scale-normalized Multi-task Networks

no code implementations15 Aug 2019 Jiabin Zhang, Zheng Zhu, Wei Zou, Peng Li, Yanwei Li, Hu Su, Guan Huang

Given the results of MTN, we adopt an occlusion-aware Re-ID feature strategy in the pose tracking module, where pose information is utilized to infer the occlusion state to make better use of Re-ID feature.

Human Detection Multi-Person Pose Estimation +3

State-aware Re-identification Feature for Multi-target Multi-camera Tracking

no code implementations4 Jun 2019 Peng Li, Jiabin Zhang, Zheng Zhu, Yanwei Li, Lu Jiang, Guan Huang

Multi-target Multi-camera Tracking (MTMCT) aims to extract the trajectories from videos captured by a set of cameras.

Action Machine: Rethinking Action Recognition in Trimmed Videos

no code implementations14 Dec 2018 Jiagang Zhu, Wei Zou, Liang Xu, Yiming Hu, Zheng Zhu, Manyu Chang, Jun-Jie Huang, Guan Huang, Dalong Du

On NTU RGB-D, Action Machine achieves the state-of-the-art performance with top-1 accuracies of 97. 2% and 94. 3% on cross-view and cross-subject respectively.

Action Recognition Multimodal Activity Recognition +2

Attention-guided Unified Network for Panoptic Segmentation

no code implementations CVPR 2019 Yanwei Li, Xinze Chen, Zheng Zhu, Lingxi Xie, Guan Huang, Dalong Du, Xingang Wang

This paper studies panoptic segmentation, a recently proposed task which segments foreground (FG) objects at the instance level as well as background (BG) contents at the semantic level.

Panoptic Segmentation

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

4 code implementations21 Apr 2018 Xiao Ma, Liqin Zhao, Guan Huang, Zhi Wang, Zelin Hu, Xiaoqiang Zhu, Kun Gai

To the best of our knowledge, this is the first public dataset which contains samples with sequential dependence of click and conversion labels for CVR modeling.

Click-Through Rate Prediction Recommendation Systems +2

UCT: Learning Unified Convolutional Networks for Real-time Visual Tracking

no code implementations10 Nov 2017 Zheng Zhu, Guan Huang, Wei Zou, Dalong Du, Chang Huang

Convolutional neural networks (CNN) based tracking approaches have shown favorable performance in recent benchmarks.

Real-Time Visual Tracking

Tag-Weighted Topic Model For Large-scale Semi-Structured Documents

no code implementations30 Jul 2015 Shuangyin Li, Jiefei Li, Guan Huang, Ruiyang Tan, Rong Pan

We propose a novel method to model the SSDs by a so-called Tag-Weighted Topic Model (TWTM).

Distributed Computing TAG +2

Cannot find the paper you are looking for? You can Submit a new open access paper.