no code implementations • 11 Dec 2024 • Wang Chen, Guan Huang, Jintao Ke
This study investigates the development dilemma of ride-sharing services using real-world mobility datasets from nine cities and calibrated customers' price and detour elasticity.
no code implementations • 29 Nov 2024 • Chaojun Ni, Guosheng Zhao, XiaoFeng Wang, Zheng Zhu, Wenkang Qin, Guan Huang, Chen Liu, Yuyin Chen, Yida Wang, Xueyang Zhang, Yifei Zhan, Kun Zhan, Peng Jia, Xianpeng Lang, Xingang Wang, Wenjun Mei
This is complemented by a progressive data update strategy designed to ensure high-quality rendering for more complex maneuvers.
no code implementations • 17 Oct 2024 • Guosheng Zhao, Chaojun Ni, XiaoFeng Wang, Zheng Zhu, Xueyang Zhang, Yida Wang, Guan Huang, Xinze Chen, Boyuan Wang, Youyi Zhang, Wenjun Mei, Xingang Wang
Contemporary sensor simulation methods, such as NeRF and 3DGS, rely predominantly on conditions closely aligned with training data distributions, which are largely confined to forward-driving scenarios.
1 code implementation • 6 May 2024 • Zheng Zhu, XiaoFeng Wang, Wangbo Zhao, Chen Min, Nianchen Deng, Min Dou, Yuqi Wang, Botian Shi, Kai Wang, Chi Zhang, Yang You, Zhaoxiang Zhang, Dawei Zhao, Liang Xiao, Jian Zhao, Jiwen Lu, Guan Huang
General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.
no code implementations • 11 Mar 2024 • Guosheng Zhao, XiaoFeng Wang, Zheng Zhu, Xinze Chen, Guan Huang, Xiaoyi Bao, Xingang Wang
DriveDreamer-2 is the first world model to generate customized driving videos, it can generate uncommon driving videos (e. g., vehicles abruptly cut in) in a user-friendly manner.
2 code implementations • 24 Jan 2024 • Zhelin Li, Rami Mrad, Runxian Jiao, Guan Huang, Jun Shan, Shibing Chu, Yuanping Chen
Efficiently generating energetically stable crystal structures has long been a challenge in material design, primarily due to the immense arrangement of atoms in a crystal lattice.
no code implementations • 18 Jan 2024 • XiaoFeng Wang, Zheng Zhu, Guan Huang, Boyuan Wang, Xinze Chen, Jiwen Lu
World models play a crucial role in understanding and predicting the dynamics of the world, which is essential for video generation.
no code implementations • 18 Sep 2023 • XiaoFeng Wang, Zheng Zhu, Guan Huang, Xinze Chen, Jiagang Zhu, Jiwen Lu
The established world model holds immense potential for the generation of high-quality driving videos, and driving policies for safe maneuvering.
no code implementations • 20 Mar 2023 • Yuebing Liang, Fangyi Ding, Guan Huang, Zhan Zhao
For station-based BSSs, this means planning new stations based on existing ones over time, which requires prediction of the number of trips generated by these new stations across the whole system.
1 code implementation • 1 Jan 2023 • Boyu Zhang, Wenbo Xu, Zheng Zhu, Guan Huang
Specifically, it employs a neural representation to capture the scene distribution in the static background and a 6D-input NeRF to represent dynamic objects, respectively.
1 code implementation • CVPR 2023 • XiaoFeng Wang, Zheng Zhu, Yunpeng Zhang, Guan Huang, Yun Ye, Wenbo Xu, Ziwei Chen, Xingang Wang
To mitigate the problem, we propose the Autonomous-driving StreAming Perception (ASAP) benchmark, which is the first benchmark to evaluate the online performance of vision-centric perception in autonomous driving.
1 code implementation • 30 Nov 2022 • JunJie Huang, Guan Huang
We offer an example of deployment to the TensorRT backend in branch dev2. 0 and show how fast the BEVDet paradigm can be processed on it.
no code implementations • 16 Nov 2022 • Yuebing Liang, Guan Huang, Zhan Zhao
Existing methods for bike sharing demand prediction are mostly based on its own historical demand variation, essentially regarding it as a closed system and neglecting the interaction between different transportation modes.
no code implementations • 31 Oct 2022 • Wenli Yang, Guan Huang, Renjie Li, Jiahao Yu, Yanyu Chen, Quan Bai, Beyong Kang
Convolutional neural network (CNN) models have seen advanced improvements in performance in various domains, but lack of interpretability is a major barrier to assurance and regulation during operation for acceptance and deployment of AI-assisted applications.
1 code implementation • 22 Aug 2022 • Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Jie zhou, Jiwen Lu
First, we extract multi-scale features and generate the perspective object proposals on each monocular image.
1 code implementation • 19 Aug 2022 • XiaoFeng Wang, Zheng Zhu, Guan Huang, Xu Chi, Yun Ye, Ziwei Chen, Xingang Wang
In contrast, multi-frame depth estimation methods improve the depth accuracy thanks to the success of Multi-View Stereo (MVS), which directly makes use of geometric constraints.
1 code implementation • 6 Aug 2022 • Chaoqiang Zhao, Youmin Zhang, Matteo Poggi, Fabio Tosi, Xianda Guo, Zheng Zhu, Guan Huang, Yang Tang, Stefano Mattoccia
Self-supervised monocular depth estimation is an attractive solution that does not require hard-to-source depth labels for training.
Ranked #1 on
Monocular Depth Estimation
on KITTI
no code implementations • 6 Jul 2022 • Renjie Li, Xinyi Wang, Guan Huang, Wenli Yang, Kaining Zhang, Xiaotong Gu, Son N. Tran, Saurabh Garg, Jane Alty, Quan Bai
Deep supervision, or known as 'intermediate supervision' or 'auxiliary supervision', is to add supervision at hidden layers of a neural network.
1 code implementation • 19 May 2022 • Yunpeng Zhang, Zheng Zhu, Wenzhao Zheng, JunJie Huang, Guan Huang, Jie zhou, Jiwen Lu
Specifically, BEVerse first performs shared feature extraction and lifting to generate 4D BEV representations from multi-timestamp and multi-view images.
Ranked #15 on
Robust Camera Only 3D Object Detection
on nuScenes-C
1 code implementation • ICCV 2021 • Xianda Guo, Zheng Zhu, Tian Yang, Beibei Lin, JunJie Huang, Jiankang Deng, Guan Huang, Jie zhou, Jiwen Lu
To the best of our knowledge, this is the first large-scale dataset for gait recognition in the wild.
no code implementations • 21 Apr 2022 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Dalong Du, Jiwen Lu, Jie zhou
For a comprehensive evaluation of face matchers, three recognition tasks are performed under standard, masked and unbiased settings, respectively.
1 code implementation • 15 Apr 2022 • XiaoFeng Wang, Zheng Zhu, Fangbo Qin, Yun Ye, Guan Huang, Xu Chi, Yijia He, Xingang Wang
Therefore, we present MVSTER, which leverages the proposed epipolar Transformer to learn both 2D semantics and 3D spatial associations efficiently.
1 code implementation • 11 Apr 2022 • Jiayu Zou, Junrui Xiao, Zheng Zhu, JunJie Huang, Guan Huang, Dalong Du, Xingang Wang
In order to reap the benefits and avoid the drawbacks of CBFT and CFFT, we propose a novel framework with a Hybrid Feature Transformation module (HFT).
1 code implementation • 7 Apr 2022 • Yi Wei, Linqing Zhao, Wenzhao Zheng, Zheng Zhu, Yongming Rao, Guan Huang, Jiwen Lu, Jie zhou
In this paper, we propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.
1 code implementation • 31 Mar 2022 • JunJie Huang, Guan Huang
Single frame data contains finite information which limits the performance of the existing vision-based multi-camera 3D object detection paradigms.
Ranked #18 on
3D Object Detection
on nuScenes Camera Only
no code implementations • 18 Mar 2022 • Yuebing Liang, Guan Huang, Zhan Zhao
Bike sharing is an increasingly popular part of urban transportation systems.
2 code implementations • CVPR 2022 • Kai Wang, Bo Zhao, Xiangyu Peng, Zheng Zhu, Shuo Yang, Shuo Wang, Guan Huang, Hakan Bilen, Xinchao Wang, Yang You
Dataset condensation aims at reducing the network training effort through condensing a cumbersome training set into a compact synthetic one.
no code implementations • CVPR 2022 • Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Dalong Du, Jie zhou, Jiwen Lu
In this paper, we propose a general method to learn appropriate embeddings for dimension estimation in monocular 3D object detection.
2 code implementations • 22 Dec 2021 • JunJie Huang, Guan Huang, Zheng Zhu, Yun Ye, Dalong Du
As a fast version, BEVDet-Tiny scores 31. 2% mAP and 39. 2% NDS on the nuScenes val set.
Ranked #20 on
Robust Camera Only 3D Object Detection
on nuScenes-C
no code implementations • 15 Dec 2021 • Yuebing Liang, Guan Huang, Zhan Zhao
Despite some recent efforts, existing approaches to multimodal demand prediction are generally not flexible enough to account for multiplex networks with diverse spatial units and heterogeneous spatiotemporal correlations across different modes.
1 code implementation • CVPR 2022 • Yongming Rao, Wenliang Zhao, Guangyi Chen, Yansong Tang, Zheng Zhu, Guan Huang, Jie zhou, Jiwen Lu
In this work, we present a new framework for dense prediction by implicitly and explicitly leveraging the pre-trained knowledge from CLIP.
no code implementations • 27 Oct 2021 • Guan Huang, Son N. Tran, Quan Bai, Jane Alty
We have implemented a hand gesture detector to detect the gestures in the hand movement tests and our detection mAP is 0. 782 which is better than the state-of-the-art.
no code implementations • 10 Sep 2021 • Yunze Chen, JunJie Huang, Jiagang Zhu, Zheng Zhu, Tian Yang, Guan Huang, Dalong Du
The current research on this problem mainly focuses on designing an efficient Fully-connected layer (FC) to reduce GPU memory consumption caused by a large number of identities.
no code implementations • 16 Aug 2021 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jia Guo, Jiwen Lu, Dalong Du, Jie zhou
There are second phase of the challenge till October 1, 2021 and on-going leaderboard.
1 code implementation • CVPR 2021 • Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie zhou
To address the dilemma of large-scale training and efficient inference, we propose the STructure-AwaRe Face Clustering (STAR-FC) method.
no code implementations • 7 Jun 2021 • Yiding Liu, Guan Huang, Jiaxiang Liu, Weixue Lu, Suqi Cheng, Yukun Li, Daiting Shi, Shuaiqiang Wang, Zhicong Cheng, Dawei Yin
More importantly, we present a practical system workflow for deploying the model in web-scale retrieval.
no code implementations • 6 Apr 2021 • Jiabin Zhang, Zheng Zhu, Jiwen Lu, JunJie Huang, Guan Huang, Jie zhou
To make a better trade-off between accuracy and efficiency, we propose a novel multi-person pose estimation framework, SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation (SIMPLE).
1 code implementation • 24 Mar 2021 • Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie zhou
To address the dilemma of large-scale training and efficient inference, we propose the STructure-AwaRe Face Clustering (STAR-FC) method.
no code implementations • CVPR 2021 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie zhou
In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol.
Ranked #1 on
Face Verification
on IJB-C
(training dataset metric)
2 code implementations • 17 Aug 2020 • Junjie Huang, Zheng Zhu, Guan Huang, Dalong Du
As AID successfully pushes the performance boundary of human pose estimation problem by considerable margin and sets a new state-of-the-art, we hope AID to be a regular configuration for training human pose estimators.
Ranked #1 on
Multi-Person Pose Estimation
on COCO minival
3 code implementations • CVPR 2020 • Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang, Dalong Du
Specifically, by investigating the standard data processing in state-of-the-art approaches mainly including coordinate system transformation and keypoint format transformation (i. e., encoding and decoding), we find that the results obtained by common flipping strategy are unaligned with the original ones in inference.
Ranked #14 on
Pose Estimation
on COCO test-dev
no code implementations • 14 Oct 2019 • Junjie Huang, Zheng Zhu, Guan Huang
Human pose estimation are of importance for visual understanding tasks such as action recognition and human-computer interaction.
no code implementations • 26 Aug 2019 • Zheng Zhu, Wei Zou, Guan Huang, Dalong Du, Chang Huang
In this paper, we propose an end-to-end framework to learn the convolutional features and perform the tracking process simultaneously, namely, a unified convolutional tracker (UCT).
no code implementations • 20 Aug 2019 • Zewen He, He Huang, Yudong Wu, Guan Huang, Wensheng Zhang
Scale variation remains a challenging problem for object detection.
no code implementations • 15 Aug 2019 • Jiabin Zhang, Zheng Zhu, Wei Zou, Peng Li, Yanwei Li, Hu Su, Guan Huang
Given the results of MTN, we adopt an occlusion-aware Re-ID feature strategy in the pose tracking module, where pose information is utilized to infer the occlusion state to make better use of Re-ID feature.
no code implementations • 4 Jun 2019 • Rui Zhang, Zheng Zhu, Peng Li, Rui Wu, Chaoxu Guo, Guan Huang, Hailun Xia
Human pose estimation has witnessed a significant advance thanks to the development of deep learning.
no code implementations • 4 Jun 2019 • Peng Li, Jiabin Zhang, Zheng Zhu, Yanwei Li, Lu Jiang, Guan Huang
Multi-target Multi-camera Tracking (MTMCT) aims to extract the trajectories from videos captured by a set of cameras.
3 code implementations • 29 Dec 2018 • Houjing Huang, Wenjie Yang, Xiaotang Chen, Xin Zhao, Kaiqi Huang, Jinbin Lin, Guan Huang, Dalong Du
Person re-identification (ReID) has achieved significant improvement under the single-domain setting.
no code implementations • 14 Dec 2018 • Jiagang Zhu, Wei Zou, Liang Xu, Yiming Hu, Zheng Zhu, Manyu Chang, Jun-Jie Huang, Guan Huang, Dalong Du
On NTU RGB-D, Action Machine achieves the state-of-the-art performance with top-1 accuracies of 97. 2% and 94. 3% on cross-view and cross-subject respectively.
Ranked #1 on
Action Recognition
on UTD-MHAD
no code implementations • CVPR 2019 • Yanwei Li, Xinze Chen, Zheng Zhu, Lingxi Xie, Guan Huang, Dalong Du, Xingang Wang
This paper studies panoptic segmentation, a recently proposed task which segments foreground (FG) objects at the instance level as well as background (BG) contents at the semantic level.
Ranked #24 on
Panoptic Segmentation
on COCO test-dev
6 code implementations • 21 Apr 2018 • Xiao Ma, Liqin Zhao, Guan Huang, Zhi Wang, Zelin Hu, Xiaoqiang Zhu, Kun Gai
To the best of our knowledge, this is the first public dataset which contains samples with sequential dependence of click and conversion labels for CVR modeling.
no code implementations • 10 Nov 2017 • Zheng Zhu, Guan Huang, Wei Zou, Dalong Du, Chang Huang
Convolutional neural networks (CNN) based tracking approaches have shown favorable performance in recent benchmarks.
no code implementations • 30 Jul 2015 • Shuangyin Li, Jiefei Li, Guan Huang, Ruiyang Tan, Rong Pan
We propose a novel method to model the SSDs by a so-called Tag-Weighted Topic Model (TWTM).