1 code implementation • 28 May 2024 • Bencheng Liao, Xinggang Wang, Lianghui Zhu, Qian Zhang, Chang Huang
Recently, linear complexity sequence modeling networks have achieved modeling capabilities similar to Vision Transformers on a variety of computer vision tasks, while using fewer FLOPs and less memory.
no code implementations • 13 May 2024 • Chang Huang, Junqiao Zhao, Shatong Zhu, Hongtu Zhou, Chen Ye, Tiantian Feng, Changjun Jiang
Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 20 Feb 2024 • Shaoyu Chen, Bo Jiang, Hao Gao, Bencheng Liao, Qing Xu, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
Learning a human-like driving policy from large-scale driving demonstrations is promising, but the uncertainty and non-deterministic nature of planning make it challenging.
1 code implementation • NeurIPS 2023 • Jialv Zou, Xinggang Wang, Jiahao Guo, Wenyu Liu, Qian Zhang, Chang Huang
In our work, we propose a novel perspective for circuit design by treating circuit components as point clouds and using Transformer-based point cloud perception methods to extract features from the circuit.
2 code implementations • 10 Aug 2023 • Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
We propose a unified permutation-equivalent modeling approach, \ie, modeling map element as a point set with a group of equivalent permutations, which accurately describes the shape of map element and stabilizes the learning process.
no code implementations • 24 Jun 2023 • Xiao Zhang, Hai Zhang, Hongtu Zhou, Chang Huang, Di Zhang, Chen Ye, Junqiao Zhao
In this paper, we propose a method to construct a boundary that discriminates safe and unsafe states.
1 code implementation • 19 Apr 2023 • Shaoyu Chen, Yunchi Zhang, Bencheng Liao, Jiafeng Xie, Tianheng Cheng, Wei Sui, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
We design a divide-and-conquer annotation scheme to solve the spatial extensibility problem of HD map generation, and abstract map elements with a variety of geometric patterns as unified point sequence representation, which can be extended to most map elements in the driving scene.
2 code implementations • ICCV 2023 • Bo Jiang, Shaoyu Chen, Qing Xu, Bencheng Liao, Jiajie Chen, Helong Zhou, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
In this paper, we propose VAD, an end-to-end vectorized paradigm for autonomous driving, which models the driving scene as a fully vectorized representation.
1 code implementation • 15 Mar 2023 • Bencheng Liao, Shaoyu Chen, Bo Jiang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
Motivated by this, we propose to model the lane graph in a novel path-wise manner, which well preserves the continuity of the lane and encodes traffic information for planning.
no code implementations • 5 Dec 2022 • Bo Jiang, Shaoyu Chen, Xinggang Wang, Bencheng Liao, Tianheng Cheng, Jiajie Chen, Helong Zhou, Qian Zhang, Wenyu Liu, Chang Huang
Motion prediction is highly relevant to the perception of dynamic objects and static map elements in the scenarios of autonomous driving.
no code implementations • 13 Sep 2022 • Hao Luan, Yu Yao, Chang Huang
A domain specific memory architecture is essential to achieve the above goals.
1 code implementation • 30 Aug 2022 • Bencheng Liao, Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang
High-definition (HD) map provides abundant and precise environmental information of the driving scene, serving as a fundamental and indispensable component for planning in autonomous driving system.
Ranked #7 on 3D Lane Detection on OpenLane-V2 val
1 code implementation • 22 Jun 2022 • Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Chang Huang, Wenyu Liu
Based on Polar Parametrization, we propose a surround-view 3D DEtection TRansformer, named PolarDETR.
2 code implementations • CVPR 2022 • Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Wenqiang Zhang, Qian Zhang, Chang Huang, Zhaoxiang Zhang, Wenyu Liu
In this paper, we propose a conceptually novel, efficient, and fully convolutional framework for real-time instance segmentation.
Ranked #7 on Real-time Instance Segmentation on MSCOCO
1 code implementation • CVPR 2022 • Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Wenqiang Zhang, Qian Zhang, Chang Huang, Wenyu Liu
For segmentation, we integrate AziNorm into KPConv.
1 code implementation • 25 Mar 2021 • Xinggang Wang, Zhaojin Huang, Bencheng Liao, Lichao Huang, Yongchao Gong, Chang Huang
Based on deep networks, video object detection is actively studied for pushing the limits of detection speed and accuracy.
1 code implementation • 31 Dec 2019 • Mengting Chen, Yuxin Fang, Xinggang Wang, Heng Luo, Yifeng Geng, Xin-Yu Zhang, Chang Huang, Wenyu Liu, Bo wang
The learning problem of the sample generation (i. e., diversity transfer) is solved via minimizing an effective meta-classification loss in a single-stage network, instead of the generative loss in previous works.
1 code implementation • 11 Dec 2019 • Shaoru Wang, Yongchao Gong, Junliang Xing, Lichao Huang, Chang Huang, Weiming Hu
To reciprocate these two tasks, we design a two-stream structure to learn features on both the object level (i. e., bounding boxes) and the pixel level (i. e., instance masks) jointly.
Ranked #94 on Instance Segmentation on COCO test-dev
no code implementations • 26 Aug 2019 • Zheng Zhu, Wei Zou, Guan Huang, Dalong Du, Chang Huang
In this paper, we propose an end-to-end framework to learn the convolutional features and perform the tracking process simultaneously, namely, a unified convolutional tracker (UCT).
no code implementations • 11 Jul 2019 • Hao Luo, Lichao Huang, Han Shen, Yuan Li, Chang Huang, Xinggang Wang
Without any bells and whistles, our method obtains 80. 3\% mAP on the ImageNet VID dataset, which is superior over the previous state-of-the-arts.
1 code implementation • 2 Jul 2019 • Qiang Zhou, Zilong Huang, Lichao Huang, Yongchao Gong, Han Shen, Chang Huang, Wenyu Liu, Xinggang Wang
Video object segmentation (VOS) aims at pixel-level object tracking given only the annotations in the first frame.
Ranked #1 on Visual Object Tracking on YouTube-VOS 2018 (Jaccard (Seen) metric)
3 code implementations • CVPR 2019 • Zhaojin Huang, Lichao Huang, Yongchao Gong, Chang Huang, Xinggang Wang
In this paper, we study this problem and propose Mask Scoring R-CNN which contains a network block to learn the quality of the predicted instance masks.
Ranked #75 on Instance Segmentation on COCO minival
1 code implementation • 17 Jan 2019 • Jiemin Fang, Yukang Chen, Xinbang Zhang, Qian Zhang, Chang Huang, Gaofeng Meng, Wenyu Liu, Xinggang Wang
In our implementations, architectures are first searched on a small dataset, e. g., CIFAR-10.
no code implementations • ECCV 2018 • Cheng Wang, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
We propose a novel deep network called Mancs that solves the person re-identification problem from the following aspects: fully utilizing the attention mechanism for the person misalignment problem and properly sampling for the ranking loss to obtain more stable person representation.
no code implementations • 5 Aug 2018 • Han Shen, Lichao Huang, Chang Huang, Wei Xu
The separation of the task requires to define a hand-crafted training goal in affinity learning stage and a hand-crafted cost function of data association stage, which prevents the tracking goals from learning directly from the feature.
1 code implementation • 1 Aug 2018 • Yukang Chen, Gaofeng Meng, Qian Zhang, Shiming Xiang, Chang Huang, Lisen Mu, Xinggang Wang
To address this issue, we propose the Reinforced Evolutionary Neural Architecture Search (RE- NAS), which is an evolutionary method with the reinforced mutation for NAS.
3 code implementations • 30 Jul 2018 • Liangchen Song, Cheng Wang, Lefei Zhang, Bo Du, Qian Zhang, Chang Huang, Xinggang Wang
We study the problem of unsupervised domain adaptive re-identification (re-ID) which is an active topic in computer vision but lacks a theoretical foundation.
Ranked #16 on Unsupervised Domain Adaptation on Market to Duke
no code implementations • 1 Feb 2018 • Jingchu Liu, Pengfei Hou, Lisen Mu, Yinan Yu, Chang Huang
Tactical driving decision making is crucial for autonomous driving systems and has attracted considerable interest in recent years.
no code implementations • 10 Nov 2017 • Zheng Zhu, Guan Huang, Wei Zou, Dalong Du, Chang Huang
Convolutional neural networks (CNN) based tracking approaches have shown favorable performance in recent benchmarks.
no code implementations • ICCV 2015 • Shangxuan Tian, Yifeng Pan, Chang Huang, Shijian Lu, Kai Yu, Chew Lim Tan
With character candidates detected by cascade boosting, the min-cost flow network model integrates the last three sequential steps into a single process which solves the error accumulation problem at both character level and text line level effectively.
1 code implementation • CVPR 2016 • Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu
While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image.
no code implementations • ICCV 2015 • Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang
This paper presents a data-driven matching cost for stereo matching.
no code implementations • ICCV 2015 • Chunshui Cao, Xian-Ming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang
While feedforward deep convolutional neural networks (CNNs) have been a great success in computer vision, it is important to remember that the human visual contex contains generally more feedback connections than foward connections.
no code implementations • 24 Jun 2015 • Jingtuo Liu, Yafeng Deng, Tao Bai, Zhengping Wei, Chang Huang
Face Recognition has been studied for many decades.
no code implementations • CVPR 2015 • Sifei Liu, Jimei Yang, Chang Huang, Ming-Hsuan Yang
This paper formulates face labeling as a conditional random field with unary and pairwise classifiers.
no code implementations • CVPR 2015 • Jiajun Wu, Yinan Yu, Chang Huang, Kai Yu
The recent development in learning deep representations has demonstrated its wide applications in traditional vision tasks like classification and detection.
no code implementations • CVPR 2015 • Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang
To demonstrate the effectiveness of our approach, we collect a large-scale real-world clothing classification dataset with both noisy and clean labels.
6 code implementations • ICCV 2015 • Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr
Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding.
Ranked #36 on Semantic Segmentation on PASCAL VOC 2012 test
1 code implementation • 25 Dec 2012 • Chang Huang, Shenghuo Zhu, Kai Yu
Learning Mahanalobis distance metrics in a high- dimensional feature space is very difficult especially when structural sparsity and low rank are enforced to improve com- putational efficiency in testing phase.