no code implementations • 23 Jun 2022 • Xu Zhang, Wen Wang, Zhe Chen, Jing Zhang, DaCheng Tao
In PromptPose, we propose that adapting the language knowledge to the visual animal poses is key to achieve effective animal pose estimation.
no code implementations • 19 May 2022 • Xiao Wang, Zhe Chen, Bo Jiang, Jin Tang, Bin Luo, DaCheng Tao
To track the target in a video, current visual trackers usually adopt greedy search for target object localization in each frame, that is, the candidate region with the maximum response score will be selected as the tracking result of each frame.
1 code implementation • 17 May 2022 • Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao
When fine-tuning on downstream tasks, a modality-specific adapter is used to introduce the data and tasks' prior information into the model, making it suitable for these tasks.
Ranked #1 on
Semantic Segmentation
on Cityscapes test
(using extra training data)
1 code implementation • CVPR 2022 • Liyao Tang, Yibing Zhan, Zhe Chen, Baosheng Yu, DaCheng Tao
Point cloud segmentation is fundamental in understanding 3D environments.
Ranked #4 on
Semantic Segmentation
on S3DIS Area5
no code implementations • 7 Mar 2022 • Zhe Chen, Cong Wang
We present sufficient conditions for the load-flow solvability under security constraints in DC distribution networks.
no code implementations • 28 Jan 2022 • Yu-Hong Cai, Xiao-Jun Wu, Zhe Chen
However, methods based on this technique ignore the pressure on a single transformation matrix due to the complex information contained in the data.
1 code implementation • 6 Jan 2022 • Chen Chen, Zhe Chen, Jing Zhang, DaCheng Tao
We observe that the prevailing set abstraction design for down-sampling points may maintain too much unimportant background information that can affect feature learning for detecting objects.
1 code implementation • CVPR 2022 • Zhe Chen, Jing Zhang, DaCheng Tao
Then, a glimpse-based decoder is introduced to provide refined detection results based on both the glimpse features and the attention modeling outputs of the previous stage.
no code implementations • 1 Dec 2021 • Tianyue Zheng, Zhe Chen, Shuya Ding, Chao Cai, Jun Luo
Whereas adversarial training can be useful against specific adversarial perturbations, they have also proven ineffective in generalizing towards attacks deviating from those used for training.
no code implementations • 16 Nov 2021 • Tianyue Zheng, Zhe Chen, Shujie Zhang, Chao Cai, Jun Luo
Crucial for healthcare and biomedical applications, respiration monitoring often employs wearable sensors in practice, causing inconvenience due to their direct contact with human bodies.
1 code implementation • 3 Nov 2021 • Zhe Chen, Wenhai Wang, Enze Xie, Zhibo Yang, Tong Lu, Ping Luo
We propose an accurate and efficient scene text detection framework, termed FAST (i. e., faster arbitrarily-shaped text detector).
Ranked #2 on
Scene Text Detection
on SCUT-CTW1500
1 code implementation • 29 Oct 2021 • Shuya Ding, Zhe Chen, Tianyue Zheng, Jun Luo
Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications.
no code implementations • 28 Oct 2021 • Tianyue Zheng, Zhe Chen, Shuya Ding, Jun Luo
To better understand this potential, this article takes a layered approach to summarize RF sensing enabled by deep learning.
no code implementations • 28 Oct 2021 • Tianyue Zheng, Zhe Chen, Chao Cai, Jun Luo, Xu Zhang
Given the significant amount of time people spend in vehicles, health issues under driving condition have become a major concern.
no code implementations • 27 Oct 2021 • Tianyue Zheng, Zhe Chen, Jun Luo, Lin Ke, Chaoyang Zhao, Yaowen Yang
To this end, we equip SiWa with a deep learning pipeline to parse the rich sensory data.
no code implementations • 13 Oct 2021 • Haichao Yu, Zhe Chen, Dong Lin, Gil Shamir, Jie Han
Dropout has been commonly used to quantify prediction uncertainty, i. e, the variations of model predictions on a given input example.
no code implementations • 29 Sep 2021 • Shujie Zhang, Tianyue Zheng, Zhe Chen, Jun Luo, Sinno Pan
In many practical scenarios of signal extraction from a nonlinear mixture, only one (signal) source is intended to be extracted.
no code implementations • 17 Sep 2021 • Yuanyuan Liu, Wenbin Wang, Chuanxu Feng, Haoyu Zhang, Zhe Chen, Yibing Zhan
To this end, we propose to decompose each video into a series of expression snippets, each of which contains a small number of facial movements, and attempt to augment the Transformer's ability for modeling intra-snippet and inter-snippet visual relations, respectively, obtaining the Expression snippet Transformer (EST).
2 code implementations • 11 Aug 2021 • Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, YaoWei Wang, Yonghong Tian, Feng Wu
Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency.
Ranked #1 on
Object Tracking
on VisEvent
1 code implementation • 30 Mar 2021 • Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, YaoWei Wang, Yonghong Tian, Feng Wu
In this paper, we propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy.
no code implementations • 30 Mar 2021 • Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, Erik Nygren, Sharada Mohanty
However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner.
1 code implementation • 22 Mar 2021 • Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo
(1) We divide input image into small patches and adopt TIN, successfully transferring image style with arbitrary high-resolution.
no code implementations • 17 Feb 2021 • Zhe Chen, Daniel Harabor, Jiaoyang Li, Peter J. Stuckey
During Multi-Agent Path Finding (MAPF) problems, agents can be delayed by unexpected events.
no code implementations • 25 Nov 2020 • Jack Humphreys, Zhe Chen, DaCheng Tao
Action recognition, which is formulated as a task to identify various human actions in a video, has attracted increasing interest from computer vision researchers due to its importance in various applications.
1 code implementation • 1 Nov 2020 • Licheng Wen, Zhen Zhang, Zhe Chen, Xiangrui Zhao, Yong liu
In this paper, we give a mathematical formalization of Multi-Agent Path Finding for Car-Like robots (CL-MAPF) problem.
Robotics Multiagent Systems
no code implementations • 17 Aug 2020 • Zhe Chen, Yuyan Wang, Dong Lin, Derek Zhiyuan Cheng, Lichan Hong, Ed H. Chi, Claire Cui
Despite deep neural network (DNN)'s impressive prediction performance in various domains, it is well known now that a set of DNN models trained with the same model specification and the same data can produce very different prediction results.
1 code implementation • ECCV 2020 • Zhe Chen, Shohei Nobuhara, Ko Nishino
We introduce a novel neural network-based BRDF model and a Bayesian framework for object inverse rendering, i. e., joint estimation of reflectance and natural illumination from a single image of an object of known geometry.
1 code implementation • 9 Aug 2020 • Weifeng Ma, Zhe Chen, Caoting Ji
This method can act as a plug-in for Fast Style Transfer without any modification to the network architecture.
no code implementations • 24 Jun 2020 • Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen, Frede Blaabjerg
Accurate knowledge of the distribution system topology and parameters is required to achieve good voltage controls, but this is difficult to obtain in practice.
1 code implementation • 10 Jun 2020 • Zhe Chen, Jing Zhang, DaCheng Tao
Modern two-stage object detectors generally require excessively large models for their detection heads to achieve high accuracy.
no code implementations • 3 Jun 2020 • Zhe Chen
We extend the classical result asserting that the twisting operator preserves certain Deligne--Lusztig character values for truncated formal power series; along the way we discuss some properties of centralisers.
Representation Theory
no code implementations • 31 May 2020 • Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen
This paper proposes a data-driven distributed voltage control approach based on the spectrum clustering and the enhanced multi-agent deep reinforcement learning (MADRL) algorithm.
3 code implementations • 17 May 2020 • Jian Ye, Zhe Chen, Juhua Liu, Bo Du
More specifically, we propose to perceive texts from three levels of feature representations, i. e., character-, word- and global-level, and then introduce a novel text representation fusion technique to help achieve robust arbitrary text detection.
Ranked #1 on
Scene Text Detection
on ICDAR 2015
1 code implementation • 3 Feb 2020 • Jing Zhang, Zhe Chen, DaCheng Tao
Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance.
Ranked #5 on
Pose Estimation
on COCO test-dev
no code implementations • 15 Dec 2019 • Zhe Chen, Wanli Ouyang, Tongliang Liu, DaCheng Tao
Alternatively, to access much more natural-looking pedestrians, we propose to augment pedestrian detection datasets by transforming real pedestrians from the same dataset into different shapes.
1 code implementation • 27 Oct 2019 • Jing Zhang, Zhe Chen, DaCheng Tao
Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance of person instances.
no code implementations • 14 May 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
Only learning one projection matrix from original samples to the corresponding binary labels is too strict and will consequentlly lose some intrinsic geometric structures of data.
1 code implementation • 2 Apr 2019 • Zhe Chen, Jing Zhang, DaCheng Tao
To this end, LiDAR sensor data can be incorporated to improve the visual image-based road detection, because LiDAR data is less susceptible to visual noises.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
To solve above problems, we propose a low-rank discriminative least squares regression model (LRDLSR) for multi-class image classification.
1 code implementation • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
On one hand, the Fisher criterion improves the intra-class compactness of the relaxed labels during relaxation learning.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
In this paper, we propose a non-negative representation based discriminative dictionary learning algorithm (NRDL) for multicategory face classification.
no code implementations • ECCV 2018 • Zhe Chen, Shaoli Huang, DaCheng Tao
Current two-stage object detectors, which consists of a region proposal stage and a refinement stage, may produce unreliable results due to ill-localized proposed regions.
no code implementations • 18 Sep 2015 • Zhe Chen, Zhibin Hong, DaCheng Tao
We find that further improvements for correlation filter-based tracking can be made on estimating scales, applying part-based tracking strategy and cooperating with long-term tracking methods.
no code implementations • CVPR 2015 • Zhibin Hong, Zhe Chen, Chaohui Wang, Xue Mei, Danil Prokhorov, DaCheng Tao
Variations in the appearance of a tracked object, such as changes in geometry/photometry, camera viewpoint, illumination, or partial occlusion, pose a major challenge to object tracking.
no code implementations • 20 Mar 2015 • Rahul Agarwal, Zhe Chen, Sridevi V. Sarma
In this paper, a nonparametric maximum likelihood (ML) estimator for band-limited (BL) probability density functions (pdfs) is proposed.
no code implementations • 27 Nov 2014 • Scott W. Linderman, Matthew J. Johnson, Matthew A. Wilson, Zhe Chen
Rodent hippocampal population codes represent important spatial information about the environment during navigation.