PSTR: End-to-End One-Step Person Search With Transformers

1 code implementation • 7 Apr 2022 • Jiale Cao, Yanwei Pang, Rao Muhammad Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, Fahad Shahbaz Khan

We propose a novel one-step transformer-based person search framework, PSTR, that jointly performs person detection and re-identification (re-id) in a single architecture.

Human Detection, Person Search

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer

1 code implementation • 24 Mar 2022 • Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan

When using the ResNet50 backbone, our MS-STS achieves a mask AP of 50.1%, outperforming the best reported results in the literature by 2.7%, and by 4.8% at the higher overlap threshold of AP_75, while being comparable in model size and speed on YouTube-VIS 2019 val.

Instance Segmentation, Semantic Segmentation

Shape Prior Non-Uniform Sampling Guided Real-time Stereo 3D Object Detection

no code implementations • 18 Jun 2021 • Aqi Gao, Jiale Cao, Yanwei Pang

Compared with the baseline RTS3D, our proposed method achieves a 2.57% improvement in AP3d with almost no extra network parameters.

3D Object Detection

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

1 code implementation • 3 Dec 2020 • Tiancai Wang, Tong Yang, Jiale Cao, Xiangyu Zhang

Object detectors usually achieve promising results with the supervision of complete instance annotations.

Multi-View Learning, Object Detection

From Handcrafted to Deep Features for Pedestrian Detection: A Survey

2 code implementations • 1 Oct 2020 • Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, Ling Shao

In addition to single-spectral pedestrian detection, we also review multi-spectral pedestrian detection, which provides more robust features for illumination variance.

Pedestrian Detection

NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

no code implementations • CVPR 2020 • Yazhao Li, Yanwei Pang, Jianbing Shen, Jiale Cao, Ling Shao

With this observation, we propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.

Object Detection

Hierarchical Shot Detector

1 code implementation • ICCV 2019 • Jiale Cao, Yanwei Pang, Jungong Han, Xuelong Li

To further solve the second problem, a hierarchical shot detector (HSD) is proposed, which stacks two ROC modules and one feature enhanced module.

General Classification, Object Detection

Triply Supervised Decoder Networks for Joint Detection and Segmentation

no code implementations • CVPR 2019 • Jiale Cao, Yanwei Pang, Xuelong Li

Experimental results on the VOC2007 and VOC2012 datasets demonstrate that the proposed TripleNet is able to improve both the detection and segmentation accuracies without adding extra computational costs.

Object Detection, Self-Driving Cars

Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection

no code implementations • 3 Apr 2018 • Jiale Cao, Yanwei Pang, Xuelong Li

In this paper, we propose a multi-branch and high-level semantic network by gradually splitting a base network into multiple different branches.

Object Detection, Pedestrian Detection

Learning Multilayer Channel Features for Pedestrian Detection

no code implementations • 1 Mar 2016 • Jiale Cao, Yanwei Pang, Xuelong Li

For example, a CNN classifies these proposals using the fully connected layer features, while the proposal scores and the features in the inner layers of the CNN are ignored.

Pedestrian Detection

Learning Sampling Distributions for Efficient Object Detection

no code implementations • 23 Aug 2015 • Yanwei Pang, Jiale Cao, Xuelong Li

Multistage particle windows (MPW), proposed by Gualdi et al., is an algorithm for fast and accurate object detection.

Face Detection, Object Detection

Cascade Learning by Optimally Partitioning

no code implementations • 18 Aug 2015 • Yanwei Pang, Jiale Cao, Xuelong Li

iCascade searches for the optimal number r_i of weak classifiers in each stage i by directly minimizing the computation cost of the cascade.

Face Detection, Object Detection

