no code implementations • 23 Aug 2023 • Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen
Large-scale well-annotated datasets are of great importance for training an effective object detector.
no code implementations • 20 Apr 2023 • Jinxiang Lai, Siqian Yang, JunHong Zhou, Wenlong Wu, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Chengjie Wang
According to this, we propose a novel Clustered-patch Element Connection (CEC) layer to correct the mismatch problem.
Ranked #47 on
Few-Shot Semantic Segmentation
on COCO-20i (5-shot)
1 code implementation • 15 Mar 2023 • Jinxiang Lai, Siqian Yang, Wenlong Wu, Tao Wu, Guannan Jiang, Xi Wang, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang
Then we derive two specific attention modules, named SpatialFormer Semantic Attention (SFSA) and SpatialFormer Target Attention (SFTA), to enhance the target object regions while reduce the background distraction.
no code implementations • 12 Dec 2022 • Chenliang Gu, Changan Wang, Bin-Bin Gao, Jun Liu, Tianliang Zhang
Recently, density map regression-based methods have dominated in crowd counting owing to their excellent fitting ability on density distribution.
no code implementations • 23 Nov 2022 • Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao, Tianliang Zhang, Wenlong Wu, Wei zhang, Chengjie Wang, Yuan Xie
This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning.
1 code implementation • 2 Nov 2022 • Jinxiang Lai, Siqian Yang, Guannan Jiang, Xi Wang, Yuxi Li, Zihui Jia, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang
In this paper, for the first time, we investigate the contributions of different distance metrics, and propose an adaptive fusion scheme, bringing significant improvements in few-shot classification.
1 code implementation • 2 Nov 2022 • Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, Bin-Bin Gao, Chengjie Wang
Few-Shot Learning (FSL) alleviates the data shortage challenge via embedding discriminative target-aware features among plenty seen (base) and few unseen (novel) labeled samples.
1 code implementation • ACMMM 2022 • Wujin Li, Jiawei Zhan, Jinbao Wang, Bizhong Xia, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng
We believe that the proposed task and benchmark will be beneficial to the field of AD.
1 code implementation • Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022) 2022 • Bin-Bin Gao, Xiaochen Chen, Zhongyi Huang, Congchong Nie, Jun Liu, Jinxiang Lai, Guannan Jiang, Xi Wang, Chengjie Wang
This paper focus on few-shot object detection~(FSOD) and instance segmentation~(FSIS), which requires a model to quickly adapt to novel classes with a few labeled instances.
Ranked #2 on
Few-Shot Object Detection
on MS-COCO (1-shot)
1 code implementation • 15 Jun 2022 • Guo-Hua Wang, Bin-Bin Gao, Chengjie Wang
And most segmentation networks can be adapted to solve the CD problems with our MTF module.
Ranked #1 on
Change Detection
on PCD
no code implementations • 24 Nov 2021 • Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao
In practice, it can adaptively generate multiple class-agnostic prototypes for query images and learn feature alignment in a self-contrastive manner.
no code implementations • 18 Jun 2021 • Feng Luo, Bin-Bin Gao, Jiangpeng Yan, Xiu Li
Experiments also show that our proposed method achieves competitive performance compared to existing boundary-based methods with a lightweight design and a simple pipeline.
no code implementations • 19 Apr 2021 • Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao
To this end, we generate self-contrastive background prototypes directly from the query image, with which we enable the construction of complete sample pairs and thus a complementary and auxiliary segmentation task to achieve the training of a better segmentation model.
1 code implementation • 3 Jul 2020 • Bin-Bin Gao, Hong-Yu Zhou
To bridge the gap between global and local streams, we propose a multi-class attentional region module which aims to make the number of attentional regions as small as possible and keep the diversity of these regions as high as possible.
Ranked #2 on
Multi-Label Classification
on PASCAL VOC 2012
1 code implementation • 3 Jul 2020 • Bin-Bin Gao, Xin-Xin Liu, Hong-Yu Zhou, Jianxin Wu, Xin Geng
The effectiveness of our approach has been demonstrated on both facial age and attractiveness estimation tasks.
Ranked #1 on
Age Estimation
on ChaLearn 2016
1 code implementation • 13 Jul 2018 • Bin-Bin Gao, Hong-Yu Zhou, Jianxin Wu, Xin Geng
Age estimation performance has been greatly improved by using convolutional neural network.
1 code implementation • 15 Nov 2017 • Bin-Bin Gao, Jian-Jun Wang
In this paper, we propose a Fast and Robust TSVM~(FR-TSVM) to deal with the above issues.
no code implementations • 20 Jul 2017 • Hong-Yu Zhou, Bin-Bin Gao, Jianxin Wu
The difficulty of image recognition has gradually increased from general category recognition to fine-grained recognition and to the recognition of some subtle attributes such as temperature and geolocation.
no code implementations • ICCV 2017 • Hong-Yu Zhou, Bin-Bin Gao, Jianxin Wu
In this paper, we propose Adaptive Feeding (AF) to combine a fast (but less accurate) detector and an accurate (but slow) detector, by adaptively determining whether an image is easy or hard and choosing an appropriate detector for it.
2 code implementations • 6 Nov 2016 • Bin-Bin Gao, Chao Xing, Chen-Wei Xie, Jianxin Wu, Xin Geng
However, it is difficult to collect sufficient training images with precise labels in some domains such as apparent age estimation, head pose estimation, multi-label classification and semantic segmentation.
Ranked #1 on
Head Pose Estimation
on BJUT-3D
no code implementations • CVPR 2016 • Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai
With strong labels, our framework is able to achieve state-of-the-art results in both datasets.
Ranked #16 on
Multi-Label Classification
on PASCAL VOC 2007
no code implementations • 21 Apr 2015 • Bin-Bin Gao, Xiu-Shen Wei, Jianxin Wu, Weiyao Lin
In this paper we show that by carefully making good choices for various detailed but important factors in a visual recognition framework using deep learning features, one can achieve a simple, efficient, yet highly accurate image classification system.
no code implementations • 19 Apr 2015 • Jianxin Wu, Bin-Bin Gao, Guoqing Liu
In computer vision, an entity such as an image or video is often represented as a set of instance vectors, which can be SIFT, motion, or deep learning feature vectors extracted from different parts of that entity.