no code implementations • 16 May 2024 • Binghui Chen, Chongyang Zhong, Wangmeng Xiang, Yifeng Geng, Xuansong Xie
Due to the significant advances in large-scale text-to-image generation by diffusion model (DM), controllable human image generation has been attracting much attention recently.
no code implementations • 7 Apr 2024 • Binghui Chen, Wenyu Li, Yifeng Geng, Xuansong Xie, WangMeng Zuo
Specifically, we propose a shoe-wearing system, called Shoe-Model, to generate plausible images of human legs interacting with the given shoes.
no code implementations • 7 Apr 2024 • Youze Xue, Binghui Chen, Yifeng Geng, Xuansong Xie, Jiansheng Chen, Hongbing Ma
Customized generative text-to-image models have the ability to produce images that closely resemble a given subject.
1 code implementation • 6 Feb 2024 • Mingyue Guo, Binghui Chen, Zhaoyi Yan, YaoWei Wang, Qixiang Ye
Multidomain crowd counting aims to learn a general model for multiple diverse datasets.
no code implementations • CVPR 2024 • Mingyue Guo, Li Yuan, Zhaoyi Yan, Binghui Chen, YaoWei Wang, Qixiang Ye
In this study, we propose mutual prompt learning (mPrompt), which leverages a regressor and a segmenter as guidance for each other, solving bias and inaccuracy caused by annotation variance while distinguishing foreground from background.
1 code implementation • 30 Mar 2023 • Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Wangmeng Xiang, Binghui Chen, Bin Luo, Yifeng Geng, Xuansong Xie
Real-time perception, or streaming perception, is a crucial aspect of autonomous driving that has yet to be thoroughly explored in existing research.
no code implementations • CVPR 2023 • Xiaolin Song, Binghui Chen, Pengyu Li, Jun-Yan He, Biao Wang, Yifeng Geng, Xuansong Xie, Honggang Zhang
End-to-end pedestrian detection focuses on training a pedestrian detection model via discarding the Non-Maximum Suppression (NMS) post-processing.
no code implementations • 7 Dec 2022 • Kaicheng Li, Hongyu Yang, Binghui Chen, Pengyu Li, Biao Wang, Di Huang
Along with the widespread use of face recognition systems, their vulnerability has become highlighted.
1 code implementation • CVPR 2022 • Binghui Chen, Pengyu Li, Xiang Chen, Biao Wang, Lei Zhang, Xian-Sheng Hua
Semi-supervised object detection (SSOD) aims to facilitate the training and deployment of object detectors with the help of a large amount of unlabeled data.
1 code implementation • ICCV 2021 • Binghui Chen, Zhaoyi Yan, Ke Li, Pengyu Li, Biao Wang, WangMeng Zuo, Lei Zhang
In crowd counting, due to the problem of laborious labelling, it is perceived intractability of collecting a new large-scale dataset which has plentiful images with large diversity in density, scene, etc.
no code implementations • CVPR 2021 • Wenyu Li, Tianchu Guo, Pengyu Li, Binghui Chen, Biao Wang, WangMeng Zuo, Lei Zhang
In this paper, we propose a novel face recognition method, named VirFace, to effectively apply the unlabeled shallow data for face recognition.
1 code implementation • Springer 2020 • Yuchong Gu, Zitao Zen, Haibin Chen, Jun Wei, Yaqin Zhang, Binghui Chen, Yingqin Li, Yujuan Qin, Qing Xie, Zhuoren Jiang, Yao Lu
Super-resolution (SR) in medical imaging is an emerging application in medical imaging due to the needs of high quality images acquired with limited radiation dose, such as low dose Computer Tomography (CT), low field magnetic resonance imaging (MRI).
1 code implementation • ICCV 2019 • Binghui Chen, Weihong Deng, Jiani Hu
Then, rethinking person ReID as a zero-shot learning problem, we propose the Mixed High-Order Attention Network (MHN) to further enhance the discrimination and richness of attention knowledge in an explicit manner.
Ranked #4 on Person Re-Identification on CUHK03-C
Person Re-Identification Vocal Bursts Intensity Prediction +1
no code implementations • CVPR 2019 • Binghui Chen, Weihong Deng
In zero-shot image retrieval (ZSIR) task, embedding learning becomes more attractive, however, many methods follow the traditional metric learning idea and omit the problems behind zero-shot settings.
no code implementations • CVPR 2019 • Tongtong Yuan, Weihong Deng, Jian Tang, Yinan Tang, Binghui Chen
In this paper, different from the approaches on learning the loss structures, we propose a robust SNR distance metric based on Signal-to-Noise Ratio (SNR) for measuring the similarity of image pairs for deep metric learning.
no code implementations • 22 Jan 2019 • Binghui Chen, Weihong Deng
However, in this paper, we first emphasize that the generalization ability is a core ingredient of this 'good' embedding as well and largely affects the metric performance in zero-shot settings as a matter of fact.
2 code implementations • NeurIPS 2018 • Binghui Chen, Weihong Deng, Haifeng Shen
Recently, learning discriminative features to improve the recognition performances gradually becomes the primary goal of deep learning, and numerous remarkable works have emerged.
no code implementations • 20 Nov 2018 • Wanxin Tian, Zixuan Wang, Haifeng Shen, Weihong Deng, Yiping Meng, Binghui Chen, Xiubao Zhang, Yuan Zhao, Xiehe Huang
We assume that problems inside are inadequate use of supervision information and imbalance between semantics and details at all level feature maps in CNN even with Feature Pyramid Networks (FPN).
no code implementations • 4 Jun 2018 • Binghui Chen, Weihong Deng
Deep embedding learning becomes more attractive for discriminative feature learning, but many methods still require hard-class mining, which is computationally complex and performance-sensitive.
no code implementations • CVPR 2017 • Binghui Chen, Weihong Deng, Junping Du
In this paper, we first emphasize that the early saturation behavior of softmax will impede the exploration of SGD, which sometimes is a reason for model converging at a bad local-minima, then propose Noisy Softmax to mitigating this early saturation issue by injecting annealed noise in softmax during each iteration.