no code implementations • 17 Oct 2023 • Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu
In this study, we propose a novel framework, CorrTalk, which effectively establishes the temporal correlation between hierarchical speech features and facial activities of different intensities across distinct regions.
no code implementations • 1 Aug 2023 • Bolun Cai, Pengfei Xiong, Shangxuan Tian
In this paper, we propose a novel metric learning function called Center Contrastive Loss, which maintains a class-wise center bank and compares the category centers with the query data points using a contrastive loss.
Ranked #5 on Metric Learning on CUB-200-2011
1 code implementation • 19 Mar 2021 • Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen
To fully utilize the label annotations, we propose Unified Momentum Contrast (UniMoCo), which extends MoCo to support arbitrary ratios of labeled data and unlabeled data training.
2 code implementations • CVPR 2021 • Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen
DEtection TRansformer (DETR) for object detection reaches competitive performance compared with Faster R-CNN via a transformer encoder-decoder architecture.
1 code implementation • 14 May 2019 • Weirui Lu, Xiaofen Xing, Bolun Cai, Xiangmin Xu
However, the performance of ranking-based methods is often poor and this is mainly due to two reasons: 1) image cropping is a listwise ranking task rather than pairwise comparison; 2) the rescaling caused by pooling layer and the deformation in view generation damage the performance of composition learning.
1 code implementation • 23 Apr 2019 • Bolun Cai, Xiangmin Xu, Xiaofen Xing, Kui Jia, Jie Miao, DaCheng Tao
Visual tracking is challenging due to image variations caused by various factors, such as object deformation, scale change, illumination change and occlusion.
no code implementations • 4 Dec 2017 • Bolun Cai, Xiangmin Xu, Kailing Guo, Kui Jia, DaCheng Tao
With the powerful down-sampling process, the co-training DSN set a new state-of-the-art performance for image super-resolution.
no code implementations • ICCV 2017 • Bolun Cai, Xianming Xu, Kailing Guo, Kui Jia, Bin Hu, DaCheng Tao
We propose a joint intrinsic-extrinsic prior model to estimate both illumination and reflectance from an observed image.
2 code implementations • 25 Jun 2017 • Suo Qiu, Xiangmin Xu, Bolun Cai
Rectified linear unit (ReLU) is a widely used activation function for deep convolutional neural networks.
no code implementations • 15 May 2017 • Xiaoyi Jia, Xiangmin Xu, Bolun Cai, Kailing Guo
However, the previous methods mainly restore images from one single area in the low resolution (LR) input, which limits the flexibility of models to infer various scales of details for high resolution (HR) output.
1 code implementation • 8 Feb 2017 • Lingke Zeng, Xiangmin Xu, Bolun Cai, Suo Qiu, Tong Zhang
Crowd counting on static images is a challenging problem due to scale variations.
3 code implementations • 28 Jan 2016 • Bolun Cai, Xiangmin Xu, Kui Jia, Chunmei Qing, DaCheng Tao
The key to achieve haze removal is to estimate a medium transmission map for an input hazy image.
Ranked #7 on Image Dehazing on RS-Haze