no code implementations • ECCV 2020 • Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei, Tao Mei
Knowledge distillation is an effective tool to compress large pre-trained Convolutional Neural Networks (CNNs) or their ensembles into models applicable to mobile and embedded devices.
no code implementations • 22 Dec 2024 • Xiangtian Li, Xiaobo Wang, Zhen Qi, Han Cao, Zhaoyang Zhang, Ao Xiang
Dynamic texture synthesis aims to generate sequences that are visually similar to a reference video texture and exhibit specific stationary properties in time.
no code implementations • 27 Nov 2024 • Chaoyi Tan, Xiangtian Li, Xiaobo Wang, Zhen Qi, Ao Xiang
Thispaperaimstoresearchandimplementa real-timevideotargettrackingalgorithmbasedon ConvolutionalNeuralNetworks(CNN), enhancingthe accuracyandrobustnessoftargettrackingincomplex scenarios. Addressingthelimitationsoftraditionaltracking algorithmsinhandlingissuessuchastargetocclusion, morphologicalchanges, andbackgroundinterference, our approachintegratestargetdetectionandtrackingstrategies. It continuouslyupdatesthetargetmodelthroughanonline learningmechanismtoadapttochangesinthetarget's appearance. Experimentalresultsdemonstratethat, when dealingwithsituationsinvolvingrapidmotion, partial occlusion, andcomplexbackgrounds, theproposedalgorithm exhibitshighertrackingsuccessratesandlowerfailurerates comparedtoseveralmainstreamtrackingalgorithms. This studysuccessfullyappliesCNNtoreal-timevideotarget tracking, improvingtheaccuracyandstabilityofthetracking algorithmwhilemaintaininghighprocessingspeeds, thus meetingthedemandsofreal-timeapplications. Thisalgorithm isexpectedtoprovidenewsolutionsfortargettrackingtasksin videosurveillanceandintelligenttransportationdomains.
1 code implementation • 17 Jun 2024 • Dawulie Jinensibieke, Mieradilijiang Maimaiti, Wentao Xiao, Yuanhang Zheng, Xiaobo Wang
Relation Extraction (RE) serves as a crucial technology for transforming unstructured text into structured information, especially within the framework of Knowledge Graph development.
1 code implementation • 17 Jun 2024 • Siyuan Qi, Bangcheng Yang, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, Zilong Zheng
In scenarios where language models must incorporate new information efficiently without extensive retraining, traditional fine-tuning methods are prone to overfitting, degraded generalization, and unnatural language generation.
no code implementations • 18 Apr 2024 • Jiaqi Li, Xiaobo Wang, Wentao Ding, ZiHao Wang, Yipeng Kang, Zixia Jia, Zilong Zheng
We introduce an innovative RAG-based framework with an ever-improving memory.
no code implementations • 25 Jul 2021 • Qiang Meng, Xiaqing Xu, Xiaobo Wang, Yang Qian, Yunxiao Qin, Zezheng Wang, Chenxu Zhao, Feng Zhou, Zhen Lei
Despite the great success achieved by deep learning methods in face recognition, severe performance drops are observed for large pose variations in unconstrained environments (e. g., in cases of surveillance and photo-tagging).
1 code implementation • CVPR 2022 • Kai Wang, Shuo Wang, Panpan Zhang, Zhipeng Zhou, Zheng Zhu, Xiaobo Wang, Xiaojiang Peng, Baigui Sun, Hao Li, Yang You
This method adopts Dynamic Class Pool (DCP) for storing and updating the identities features dynamically, which could be regarded as a substitute for the FC layer.
Ranked #1 on
Face Verification
on IJB-C
(training dataset metric)
2 code implementations • 12 Oct 2020 • Xiaoyong Yang, Yadong Zhu, Yi Zhang, Xiaobo Wang, Quan Yuan
Building a recommendation system that serves billions of users on daily basis is a challenging problem, as the system needs to make astronomical number of predictions per second based on real-time user behaviors with O(1) time complexity.
no code implementations • 4 Aug 2020 • Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang
Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
1 code implementation • ICML 2020 • Xiaobo Wang, Shuo Wang, Cheng Chi, Shifeng Zhang, Tao Mei
In face recognition, designing margin-based (e. g., angular, additive, additive angular margins) softmax loss functions plays an important role in learning discriminative features.
no code implementations • Proceedings of the AAAI Conference on Artificial Intelligence 2020 • Yinglu Liu, Hailin Shi, Hao Shen, Yue Si, Xiaobo Wang, Tao Mei
The dataset is publicly accessible to the community for boosting the advance of face parsing. 1 Second, a simple yet effective Boundary-Attention Semantic Segmentation (BASS) method is proposed for face parsing, which contains a three-branch network with elaborately developed loss functions to fully exploit the boundary information.
Ranked #9 on
Face Parsing
on LaPa
no code implementations • 26 Nov 2019 • Xiaobo Wang, Shifeng Zhang, Shuo Wang, Tianyu Fu, Hailin Shi, Tao Mei
Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination.
no code implementations • ICCV 2019 • Xiaobo Wang, Shuo Wang, Jun Wang, Hailin Shi, Tao Mei
Face recognition has achieved significant progress with the growing scale of collected datasets, which empowers us to train strong convolutional neural networks (CNNs).
no code implementations • 13 May 2019 • Yinglu Liu, Hailin Shi, Yue Si, Hao Shen, Xiaobo Wang, Tao Mei
Each image is provided with accurate annotation of a 11-category pixel-level label map along with coordinates of 106-point landmarks.
no code implementations • 9 May 2019 • Yinglu Liu, Hao Shen, Yue Si, Xiaobo Wang, Xiangyu Zhu, Hailin Shi, Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi, Jun Yu, Haonian Xie, Guochen Xie, Mengyan Li, Qing Lu, Zengfu Wang, Shenqi Lai, Zhenhua Chai, Xiaoming Wei
However, previous competitions on facial landmark localization (i. e., the 300-W, 300-VW and Menpo challenges) aim to predict 68-point landmarks, which are incompetent to depict the structure of facial components.
no code implementations • 5 Apr 2019 • Jianren Wang, Yihui He, Xiaobo Wang, Xinjia Yu, Xia Chen
We introduce a prediction driven method for visual tracking and segmentation in videos.
no code implementations • 20 Jan 2019 • Shifeng Zhang, Rui Zhu, Xiaobo Wang, Hailin Shi, Tianyu Fu, Shuo Wang, Tao Mei, Stan Z. Li
With the availability of face detection benchmark WIDER FACE dataset, much of the progresses have been made by various algorithms in recent years.
4 code implementations • 29 Dec 2018 • Xiaobo Wang, Shuo Wang, Shifeng Zhang, Tianyu Fu, Hailin Shi, Tao Mei
Face recognition has witnessed significant progresses due to the advances of deep convolutional neural networks (CNNs), the central challenge of which, is feature discrimination.
Ranked #1 on
Face Identification
on Trillion Pairs Dataset
2 code implementations • CVPR 2019 • Shifeng Zhang, Xiaobo Wang, Ajian Liu, Chenxu Zhao, Jun Wan, Sergio Escalera, Hailin Shi, Zezheng Wang, Stan Z. Li
To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and visual modalities.
1 code implementation • CVPR 2019 • Rui Zhu, Shifeng Zhang, Xiaobo Wang, Longyin Wen, Hailin Shi, Liefeng Bo, Tao Mei
Taking this advantage, we are able to explore various types of networks for object detection, without suffering from the poor convergence.
no code implementations • 21 Aug 2018 • Fei Sun, Peng Jiang, Hanxiao Sun, Changhua Pei, Wenwu Ou, Xiaobo Wang
For the second constraint, we restore the key information by copying words from the knowledge encoder with the help of the soft gating mechanism.
no code implementations • 10 May 2018 • Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li
On the other hand, the learned classifier of softmax loss is weak.
no code implementations • ICCV 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
3 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
Ranked #2 on
Face Detection
on PASCAL Face
11 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.
Ranked #3 on
Face Detection
on PASCAL Face
no code implementations • CVPR 2017 • Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, Stan Z. Li
Multi-view subspace clustering aims to partition a set of multi-source data into their underlying groups.
1 code implementation • 9 May 2017 • Haibo Jin, Xiaobo Wang, Shengcai Liao, Stan Z. Li
However, to achieve this, existing deep models prefer to adopt image pairs or triplets to form verification loss, which is inefficient and unstable since the number of training pairs or triplets grows rapidly as the number of training data grows.
no code implementations • ICCV 2015 • Xiaobo Wang, Xiaojie Guo, Stan Z. Li
In this paper, we present a novel semi-supervised dictionary learning method, which uses the informative coding vectors of both labeled and unlabeled data, and adaptively emphasizes the high confidence coding vectors of unlabeled data to enhance the dictionary discriminative capability simultaneously.