no code implementations • 25 Jun 2024 • Zhuoyuan Li, Yubo Ai, Jiahao Lu, Chuxin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang
Transformers have demonstrated impressive results for 3D point cloud semantic segmentation.
no code implementations • CVPR 2024 • Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Fei Chen, Steven McDonagh, Gerasimos Lampouras, Ignacio Iacobacci, Sarah Parisot
To build MuLAn, we developed a training free pipeline which decomposes a monocular RGB image into a stack of RGBA layers comprising of background and isolated instances.
no code implementations • 25 Mar 2024 • Si Liu, Zihan Ding, Jiahui Fu, Hongyu Li, Siheng Chen, Shifeng Zhang, Xu Zhou
The point cluster inherently preserves object information while packing messages, with weak relevance to the collaboration range, and supports explicit structure modeling.
no code implementations • 7 Mar 2024 • Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, MingChuan Yang
Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values.
1 code implementation • CVPR 2024 • Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze Xie, Zhenguo Li
While this is a significant development, most sampling methods still employ uniform time steps, which is not optimal when using a small number of steps.
no code implementations • 28 Nov 2023 • Bowen Li, Yongxin Yang, Steven McDonagh, Shifeng Zhang, Petru-Daniel Tudosiu, Sarah Parisot
Image editing affords increased control over the aesthetics and content of generated images.
1 code implementation • NeurIPS 2023 • Shuchen Xue, Mingyang Yi, Weijian Luo, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhi-Ming Ma
Based on our analysis, we propose SA-Solver, which is an improved efficient stochastic Adams method for solving diffusion SDE to generate data with high quality.
Ranked #19 on Image Generation on ImageNet 512x512
1 code implementation • NeurIPS 2023 • Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang
To demonstrate the effectiveness and universality of Diff-Instruct, we consider two scenarios: distilling pre-trained diffusion models and refining existing GAN models.
no code implementations • CVPR 2022 • Ning Kang, Shanzhao Qiu, Shifeng Zhang, Zhenguo Li, Shutao Xia
Generative model based image lossless compression algorithms have seen a great success in improving compression ratio.
no code implementations • CVPR 2022 • Tom Ryder, Chen Zhang, Ning Kang, Shifeng Zhang
Secondly, we define our coding framework, the autoregressive initial bits, that flexibly supports parallel coding and avoids -- for the first time -- many of the practicalities commonly associated with bits-back coding.
1 code implementation • ICLR 2022 • Liyuan Wang, Xingxing Zhang, Kuo Yang, Longhui Yu, Chongxuan Li, Lanqing Hong, Shifeng Zhang, Zhenguo Li, Yi Zhong, Jun Zhu
In this work, we propose memory replay with data compression (MRDC) to reduce the storage cost of old training samples and thus increase their amount that can be stored in the memory buffer.
no code implementations • NeurIPS 2021 • Chen Zhang, Shifeng Zhang, Fabio Maria Carlucci, Zhenguo Li
To eliminate the requirement of saving separate models for different target datasets, we propose a novel setting that starts from a pretrained deep generative model and compresses the data batches while adapting the model with a dynamical system for only one epoch.
no code implementations • NeurIPS 2021 • Shifeng Zhang, Ning Kang, Tom Ryder, Zhenguo Li
In this paper, we discuss lossless compression using normalizing flows which have demonstrated a great capacity for achieving high compression ratios.
Ranked #1 on Image Compression on ImageNet32
no code implementations • ICLR 2022 • Xiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan
Nonlinear ICA is a fundamental problem in machine learning, aiming to identify the underlying independent components (sources) from data which is assumed to be a nonlinear function (mixing function) of these sources.
4 code implementations • CVPR 2021 • Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li
We also propose a lossless compression algorithm based on iVPF.
1 code implementation • NeurIPS 2020 • Zhijie Deng, Yinpeng Dong, Shifeng Zhang, Jun Zhu
In this work, we decouple the training of a network with stochastic architectures (NSA) from NAS and provide a first systematical investigation on it as a stand-alone problem.
1 code implementation • ICML 2020 • Xiaobo Wang, Shuo Wang, Cheng Chi, Shifeng Zhang, Tao Mei
In face recognition, designing margin-based (e. g., angular, additive, additive angular margins) softmax loss functions plays an important role in learning discriminative features.
11 code implementations • CVPR 2020 • Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li
In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them.
Ranked #37 on Object Detection on COCO-O
no code implementations • 26 Nov 2019 • Xiaobo Wang, Shifeng Zhang, Shuo Wang, Tianyu Fu, Hailin Shi, Tao Mei
Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination.
no code implementations • 25 Sep 2019 • Shifeng Zhang, Yiliang Xie, Jun Wan, Hansheng Xia, Stan Z. Li, Guodong Guo
To narrow this gap and facilitate future pedestrian detection research, we introduce a large and diverse dataset named WiderPerson for dense pedestrian detection in the wild.
Ranked #3 on Object Detection on WiderPerson (mMR metric)
1 code implementation • 24 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou
Head and human detection have been rapidly improved with the development of deep convolutional neural networks.
no code implementations • 15 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou
Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.
no code implementations • 13 Sep 2019 • Hanwen Liang, Shifeng Zhang, Jiacheng Sun, Xingqiu He, Weiran Huang, Kechen Zhuang, Zhenguo Li
Therefore, we propose a simple and effective algorithm, named "DARTS+", to avoid the collapse and improve the original DARTS, by "early stopping" the search procedure when meeting a certain criterion.
no code implementations • 10 Sep 2019 • Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li
To improve the classification ability for high recall efficiency, STC first filters out most simple negatives from low level detection layers to reduce search space for subsequent classifier, then SML is applied to better distinguish faces from background at various scales and FSM is introduced to let the backbone learn more discriminative features for classification.
no code implementations • 28 Aug 2019 • Shifeng Zhang, Ajian Liu, Jun Wan, Yanyan Liang, Guogong Guo, Sergio Escalera, Hugo Jair Escalante, Stan Z. Li
To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and modalities.
no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou
This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.
no code implementations • 2 Feb 2019 • Shifeng Zhang, Jianmin Li, Bo Zhang
Hashing method maps similar high-dimensional data to binary hashcodes with smaller hamming distance, and it has received broad attention due to its low storage cost and fast retrieval speed.
no code implementations • 20 Jan 2019 • Shifeng Zhang, Rui Zhu, Xiaobo Wang, Hailin Shi, Tianyu Fu, Shuo Wang, Tao Mei, Stan Z. Li
With the availability of face detection benchmark WIDER FACE dataset, much of the progresses have been made by various algorithms in recent years.
3 code implementations • 29 Dec 2018 • Xiaobo Wang, Shuo Wang, Shifeng Zhang, Tianyu Fu, Hailin Shi, Tao Mei
Face recognition has witnessed significant progresses due to the advances of deep convolutional neural networks (CNNs), the central challenge of which, is feature discrimination.
Ranked #1 on Face Identification on Trillion Pairs Dataset
2 code implementations • CVPR 2019 • Shifeng Zhang, Xiaobo Wang, Ajian Liu, Chenxu Zhao, Jun Wan, Sergio Escalera, Hailin Shi, Zezheng Wang, Stan Z. Li
To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and visual modalities.
1 code implementation • CVPR 2019 • Rui Zhu, Shifeng Zhang, Xiaobo Wang, Longyin Wen, Hailin Shi, Liefeng Bo, Tao Mei
Taking this advantage, we are able to explore various types of networks for object detection, without suffering from the poor convergence.
3 code implementations • 7 Sep 2018 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou
In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.
Ranked #1 on Face Detection on PASCAL Face
no code implementations • ECCV 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li
Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other.
Ranked #10 on Pedestrian Detection on Caltech (using extra training data)
1 code implementation • 15 May 2018 • Shifeng Zhang, Jianmin Li, Bo Zhang
The resultant hashcodes form several compact clusters, which means hashcodes in the same cluster have similar semantic information.
no code implementations • 10 May 2018 • Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li
On the other hand, the learned classifier of softmax loss is weak.
12 code implementations • CVPR 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li
For object detection, the two-stage approach (e. g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e. g., SSD) has the advantage of high efficiency.
Ranked #175 on Object Detection on COCO test-dev
no code implementations • ICCV 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
10 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.
Ranked #3 on Face Detection on PASCAL Face
3 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
Ranked #2 on Face Detection on PASCAL Face
no code implementations • 28 Sep 2016 • Shifeng Zhang, Jianmin Li, Jinma Guo, Bo Zhang
Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed.