Search Results for author: Shifeng Zhang

Found 39 papers, 16 papers with code

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

no code implementations • 3 Apr 2024 • Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Fei Chen, Steven McDonagh, Gerasimos Lampouras, Ignacio Iacobacci, Sarah Parisot

To build MuLAn, we developed a training free pipeline which decomposes a monocular RGB image into a stack of RGBA layers comprising of background and isolated instances.

Prompt Engineering Text-to-Image Generation

Paper
Add Code

V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster

no code implementations • 25 Mar 2024 • Si Liu, Zihan Ding, Jiahui Fu, Hongyu Li, Siheng Chen, Shifeng Zhang, Xu Zhou

The point cluster inherently preserves object information while packing messages, with weak relevance to the collaboration range, and supports explicit structure modeling.

Object

Paper
Add Code

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

no code implementations • 7 Mar 2024 • Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, MingChuan Yang

Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values.

Language Modelling Large Language Model +2

Paper
Add Code

Accelerating Diffusion Sampling with Optimized Time Steps

no code implementations • 27 Feb 2024 • Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze Xie, Zhenguo Li

While this is a significant development, most sampling methods still employ uniform time steps, which is not optimal when using a small number of steps.

Image Generation

Paper
Add Code

Optimisation-Based Multi-Modal Semantic Image Editing

no code implementations • 28 Nov 2023 • Bowen Li, Yongxin Yang, Steven McDonagh, Shifeng Zhang, Petru-Daniel Tudosiu, Sarah Parisot

Image editing affords increased control over the aesthetics and content of generated images.

Paper
Add Code

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

1 code implementation • NeurIPS 2023 • Shuchen Xue, Mingyang Yi, Weijian Luo, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhi-Ming Ma

Based on our analysis, we propose SA-Solver, which is an improved efficient stochastic Adams method for solving diffusion SDE to generate data with high quality.

Ranked #11 on Image Generation on ImageNet 512x512

Image Generation

Paper
Code

Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

1 code implementation • NeurIPS 2023 • Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang

To demonstrate the effectiveness and universality of Diff-Instruct, we consider two scenarios: distilling pre-trained diffusion models and refining existing GAN models.

Paper
Code

PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework

no code implementations • CVPR 2022 • Ning Kang, Shanzhao Qiu, Shifeng Zhang, Zhenguo Li, Shutao Xia

Generative model based image lossless compression algorithms have seen a great success in improving compression ratio.

Paper
Add Code

Split Hierarchical Variational Compression

no code implementations • CVPR 2022 • Tom Ryder, Chen Zhang, Ning Kang, Shifeng Zhang

Secondly, we define our coding framework, the autoregressive initial bits, that flexibly supports parallel coding and avoids -- for the first time -- many of the practicalities commonly associated with bits-back coding.

Image Compression

Paper
Add Code

Memory Replay with Data Compression for Continual Learning

1 code implementation • ICLR 2022 • Liyuan Wang, Xingxing Zhang, Kuo Yang, Longhui Yu, Chongxuan Li, Lanqing Hong, Shifeng Zhang, Zhenguo Li, Yi Zhong, Jun Zhu

In this work, we propose memory replay with data compression (MRDC) to reduce the storage cost of old training samples and thus increase their amount that can be stored in the memory buffer.

Autonomous Driving Class Incremental Learning +5

Paper
Code

OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression

no code implementations • NeurIPS 2021 • Chen Zhang, Shifeng Zhang, Fabio Maria Carlucci, Zhenguo Li

To eliminate the requirement of saving separate models for different target datasets, we propose a novel setting that starts from a pretrained deep generative model and compresses the data batches while adapting the model with a dynamical system for only one epoch.

Density Estimation

Paper
Add Code

iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder

no code implementations • NeurIPS 2021 • Shifeng Zhang, Ning Kang, Tom Ryder, Zhenguo Li

In this paper, we discuss lossless compression using normalizing flows which have demonstrated a great capacity for achieving high compression ratios.

Ranked #1 on Image Compression on ImageNet32

Image Compression

Paper
Add Code

Nonlinear ICA Using Volume-Preserving Transformations

no code implementations • ICLR 2022 • Xiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan

Nonlinear ICA is a fundamental problem in machine learning, aiming to identify the underlying independent components (sources) from data which is assumed to be a nonlinear function (mixing function) of these sources.

Paper
Add Code

iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression

4 code implementations • CVPR 2021 • Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li

We also propose a lossless compression algorithm based on iVPF.

Paper
Code

Understanding and Exploring the Network with Stochastic Architectures

1 code implementation • NeurIPS 2020 • Zhijie Deng, Yinpeng Dong, Shifeng Zhang, Jun Zhu

In this work, we decouple the training of a network with stochastic architectures (NSA) from NAS and provide a first systematical investigation on it as a stand-alone problem.

Neural Architecture Search

Paper
Code

Loss Function Search for Face Recognition

1 code implementation • ICML 2020 • Xiaobo Wang, Shuo Wang, Cheng Chi, Shifeng Zhang, Tao Mei

In face recognition, designing margin-based (e. g., angular, additive, additive angular margins) softmax loss functions plays an important role in learning discriminative features.

AutoML Face Recognition

Paper
Code

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

11 code implementations • CVPR 2020 • Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li

In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them.

Ranked #37 on Object Detection on COCO-O

Object object-detection +1

27,790

Paper
Code

Mis-classified Vector Guided Softmax Loss for Face Recognition

no code implementations • 26 Nov 2019 • Xiaobo Wang, Shifeng Zhang, Shuo Wang, Tianyu Fu, Hailin Shi, Tao Mei

Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination.

Face Recognition

Paper
Add Code

WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild

no code implementations • 25 Sep 2019 • Shifeng Zhang, Yiliang Xie, Jun Wan, Hansheng Xia, Stan Z. Li, Guodong Guo

To narrow this gap and facilitate future pedestrian detection research, we introduce a large and diverse dataset named WiderPerson for dense pedestrian detection in the wild.

Ranked #3 on Object Detection on WiderPerson (mMR metric)

Object Detection Pedestrian Detection

Paper
Add Code

Relational Learning for Joint Head and Human Detection

1 code implementation • 24 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Head and human detection have been rapidly improved with the development of deep convolutional neural networks.

Head Detection Human Detection +1

Paper
Code

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

no code implementations • 15 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.

Data Augmentation Occlusion Handling +2

Paper
Add Code

DARTS+: Improved Differentiable Architecture Search with Early Stopping

no code implementations • 13 Sep 2019 • Hanwen Liang, Shifeng Zhang, Jiacheng Sun, Xingqiu He, Weiran Huang, Kechen Zhuang, Zhenguo Li

Therefore, we propose a simple and effective algorithm, named "DARTS+", to avoid the collapse and improve the original DARTS, by "early stopping" the search procedure when meeting a certain criterion.

Paper
Add Code

RefineFace: Refinement Neural Network for High Performance Face Detection

no code implementations • 10 Sep 2019 • Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li

To improve the classification ability for high recall efficiency, STC first filters out most simple negatives from low level detection layers to reduce search space for subsequent classifier, then SML is applied to better distinguish faces from background at various scales and FSM is introduced to let the backbone learn more discriminative features for classification.

Classification Face Detection +3

Paper
Add Code

CASIA-SURF: A Large-scale Multi-modal Benchmark for Face Anti-spoofing

no code implementations • 28 Aug 2019 • Shifeng Zhang, Ajian Liu, Jun Wan, Yanyan Liang, Guogong Guo, Sergio Escalera, Hugo Jair Escalante, Stan Z. Li

To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and modalities.

Face Anti-Spoofing Face Recognition

Paper
Add Code

WIDER Face and Pedestrian Challenge 2018: Methods and Results

no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou

This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.

Face Detection Pedestrian Detection +2

Paper
Add Code

Pairwise Teacher-Student Network for Semi-Supervised Hashing

no code implementations • 2 Feb 2019 • Shifeng Zhang, Jianmin Li, Bo Zhang

Hashing method maps similar high-dimensional data to binary hashcodes with smaller hamming distance, and it has received broad attention due to its low storage cost and fast retrieval speed.

Retrieval

Paper
Add Code

Improved Selective Refinement Network for Face Detection

no code implementations • 20 Jan 2019 • Shifeng Zhang, Rui Zhu, Xiaobo Wang, Hailin Shi, Tianyu Fu, Shuo Wang, Tao Mei, Stan Z. Li

With the availability of face detection benchmark WIDER FACE dataset, much of the progresses have been made by various algorithms in recent years.

Data Augmentation Face Detection +1

Paper
Add Code

Support Vector Guided Softmax Loss for Face Recognition

3 code implementations • 29 Dec 2018 • Xiaobo Wang, Shuo Wang, Shifeng Zhang, Tianyu Fu, Hailin Shi, Tao Mei

Face recognition has witnessed significant progresses due to the advances of deep convolutional neural networks (CNNs), the central challenge of which, is feature discrimination.

Ranked #1 on Face Identification on Trillion Pairs Dataset

Face Recognition

185

Paper
Code

A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing

4 code implementations • CVPR 2019 • Shifeng Zhang, Xiaobo Wang, Ajian Liu, Chenxu Zhao, Jun Wan, Sergio Escalera, Hailin Shi, Zezheng Wang, Stan Z. Li

Face Anti-Spoofing Face Recognition

908

Paper
Code

ScratchDet: Training Single-Shot Object Detectors from Scratch

1 code implementation • CVPR 2019 • Rui Zhu, Shifeng Zhang, Xiaobo Wang, Longyin Wen, Hailin Shi, Liefeng Bo, Tao Mei

Taking this advantage, we are able to explore various types of networks for object detection, without suffering from the poor convergence.

General Classification Object +2

229

Paper
Code

Selective Refinement Network for High Performance Face Detection

3 code implementations • 7 Sep 2018 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.

Ranked #1 on Face Detection on PASCAL Face

Face Detection General Classification +2

273

Paper
Code

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

no code implementations • ECCV 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other.

Ranked #10 on Pedestrian Detection on Caltech (using extra training data)

Pedestrian Detection

Paper
Add Code

Semantic Cluster Unary Loss for Efficient Deep Hashing

1 code implementation • 15 May 2018 • Shifeng Zhang, Jianmin Li, Bo Zhang

The resultant hashcodes form several compact clusters, which means hashcodes in the same cluster have similar semantic information.

Deep Hashing Information Retrieval

Paper
Code

Ensemble Soft-Margin Softmax Loss for Image Classification

no code implementations • 10 May 2018 • Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li

On the other hand, the learned classifier of softmax loss is weak.

Classification General Classification +1

Paper
Add Code

Single-Shot Refinement Neural Network for Object Detection

12 code implementations • CVPR 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

For object detection, the two-stage approach (e. g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e. g., SSD) has the advantage of high efficiency.

Ranked #164 on Object Detection on COCO test-dev

Object object-detection +1

1,437

Paper
Code

S3FD: Single Shot Scale-Invariant Face Detector

no code implementations • ICCV 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

Paper
Add Code

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

10 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.

Ranked #3 on Face Detection on PASCAL Face

Face Detection Vocal Bursts Intensity Prediction

840

Paper
Code

S$^3$FD: Single Shot Scale-invariant Face Detector

3 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Ranked #2 on Face Detection on PASCAL Face

Face Detection

520

Paper
Code

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization

no code implementations • 28 Sep 2016 • Shifeng Zhang, Jianmin Li, Jinma Guo, Bo Zhang

Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed.

Clustering Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.