Search Results for author: Chengjie Wang

Found 142 papers, 76 papers with code

SSCGAN: Facial Attribute Editing via Style Skip Connections

no code implementations • ECCV 2020 • Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

Each connection extracts the style feature of the latent feature maps in the encoder and then performs a residual learning based mapping function in the global information space guided by the target attributes.

Attribute Decoder +1

Paper
Add Code

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

no code implementations • 21 May 2024 • Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong liu

We observe that both face reenactment/swapping tasks essentially involve combinations of target structure, ID and attribute.

Paper
Add Code

Efficient Multimodal Large Language Models: A Survey

1 code implementation • 17 May 2024 • Yizhang Jin, Jian Li, Yexin Liu, Tianjun Gu, Kai Wu, Zhengkai Jiang, Muyang He, Bo Zhao, Xin Tan, Zhenye Gan, Yabiao Wang, Chengjie Wang, Lizhuang Ma

In the past year, Multimodal Large Language Models (MLLMs) have demonstrated remarkable performance in tasks such as visual question answering, visual understanding and reasoning.

Edge-computing Question Answering +1

Paper
Code

Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection

no code implementations • 17 Apr 2024 • Qiangang Du, Jinlong Peng, Changan Wang, Xu Chen, Qingdong He, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

Next, a shape-aware and a brightness-aware module are designed to improve the capacity for representation learning.

Change Detection Denoising +1

Paper
Add Code

Single-temporal Supervised Remote Change Detection for Domain Generalization

no code implementations • 17 Apr 2024 • Qiangang Du, Jinlong Peng, Xu Chen, Qingdong He, Liren He, Qiang Nie, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

In this paper, we propose a multimodal contrastive learning (ChangeCLIP) based on visual-language pre-training for change detection domain generalization.

Change Detection Contrastive Learning +1

Paper
Add Code

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

1 code implementation • 16 Apr 2024 • Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong liu, Guansong Pang, DaCheng Tao

Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods.

Anomaly Detection object-detection +2

Paper
Code

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

1 code implementation • 9 Apr 2024 • Haoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie

Recent advancements in anomaly detection have seen the efficacy of CNN- and transformer-based approaches.

Decoder Long-range modeling +1

Paper
Code

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation

no code implementations • 26 Mar 2024 • Qilin Wang, Jiangning Zhang, Chengming Xu, Weijian Cao, Ying Tai, Yue Han, Yanhao Ge, Hong Gu, Chengjie Wang, Yanwei Fu

Facial Appearance Editing (FAE) aims to modify physical attributes, such as pose, expression and lighting, of human facial images while preserving attributes like identity and background, showing great importance in photograph.

Attribute Semantic Composition

Paper
Add Code

Deepfake Generation and Detection: A Benchmark and Survey

1 code implementation • 26 Mar 2024 • Gan Pei, Jiangning Zhang, Menghan Hu, Zhenyu Zhang, Chengjie Wang, Yunsheng Wu, Guangtao Zhai, Jian Yang, Chunhua Shen, DaCheng Tao

Deepfake is a technology dedicated to creating highly realistic facial images and videos under specific conditions, which has significant application potential in fields such as entertainment, movie production, digital human creation, to name a few.

Attribute Face Reenactment +2

105

Paper
Code

SoftPatch: Unsupervised Anomaly Detection with Noisy Data

1 code implementation • NeurIPS 2022 • Xi Jiang, Ying Chen, Qiang Nie, Yong liu, Jianlin Liu, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng

Noise discriminators are utilized to generate outlier scores for patch-level noise elimination before coreset construction.

Unsupervised Anomaly Detection

Paper
Code

Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference

no code implementations • 21 Mar 2024 • Xi Jiang, Ying Chen, Qiang Nie, Jianlin Liu, Yong liu, Chengjie Wang, Feng Zheng

To address this issue, we introduce a Multi-class Implicit Neural representation Transformer for unified Anomaly Detection (MINT-AD), which leverages the fine-grained category information in the training stage.

Anomaly Detection Decoder

Paper
Add Code

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image

no code implementations • 20 Mar 2024 • Shijie Zhang, Boyan Jiang, Keke He, Junwei Zhu, Ying Tai, Chengjie Wang, yinda zhang, Yanwei Fu

Pixel2Mesh (P2M) is a classical approach for reconstructing 3D shapes from a single color image through coarse-to-fine mesh deformation.

3D Reconstruction Single-View 3D Reconstruction

Paper
Add Code

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

1 code implementation • 19 Mar 2024 • Chengjie Wang, Wenbing Zhu, Bin-Bin Gao, Zhenye Gan, Jianning Zhang, Zhihao Gu, Shuguang Qian, Mingang Chen, Lizhuang Ma

Finally, we report the results of popular IAD methods on the Real-IAD dataset, providing a highly challenging benchmark to promote the development of the IAD field.

Benchmarking Unsupervised Anomaly Detection

Paper
Code

DMAD: Dual Memory Bank for Real-World Anomaly Detection

no code implementations • 19 Mar 2024 • Jianlong Hu, Xu Chen, Zhenye Gan, Jinlong Peng, Shengchuan Zhang, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Liujuan Cao, Rongrong Ji

To address the challenge of real-world anomaly detection, we propose a new framework named Dual Memory bank enhanced representation learning for Anomaly Detection (DMAD).

Anomaly Detection Representation Learning

Paper
Add Code

HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching

no code implementations • 19 Mar 2024 • Ying Chen, Yong liu, Kai Wu, Qiang Nie, Shang Xu, Huifang Ma, Bing Wang, Chengjie Wang

Deep learning-based image matching methods play a crucial role in computer vision, yet they often suffer from substantial computational demands.

Paper
Add Code

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

no code implementations • 19 Mar 2024 • Yufei Liu, Junwei Zhu, Junshu Tang, Shijie Zhang, Jiangning Zhang, Weijian Cao, Chengjie Wang, Yunsheng Wu, Dongjin Huang

Texturing 3D humans with semantic UV maps remains a challenge due to the difficulty of acquiring reasonably unfolded UV.

Text to 3D Texture Synthesis

Paper
Add Code

Tuning-Free Image Customization with Image and Text Guidance

no code implementations • 19 Mar 2024 • Pengzhi Li, Qiang Nie, Ying Chen, Xi Jiang, Kai Wu, Yuhuan Lin, Yong liu, Jinlong Peng, Chengjie Wang, Feng Zheng

To our knowledge, this is the first tuning-free method that concurrently utilizes text and image guidance for image customization in specific regions.

Decoder Denoising +1

Paper
Add Code

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

no code implementations • 18 Mar 2024 • Liren He, Zhengkai Jiang, Jinlong Peng, Liang Liu, Qiangang Du, Xiaobin Hu, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

In the field of multi-class anomaly detection, reconstruction-based methods derived from single-class anomaly detection face the well-known challenge of ``learning shortcuts'', wherein the model fails to learn the patterns of normal samples as it should, opting instead for shortcuts such as identity mapping or artificial noise elimination.

Anomaly Detection

Paper
Add Code

PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models

no code implementations • 11 Mar 2024 • Qingdong He, Jinlong Peng, Zhengkai Jiang, Xiaobin Hu, Jiangning Zhang, Qiang Nie, Yabiao Wang, Chengjie Wang

On top of that, PointSeg can incorporate with various segmentation models and even surpasses the supervised methods.

Scene Segmentation

Paper
Add Code

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

no code implementations • 10 Mar 2024 • Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji

Our DiffuMatting shows several potential applications (e. g., matting-data generator, community-friendly art design and controllable generation).

Image Matting Object

Paper
Add Code

Dual-path Frequency Discriminators for Few-shot Anomaly Detection

no code implementations • 7 Mar 2024 • Yuhu Bai, Jiangning Zhang, Yuhang Dong, Guanzhong Tian, Liang Liu, Yunkang Cao, Yabiao Wang, Chengjie Wang

We consider anomaly detection as a discriminative classification problem, wherefore the dual-path feature discrimination module is employed to detect and locate the image-level and feature-level anomalies in the feature space.

Anomaly Detection

Paper
Add Code

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

1 code implementation • 7 Mar 2024 • Jialin Li, Qiang Nie, WeiFu Fu, Yuhuan Lin, Guangpin Tao, Yong liu, Chengjie Wang

Deep learning models, particularly those based on transformers, often employ numerous stacked structures, which possess identical architectures and perform similar functions.

Decoder

Paper
Code

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

no code implementations • 19 Feb 2024 • Xuelin Qian, Yu Wang, Simian Luo, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue, Bo Zhao, Tiejun Huang, Yunsheng Wu, Yanwei Fu

In this paper, we extend auto-regressive models to 3D domains, and seek a stronger ability of 3D shape generation by improving auto-regressive models at capacity and scalability simultaneously.

3D Generation 3D Shape Generation +1

Paper
Add Code

UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation

1 code implementation • 21 Jan 2024 • Qingdong He, Jinlong Peng, Zhengkai Jiang, Kai Wu, Xiaozhong Ji, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Mingang Chen, Yunsheng Wu

3D open-vocabulary scene understanding aims to recognize arbitrary novel categories beyond the base label space.

Instance Segmentation Scene Understanding +1

Paper
Code

Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection

no code implementations • 6 Jan 2024 • Yuanpeng Tu, Boshen Zhang, Liang Liu, Yuxi Li, Xuhai Chen, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhao

Industrial anomaly detection is generally addressed as an unsupervised task that aims at locating defects with only normal training samples.

Anomaly Detection

Paper
Add Code

Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt

1 code implementation • 2 Jan 2024 • Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong liu, Jinbao Wang, Chengjie Wang, Feng Zheng

Unsupervised Anomaly Detection (UAD) with incremental training is crucial in industrial manufacturing, as unpredictable defects make obtaining sufficient labeled data infeasible.

continual anomaly detection Continual Learning +2

Paper
Code

A Generalist FaceX via Learning Unified Facial Representation

1 code implementation • 31 Dec 2023 • Yue Han, Jiangning Zhang, Junwei Zhu, Xiangtai Li, Yanhao Ge, Wei Li, Chengjie Wang, Yong liu, Xiaoming Liu, Ying Tai

This work presents FaceX framework, a novel facial generalist model capable of handling diverse facial tasks simultaneously.

Facial Editing

Paper
Code

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

1 code implementation • 19 Dec 2023 • Yanqi Ge, Qiang Nie, Ye Huang, Yong liu, Chengjie Wang, Feng Zheng, Wen Li, Lixin Duan

By pulling the learned features to these semantic anchors, several advantages can be attained: 1) the intra-class compactness and naturally inter-class separability, 2) induced bias or errors from feature learning can be avoided, and 3) robustness to the long-tailed problem.

Disentanglement

Paper
Code

MatchDet: A Collaborative Framework for Image Matching and Object Detection

no code implementations • 18 Dec 2023 • Jinxiang Lai, Wenlong Wu, Bin-Bin Gao, Jun Liu, Jiawei Zhan, Congchong Nie, Yi Zeng, Chengjie Wang

Image matching and object detection are two fundamental and challenging tasks, while many related applications consider them two individual tasks (i. e. task-individual).

object-detection Object Detection

Paper
Add Code

Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection

1 code implementation • 12 Dec 2023 • Jiangning Zhang, Xuhai Chen, Yabiao Wang, Chengjie Wang, Yong liu, Xiangtai Li, Ming-Hsuan Yang, DaCheng Tao

Following this spirit, this paper explores plain ViT architecture for MUAD.

Unsupervised Anomaly Detection

Paper
Code

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

no code implementations • 11 Dec 2023 • Xu Peng, Junwei Zhu, Boyuan Jiang, Ying Tai, Donghao Luo, Jiangning Zhang, Wei Lin, Taisong Jin, Chengjie Wang, Rongrong Ji

Moreover, these methods often grapple with identity distortion and limited expression diversity.

Face Recognition Text-to-Image Generation

Paper
Add Code

DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection

1 code implementation • 11 Dec 2023 • Haoyang He, Jiangning Zhang, Hongxu Chen, Xuhai Chen, Zhishan Li, Xu Chen, Yabiao Wang, Chengjie Wang, Lei Xie

Reconstruction-based approaches have achieved remarkable outcomes in anomaly detection.

Anomaly Detection Denoising +1

Paper
Code

AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model

1 code implementation • 10 Dec 2023 • Teng Hu, Jiangning Zhang, Ran Yi, Yuzhen Du, Xu Chen, Liang Liu, Yabiao Wang, Chengjie Wang

Existing anomaly inspection methods are limited in their performance due to insufficient anomaly data.

Image Generation

Paper
Code

GPT-4V-AD: Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection

1 code implementation • 5 Nov 2023 • Jiangning Zhang, Haoyang He, Xuhai Chen, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong liu

Large Multimodal Model (LMM) GPT-4V(ision) endows GPT-4 with visual grounding capabilities, making it possible to handle certain tasks through the Visual Question Answering (VQA) paradigm.

Anomaly Detection Question Answering +3

Paper
Code

CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection

no code implementations • 1 Nov 2023 • Xuhai Chen, Jiangning Zhang, Guanzhong Tian, Haoyang He, Wuhao Zhang, Yabiao Wang, Chengjie Wang, Yong liu

This paper considers zero-shot Anomaly Detection (AD), performing AD without reference images of the test objects.

Anomaly Detection Language Modelling +2

Paper
Add Code

Real3D-AD: A Dataset of Point Cloud Anomaly Detection

1 code implementation • NeurIPS 2023 • Jiaqi Liu, Guoyang Xie, Ruitao Chen, Xinpeng Li, Jinbao Wang, Yong liu, Chengjie Wang, Feng Zheng

High-precision point cloud anomaly detection is the gold standard for identifying the defects of advancing machining and precision manufacturing.

3D Anomaly Detection

Paper
Code

Dynamic Frame Interpolation in Wavelet Domain

1 code implementation • 7 Sep 2023 • Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Ying Tai, Chengjie Wang, Jie Yang

Video frame interpolation is an important low-level vision task, which can increase frame rate for more fluent visual experience.

Optical Flow Estimation Video Frame Interpolation

Paper
Code

Toward High Quality Facial Representation Learning

1 code implementation • 7 Sep 2023 • Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang

To improve the facial representation quality, we use feature map of a pre-trained visual backbone as a supervision item and use a partially pre-trained decoder for mask image modeling.

Contrastive Learning Decoder +3

Paper
Code

Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region

2 code implementations • 7 Sep 2023 • Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To solve the problem, we propose Compositional Neural Painter, a novel stroke-based rendering framework which dynamically predicts the next painting region based on the current canvas, instead of dividing the image plane uniformly into painting regions.

Style Transfer

Paper
Code

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

1 code implementation • ICCV 2023 • Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To address these problems, we propose a novel phasic content fusing few-shot diffusion model with directional distribution consistency loss, which targets different learning objectives at distinct training stages of the diffusion model.

Domain Adaptation

Paper
Code

IIDM: Inter and Intra-domain Mixing for Semi-supervised Domain Adaptation in Semantic Segmentation

no code implementations • 30 Aug 2023 • WeiFu Fu, Qiang Nie, Jialin Li, Yuhuan Lin, Kai Wu, Jian Li, Yabiao Wang, Yong liu, Chengjie Wang

In this paper, we highlight the significance of exploiting the intra-domain information between the labeled target data and unlabeled target data.

Semantic Segmentation Semi-supervised Domain Adaptation +1

Paper
Add Code

PVG: Progressive Vision Graph for Vision Recognition

no code implementations • 1 Aug 2023 • Jiafu Wu, Jian Li, Jiangning Zhang, Boshen Zhang, Mingmin Chi, Yabiao Wang, Chengjie Wang

Convolution-based and Transformer-based vision backbone networks process images into the grid or sequence structures, respectively, which are inflexible for capturing irregular objects.

graph construction

Paper
Add Code

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

1 code implementation • ICCV 2023 • Junjie Fei, Teng Wang, Jinrui Zhang, Zhenyu He, Chengjie Wang, Feng Zheng

In this paper, we propose ViECap, a transferable decoding model that leverages entity-aware decoding to generate descriptions in both seen and unseen scenarios.

Caption Generation Hallucination +2

134

Paper
Code

RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation

1 code implementation • 12 Jul 2023 • Ke Fan, Changan Wang, Yabiao Wang, Chengjie Wang, Ran Yi, Lizhuang Ma

Glass-like objects are widespread in daily life but remain intractable to be segmented for most existing methods.

Semantic Segmentation

Paper
Code

Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection

1 code implementation • 30 May 2023 • Supeng Wang, Yuxi Li, Ming Xie, Mingmin Chi, Yabiao Wang, Chengjie Wang, Wenbing Zhu

In this paper, we revisit the importance of feature difference for change detection in RSI, and propose a series of operations to fully exploit the difference information: Alignment, Perturbation and Decoupling (APD).

Change Detection Decoder

Paper
Code

Dual Path Transformer with Partition Attention

no code implementations • 24 May 2023 • Zhengkai Jiang, Liang Liu, Jiangning Zhang, Yabiao Wang, Mingang Chen, Chengjie Wang

This paper introduces a novel attention mechanism, called dual attention, which is both efficient and effective.

Image Classification object-detection +2

Paper
Add Code

Learning Global-aware Kernel for Image Harmonization

no code implementations • ICCV 2023 • Xintian Shen, Jiangning Zhang, Jun Chen, Shipeng Bai, Yue Han, Yabiao Wang, Chengjie Wang, Yong liu

To address this issue, we propose a novel Global-aware Kernel Network (GKNet) to harmonize local regions with comprehensive consideration of long-distance background references.

Ranked #5 on Image Harmonization on iHarmony4

Image Harmonization

Paper
Add Code

High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning

no code implementations • CVPR 2023 • Chao Xu, Junwei Zhu, Jiangning Zhang, Yue Han, Wenqing Chu, Ying Tai, Chengjie Wang, Zhifeng Xie, Yong liu

Specifically, we supplement the emotion style in text prompts and use an Aligned Multi-modal Emotion encoder to embed the text, image, and audio emotion modality into a unified space, which inherits rich semantic prior from CLIP.

Talking Face Generation

Paper
Add Code

Clustered-patch Element Connection for Few-shot Learning

no code implementations • 20 Apr 2023 • Jinxiang Lai, Siqian Yang, JunHong Zhou, Wenlong Wu, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Chengjie Wang

According to this, we propose a novel Clustered-patch Element Connection (CEC) layer to correct the mismatch problem.

Ranked #48 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

Paper
Add Code

NeRF-Loc: Visual Localization with Conditional Neural Radiance Field

1 code implementation • 17 Apr 2023 • Jianlin Liu, Qiang Nie, Yong liu, Chengjie Wang

We propose a novel visual re-localization method based on direct matching between the implicit 3D descriptors and the 2D image with transformer.

Neural Rendering Visual Localization

Paper
Code

Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution

1 code implementation • CVPR 2023 • Xuhai Chen, Jiangning Zhang, Chao Xu, Yabiao Wang, Chengjie Wang, Yong liu

Most of the existing blind image Super-Resolution (SR) methods assume that the blur kernels are space-invariant.

Image Super-Resolution SSIM

Paper
Code

Learning Versatile 3D Shape Generation with Improved AR Models

no code implementations • 26 Mar 2023 • Simian Luo, Xuelin Qian, Yanwei Fu, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue

Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space.

3D Shape Generation Image Generation +1

Paper
Add Code

MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection

1 code implementation • CVPR 2023 • Liang Liu, Boshen Zhang, Jiangning Zhang, Wuhao Zhang, Zhenye Gan, Guanzhong Tian, Wenbing Zhu, Yabiao Wang, Chengjie Wang

Despite the remarkable progress made by modern detection models, this challenge is particularly evident in the semi-supervised case.

Ranked #3 on Semi-Supervised Object Detection on COCO 2% labeled data

Object object-detection +3

Paper
Code

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning

1 code implementation • 15 Mar 2023 • Jinxiang Lai, Siqian Yang, Wenlong Wu, Tao Wu, Guannan Jiang, Xi Wang, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang

Then we derive two specific attention modules, named SpatialFormer Semantic Attention (SFSA) and SpatialFormer Target Attention (SFTA), to enhance the target object regions while reduce the background distraction.

Few-Shot Learning

Paper
Code

Calibrated Teacher for Sparsely Annotated Object Detection

1 code implementation • 14 Mar 2023 • Haohan Wang, Liang Liu, Boshen Zhang, Jiangning Zhang, Wuhao Zhang, Zhenye Gan, Yabiao Wang, Chengjie Wang, Haoqian Wang

Recent works on sparsely annotated object detection alleviate this problem by generating pseudo labels for the missing annotations.

Object object-detection +2

Paper
Code

Iterative Few-shot Semantic Segmentation from Image Label Text

1 code implementation • 10 Mar 2023 • Haohan Wang, Liang Liu, Wuhao Zhang, Jiangning Zhang, Zhenye Gan, Yabiao Wang, Chengjie Wang, Haoqian Wang

Few-shot semantic segmentation aims to learn to segment unseen class objects with the guidance of only a few support images.

Ranked #41 on Few-Shot Semantic Segmentation on COCO-20i (1-shot)

Few-Shot Semantic Segmentation Language Modelling +1

Paper
Code

Multimodal Industrial Anomaly Detection via Hybrid Fusion

1 code implementation • CVPR 2023 • Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Yabiao Wang, Chengjie Wang

2D-based Industrial Anomaly Detection has been widely discussed, however, multimodal industrial anomaly detection based on 3D point clouds and RGB images still has many untouched fields.

Ranked #3 on RGB+3D Anomaly Detection and Segmentation on MVTEC 3D-AD (using extra training data)

Contrastive Learning RGB+3D Anomaly Detection and Segmentation

123

Paper
Code

Learning with Noisy labels via Self-supervised Adversarial Noisy Masking

1 code implementation • CVPR 2023 • Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhao

Collecting large-scale datasets is crucial for training deep models, annotating the data, however, inevitably yields noisy labels, which poses challenges to deep learning algorithms.

Ranked #2 on Image Classification on Clothing1M (using extra training data)

Learning with noisy labels

Paper
Code

Learning from Noisy Labels with Decoupled Meta Label Purifier

1 code implementation • CVPR 2023 • Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Yabiao Wang, Chengjie Wang, Cai Rong Zhao

Training deep neural networks(DNN) with noisy labels is challenging since DNN can easily memorize inaccurate labels, leading to poor generalization ability.

Ranked #5 on Image Classification on Clothing1M (using clean data)

Image Classification Meta-Learning +1

Paper
Code

IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing

2 code implementations • 31 Jan 2023 • Guoyang Xie, Jinbao Wang, Jiaqi Liu, Jiayi Lyu, Yong liu, Chengjie Wang, Feng Zheng, Yaochu Jin

We realize that the lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications.

Anomaly Detection Continual Learning +1

Paper
Code

Deep Industrial Image Anomaly Detection: A Survey

1 code implementation • 27 Jan 2023 • Jiaqi Liu, Guoyang Xie, Jinbao Wang, Shangnian Li, Chengjie Wang, Feng Zheng, Yaochu Jin

In this paper, we provide a comprehensive review of deep learning-based image anomaly detection techniques, from the perspectives of neural network architectures, levels of supervision, loss functions, metrics and datasets.

Anomaly Detection

1,047

Paper
Code

Rethinking Mobile Block for Efficient Attention-based Models

1 code implementation • ICCV 2023 • Jiangning Zhang, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang, Chengjie Wang

This paper focuses on developing modern, efficient, lightweight models for dense predictions while trading off parameters, FLOPs, and performance.

Unity

216

Paper
Code

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

1 code implementation • 3 Jan 2023 • Yue Han, Jiangning Zhang, Zhucun Xue, Chao Xu, Xintian Shen, Yabiao Wang, Chengjie Wang, Yong liu, Xiangtai Li

In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support/query features based on a Transformer-like framework.

Benchmarking Few-Shot Object Detection +3

Paper
Code

Multi-Centroid Task Descriptor for Dynamic Class Incremental Inference

no code implementations • CVPR 2023 • Tenghao Cai, Zhizhong Zhang, Xin Tan, Yanyun Qu, Guannan Jiang, Chengjie Wang, Yuan Xie

As a result, our dynamic inference network is trained independently of baseline and provides a flexible, efficient solution to distinguish between tasks.

Class Incremental Learning Incremental Learning

Paper
Add Code

Instance and Category Supervision are Alternate Learners for Continual Learning

no code implementations • ICCV 2023 • Xudong Tian, Zhizhong Zhang, Xin Tan, Jun Liu, Chengjie Wang, Yanyun Qu, Guannan Jiang, Yuan Xie

Continual Learning (CL) is the constant development of complex behaviors by building upon previously acquired skills.

Continual Learning Self-Supervised Learning

Paper
Add Code

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models

no code implementations • ICCV 2023 • Simian Luo, Xuelin Qian, Yanwei Fu, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue

Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space.

3D Shape Generation Image Generation +1

Paper
Add Code

Learning To Measure the Point Cloud Reconstruction Loss in a Representation Space

no code implementations • CVPR 2023 • Tianxin Huang, Zhonggan Ding, Jiangning Zhang, Ying Tai, Zhenyu Zhang, Mingang Chen, Chengjie Wang, Yong liu

Specifically, we use the contrastive constraint to help CALoss learn a representation space with shape similarity, while we introduce the adversarial strategy to help CALoss mine differences between reconstructed results and ground truths.

Point cloud reconstruction

Paper
Add Code

Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild

no code implementations • CVPR 2023 • Zhenyu Zhang, Renwang Chen, Weijian Cao, Ying Tai, Chengjie Wang

To address this problem, this paper presents a novel Neural Proto-face Field (NPF) for unsupervised robust 3D face modeling.

Paper
Add Code

Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection

2 code implementations • ICCV 2023 • Zhihao Gu, Liang Liu, Xu Chen, Ran Yi, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Annan Shu, Guannan Jiang, Lizhuang Ma

Specifically, we first propose a normality recall memory (NR Memory) to strengthen the normality of student-generated features by recalling the stored normal information.

Ranked #11 on Anomaly Detection on MVTec AD

Knowledge Distillation Unsupervised Anomaly Detection

Paper
Code

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

no code implementations • 29 Nov 2022 • Chengming Xu, Chen Liu, Xinwei Sun, Siqian Yang, Yabiao Wang, Chengjie Wang, Yanwei Fu

We theoretically show that such an augmentation mechanism, different from existing ones, is able to identify the causal features.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision

no code implementations • 23 Nov 2022 • Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao, Tianliang Zhang, Wenlong Wu, Wei zhang, Chengjie Wang, Yuan Xie

This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning.

Feature Correlation Multi-Label Image Classification

Paper
Add Code

Delving into Transformer for Incremental Semantic Segmentation

no code implementations • 18 Nov 2022 • Zekai Xu, Mingyi Zhang, Jiayue Hou, Xing Gong, Chuan Wen, Chengjie Wang, Junge Zhang

In contrast, a Transformer based method has a natural advantage in curbing catastrophic forgetting due to its ability to model both long-term and short-term tasks.

Segmentation Semantic Segmentation

Paper
Add Code

Rethinking the Metric in Few-shot Learning: From an Adaptive Multi-Distance Perspective

no code implementations • 2 Nov 2022 • Jinxiang Lai, Siqian Yang, Guannan Jiang, Xi Wang, Yuxi Li, Zihui Jia, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang

In this paper, for the first time, we investigate the contributions of different distance metrics, and propose an adaptive fusion scheme, bringing significant improvements in few-shot classification.

Few-Shot Learning

Paper
Add Code

tSF: Transformer-based Semantic Filter for Few-Shot Learning

1 code implementation • 2 Nov 2022 • Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, Bin-Bin Gao, Chengjie Wang

Few-Shot Learning (FSL) alleviates the data shortage challenge via embedding discriminative target-aware features among plenty seen (base) and few unseen (novel) labeled samples.

Few-Shot Learning object-detection +1

Paper
Code

Towards Continual Adaptation in Industrial Anomaly Detection

1 code implementation • ACMMM 2022 • Wujin Li, Jiawei Zhan, Jinbao Wang, Bizhong Xia, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng

We believe that the proposed task and benchmark will be beneficial to the field of AD.

Anomaly Detection continual anomaly detection +2

Paper
Code

Rethinking Dimensionality Reduction in Grid-based 3D Object Detection

no code implementations • 20 Sep 2022 • Dihe Huang, Ying Chen, Yikang Ding, Jinli Liao, Jianlin Liu, Kai Wu, Qiang Nie, Yong liu, Chengjie Wang, Zhiheng Li

In MDRNet, the Spatial-aware Dimensionality Reduction (SDR) is designed to dynamically focus on the valuable parts of the object during voxel-to-BEV feature transformation.

3D Object Detection Cloud Detection +3

Paper
Add Code

Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation

1 code implementation • Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022) 2022 • Bin-Bin Gao, Xiaochen Chen, Zhongyi Huang, Congchong Nie, Jun Liu, Jinxiang Lai, Guannan Jiang, Xi Wang, Chengjie Wang

This paper focus on few-shot object detection~(FSOD) and instance segmentation~(FSIS), which requires a model to quickly adapt to novel classes with a few labeled instances.

Ranked #3 on Few-Shot Object Detection on MS-COCO (1-shot)

Few-Shot Object Detection Instance Segmentation +2

Paper
Code

Joint Learning Content and Degradation Aware Feature for Blind Super-Resolution

1 code implementation • 29 Aug 2022 • Yifeng Zhou, Chuming Lin, Donghao Luo, Yong liu, Ying Tai, Chengjie Wang, Mingang Chen

Although some Unsupervised Degradation Prediction (UDP) methods are proposed to bypass this problem, the \textit{inconsistency} between degradation embedding and SR feature is still challenging.

Blind Super-Resolution Image Super-Resolution +1

Paper
Code

Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection

no code implementations • 27 Jul 2022 • Jianshu Li, Man Luo, Jian Liu, Tao Chen, Chengjie Wang, Ziwei Liu, Shuo Liu, Kewei Yang, Xuning Shao, Kang Chen, Boyuan Liu, Mingyu Guo, Ying Guo, Yingying Ao, Pengfei Gao

In this paper, we present the solutions from the Top 3 teams, in order to boost the research work in the field of image forgery detection.

Image Forgery Detection Image Generation +1

Paper
Add Code

SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

1 code implementation • 21 Jul 2022 • Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu, Ying Tai, Chengjie Wang

Point cloud completion has become increasingly popular among generation tasks of 3D point clouds, as it is a challenging yet indispensable problem to recover the complete shape of a 3D object from its partial observation.

Ranked #7 on Point Cloud Completion on Completion3D

Point Cloud Completion

Paper
Code

Adaptive Assignment for Geometry Aware Local Feature Matching

1 code implementation • CVPR 2023 • Dihe Huang, Ying Chen, Shang Xu, Yong liu, Wenlong Wu, Yikang Ding, Chengjie Wang, Fan Tang

The detector-free feature matching approaches are currently attracting great attention thanks to their excellent performance.

Feature Correlation

Paper
Code

Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation

1 code implementation • 14 Jul 2022 • Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang

Unsupervised Domain Adaptation (UDA) aims to adapt the model trained on the labeled source domain to an unlabeled target domain.

Ranked #14 on Unsupervised Domain Adaptation on SYNTHIA-to-Cityscapes

Contrastive Learning Semantic Segmentation +1

Paper
Code

EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm

1 code implementation • 19 Jun 2022 • Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong liu, DaCheng Tao

Motivated by biological evolution, this paper explains the rationality of Vision Transformer by analogy with the proven practical Evolutionary Algorithm (EA) and derives that both have consistent mathematical formulation.

Image Classification

Paper
Code

How to Reduce Change Detection to Semantic Segmentation

1 code implementation • 15 Jun 2022 • Guo-Hua Wang, Bin-Bin Gao, Chengjie Wang

And most segmentation networks can be adapted to solve the CD problems with our MTF module.

Ranked #1 on Change Detection on PCD

Change Detection Scene Change Detection +2

Paper
Code

IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation

2 code implementations • CVPR 2022 • Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Xiaoming Huang, Ying Tai, Chengjie Wang, Jie Yang

Prevailing video frame interpolation algorithms, that generate the intermediate frames from consecutive inputs, typically rely on complex model architectures with heavy parameters or large delay, hindering them from diverse real-time applications.

Ranked #1 on Video Frame Interpolation on Middlebury

Decoder Optical Flow Estimation +1

241

Paper
Code

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

1 code implementation • 27 May 2022 • Guohang Yan, Liu Zhuochun, Chengjie Wang, Chunlei Shi, Pengjin Wei, Xinyu Cai, Tao Ma, Zhizheng Liu, Zebin Zhong, Yuqian Liu, Ming Zhao, Zheng Ma, Yikang Li

To this end, we present OpenCalib, a calibration toolbox that contains a rich set of various sensor calibration methods.

Autonomous Driving

2,105

Paper
Code

UniInst: Unique Representation for End-to-End Instance Segmentation

1 code implementation • 25 May 2022 • Yimin Ou, Rui Yang, Lufan Ma, Yong liu, Jiangpeng Yan, Shang Xu, Chengjie Wang, Xiu Li

Existing instance segmentation methods have achieved impressive performance but still suffer from a common dilemma: redundant representations (e. g., multiple boxes, grids, and anchor points) are inferred for one instance, which leads to multiple duplicated predictions.

Instance Segmentation Re-Ranking +2

132

Paper
Code

FRIH: Fine-grained Region-aware Image Harmonization

no code implementations • 13 May 2022 • Jinlong Peng, Zekun Luo, Liang Liu, Boshen Zhang, Tao Wang, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

Image harmonization aims to generate a more realistic appearance of foreground and background for a composite image.

Decoder Image Harmonization

Paper
Add Code

Surface Representation for Point Clouds

1 code implementation • CVPR 2022 • Haoxi Ran, Jun Liu, Chengjie Wang

Based on a simple baseline of PointNet++ (SSG version), Umbrella RepSurf surpasses the previous state-of-the-art by a large margin for classification, segmentation and detection on various benchmarks in terms of performance and efficiency.

Ranked #6 on 3D Point Cloud Classification on ModelNet40

3D Object Detection 3D Semantic Segmentation +2

323

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

117

Paper
Code

High-resolution Iterative Feedback Network for Camouflaged Object Detection

1 code implementation • 22 Mar 2022 • Xiaobin Hu, Shuo Wang, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, Ling Shao

Spotting camouflaged objects that are visually assimilated into the background is tricky for both object detection algorithms and humans who are usually confused or cheated by the perfectly intrinsic similarities between the foreground objects and the background surroundings.

Object object-detection +2

Paper
Code

CtlGAN: Few-shot Artistic Portraits Generation with Contrastive Transfer Learning

no code implementations • 16 Mar 2022 • Yue Wang, Ran Yi, Luying Li, Ying Tai, Chengjie Wang, Lizhuang Ma

We propose a new encoder which embeds real faces into Z+ space and proposes a dual-path training strategy to better cope with the adapted decoder and eliminate the artifacts.

Decoder Image-to-Image Translation +1

Paper
Add Code

Learning Distinctive Margin toward Active Domain Adaptation

1 code implementation • CVPR 2022 • Ming Xie, Yuxi Li, Yabiao Wang, Zekun Luo, Zhenye Gan, Zhongyi Sun, Mingmin Chi, Chengjie Wang, Pei Wang

Despite plenty of efforts focusing on improving the domain adaptation ability (DA) under unsupervised or few-shot semi-supervised settings, recently the solution of active learning started to attract more attention due to its suitability in transferring model in a more practical way with limited annotation resource on target data.

Active Learning Domain Adaptation

Paper
Code

Class-Aware Contrastive Semi-Supervised Learning

1 code implementation • CVPR 2022 • Fan Yang, Kai Wu, Shuyi Zhang, Guannan Jiang, Yong liu, Feng Zheng, Wei zhang, Chengjie Wang, Long Zeng

Pseudo-label-based semi-supervised learning (SSL) has achieved great success on raw data utilization.

Ranked #1 on Semi-Supervised Image Classification on CIFAR-100 (250 Labels, ImageNet-100 Unlabeled)

Pseudo Label Semi-Supervised Image Classification

Paper
Code

A Survey of Visual Sensory Anomaly Detection

1 code implementation • 14 Feb 2022 • Xi Jiang, Guoyang Xie, Jinbao Wang, Yong liu, Chengjie Wang, Feng Zheng, Yaochu Jin

In this survey, we are the first one to provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.

Anomaly Detection

Paper
Code

STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation

no code implementations • 8 Feb 2022 • Zhengkai Jiang, Zhangxuan Gu, Jinlong Peng, Hang Zhou, Liang Liu, Yabiao Wang, Ying Tai, Chengjie Wang, Liqing Zhang

In contrast, we present a simple and efficient single-stage VIS framework based on the instance segmentation method CondInst by adding an extra tracking head.

Ranked #36 on Video Instance Segmentation on YouTube-VIS validation

Contrastive Learning Instance Segmentation +3

Paper
Add Code

ASFD: Automatic and Scalable Face Detector

no code implementations • 26 Jan 2022 • Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, Zhenyu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang, Yili Xia

Along with current multi-scale based detectors, Feature Aggregation and Enhancement (FAE) modules have shown superior performance gains for cutting-edge object detection.

Ranked #1 on Face Detection on WIDER Face (Medium)

Face Detection object-detection +1

Paper
Add Code

CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation

no code implementations • 13 Jan 2022 • Yifeng Chen, Wenqing Chu, Fangfang Wang, Ying Tai, Ran Yi, Zhenye Gan, Liang Yao, Chengjie Wang, Xi Li

Recently, there is growing attention on one-stage panoptic segmentation methods which aim to segment instances and stuff jointly within a fully convolutional pipeline efficiently.

Instance Segmentation Panoptic Segmentation +1

Paper
Add Code

Learning To Restore 3D Face From In-the-Wild Degraded Images

no code implementations • CVPR 2022 • Zhenyu Zhang, Yanhao Ge, Ying Tai, Xiaoming Huang, Chengjie Wang, Hao Tang, Dongjin Huang, Zhifeng Xie

In-the-wild 3D face modelling is a challenging problem as the predicted facial geometry and texture suffer from a lack of reliable clues or priors, when the input images are degraded.

3D Face Modelling Face Reconstruction

Paper
Add Code

En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning

no code implementations • CVPR 2022 • Xia Kong, Zuodong Gao, Xiaofan Li, Ming Hong, Jun Liu, Chengjie Wang, Yuan Xie, Yanyun Qu

Our ICCE promotes intra-class compactness with inter-class separability on both seen and unseen classes in the embedding space and visual feature space.

Generalized Zero-Shot Learning

Paper
Add Code

Blind Face Restoration via Integrating Face Shape and Generative Priors

no code implementations • CVPR 2022 • Feida Zhu, Junwei Zhu, Wenqing Chu, Xinyi Zhang, Xiaozhong Ji, Chengjie Wang, Ying Tai

Moreover, we introduce hybrid-level losses to jointly train the shape and generative priors together with other network parts such that these two priors better adapt to our blind face restoration task.

3D Reconstruction Blind Face Restoration +1

Paper
Add Code

ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-High Resolution Segmentation

1 code implementation • CVPR 2022 • Shaohua Guo, Liang Liu, Zhenye Gan, Yabiao Wang, Wuhao Zhang, Chengjie Wang, Guannan Jiang, Wei zhang, Ran Yi, Lizhuang Ma, Ke Xu

The huge burden of computation and memory are two obstacles in ultra-high resolution image segmentation.

Image Segmentation Segmentation +1

Paper
Code

Learning To Memorize Feature Hallucination for One-Shot Image Generation

no code implementations • CVPR 2022 • Yu Xie, Yanwei Fu, Ying Tai, Yun Cao, Junwei Zhu, Chengjie Wang

In this paper, we propose a novel model to explicitly learn and memorize reusable features that can help hallucinate novel category images.

Hallucination Image Generation

Paper
Add Code

Physically-Guided Disentangled Implicit Rendering for 3D Face Modeling

no code implementations • CVPR 2022 • Zhenyu Zhang, Yanhao Ge, Ying Tai, Weijian Cao, Renwang Chen, Kunlin Liu, Hao Tang, Xiaoming Huang, Chengjie Wang, Zhifeng Xie, Dongjin Huang

This paper presents a novel Physically-guided Disentangled Implicit Rendering (PhyDIR) framework for high-fidelity 3D face modeling.

3D Face Modelling Neural Rendering

Paper
Add Code

LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization

no code implementations • 10 Dec 2021 • Zhiwei Chen, Changan Wang, Yabiao Wang, Guannan Jiang, Yunhang Shen, Ying Tai, Chengjie Wang, Wei zhang, Liujuan Cao

In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which targets at enhancing the local perception capability of global features among long-range feature dependencies.

Inductive Bias Object +1

Paper
Add Code

Ranking Distance Calibration for Cross-Domain Few-Shot Learning

no code implementations • CVPR 2022 • Pan Li, Shaogang Gong, Chengjie Wang, Yanwei Fu

The calibrated distance in this target-aware non-linear subspace is complementary to that in the pre-trained representation.

cross-domain few-shot learning Image Retrieval +2

Paper
Add Code

APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

no code implementations • 24 Nov 2021 • Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao

In practice, it can adaptively generate multiple class-agnostic prototypes for query images and learn feature alignment in a self-contrastive manner.

Ranked #46 on Few-Shot Semantic Segmentation on COCO-20i (1-shot)

Few-Shot Semantic Segmentation Metric Learning +2

Paper
Add Code

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

1 code implementation • 19 Oct 2021 • Yuxi Li, Boshen Zhang, Jian Li, Yabiao Wang, Weiyao Lin, Chengjie Wang, Jilin Li, Feiyue Huang

We demonstrate that both temporal grains are beneficial to atomic action recognition.

Action Detection Atomic action recognition

Paper
Code

Robust Learning with Adaptive Sample Credibility Modeling

no code implementations • 29 Sep 2021 • Boshen Zhang, Yuxi Li, Yuanpeng Tu, Yabiao Wang, Yang Xiao, Cai Rong Zhao, Chengjie Wang

For the clean set, we deliberately design a memory-based modulation scheme to dynamically adjust the contribution of each sample in terms of its historical credibility sequence during training, thus to alleviate the effect from potential hard noisy samples in clean set.

Denoising

Paper
Add Code

Uniformity in Heterogeneity:Diving Deep into Count Interval Partition for Crowd Counting

3 code implementations • 27 Jul 2021 • Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu

Therefore, we propose a novel count interval partition criterion called Uniform Error Partition (UEP), which always keeps the expected counting error contributions equal for all intervals to minimize the prediction risk.

Crowd Counting Quantization

391

Paper
Code

Rethinking Counting and Localization in Crowds:A Purely Point-Based Framework

3 code implementations • 27 Jul 2021 • Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu

In this paper, we propose a purely point-based framework for joint crowd counting and individual localization.

Ranked #4 on Crowd Counting on ShanghaiTech A

Crowd Counting

391

Paper
Code

Learning To Restore Hazy Video: A New Real-World Dataset and a New Method

no code implementations • CVPR 2021 • Xinyi Zhang, Hang Dong, Jinshan Pan, Chao Zhu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Fei Wang

On the other hand, the video dehazing algorithms, which can acquire more satisfying dehazing results by exploiting the temporal redundancy from neighborhood hazy frames, receive less attention due to the absence of the video dehazing datasets.

Image Dehazing

Paper
Add Code

HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping

1 code implementation • 18 Jun 2021 • YuHan Wang, Xu Chen, Junwei Zhu, Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Yongjian Wu, Feiyue Huang, Rongrong Ji

In this work, we propose a high fidelity face swapping method, called HifiFace, which can well preserve the face shape of the source face and generate photo-realistic results.

Ranked #7 on Face Swapping on FaceForensics++

3D Face Reconstruction Decoder +3

331

Paper
Code

Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection

1 code implementation • CVPR 2021 • Zhenyu Zhang, Yanhao Ge, Renwang Chen, Ying Tai, Yan Yan, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang

Non-parametric face modeling aims to reconstruct 3D face only from images without shape assumptions.

3D Face Modelling Attribute

151

Paper
Code

Context-Aware Image Inpainting with Learned Semantic Priors

1 code implementation • 14 Jun 2021 • Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang

Based on the semantic priors, we further propose a context-aware image inpainting model, which adaptively integrates global semantics and local features in a unified image generator.

Image Inpainting Knowledge Distillation

Paper
Code

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model

1 code implementation • NeurIPS 2021 • Jiangning Zhang, Chao Xu, Jian Li, Wenzhou Chen, Yabiao Wang, Ying Tai, Shuo Chen, Chengjie Wang, Feiyue Huang, Yong liu

Inspired by biological evolution, we explain the rationality of Vision Transformer by analogy with the proven practical Evolutionary Algorithm (EA) and derive that both of them have consistent mathematical representation.

Image Retrieval Retrieval

Paper
Code

SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking

no code implementations • 24 May 2021 • Jinlong Peng, Zhengkai Jiang, Yueyang Gu, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

In addition, we add a localization branch to predict the localization accuracy, so that it can work as the replacement of the regression assistance link during inference.

Classification Object +2

Paper
Add Code

SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive Background Prototypes

no code implementations • 19 Apr 2021 • Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao

To this end, we generate self-contrastive background prototypes directly from the query image, with which we enable the construction of complete sample pairs and thus a complementary and auxiliary segmentation task to achieve the training of a better segmentation model.

Few-Shot Semantic Segmentation Metric Learning +2

Paper
Add Code

Learning Dynamic Alignment via Meta-filter for Few-shot Learning

1 code implementation • CVPR 2021 • Chengming Xu, Chen Liu, Li Zhang, Chengjie Wang, Jilin Li, Feiyue Huang, xiangyang xue, Yanwei Fu

Our insight is that these methods would lead to poor adaptation with redundant matching, and leveraging channel-wise adjustment is the key to well adapting the learned knowledge to new classes.

Few-Shot Learning Position

Paper
Code

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization

1 code implementation • CVPR 2021 • Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu

Temporal action localization is an important yet challenging task in video understanding.

Temporal Action Localization Temporal Localization +1

169

Paper
Code

Learning Comprehensive Motion Representation for Action Recognition

no code implementations • 23 Mar 2021 • Mingyu Wu, Boyuan Jiang, Donghao Luo, Junchi Yan, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xiaokang Yang

For action recognition learning, 2D CNN-based methods are efficient but may yield redundant features due to applying the same 2D convolution kernel to each frame.

Action Recognition

Paper
Add Code

Aurora Guard: Reliable Face Anti-Spoofing via Mobile Lighting System

no code implementations • 1 Feb 2021 • Jian Zhang, Ying Tai, Taiping Yao, Jia Meng, Shouhong Ding, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

Face authentication on mobile end has been widely applied in various scenarios.

Face Anti-Spoofing

Paper
Add Code

Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework

1 code implementation • ICCV 2021 • Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu

In this paper, we propose a purely point-based framework for joint crowd counting and individual localization.

Crowd Counting

391

Paper
Code

Uniformity in Heterogeneity: Diving Deep Into Count Interval Partition for Crowd Counting

1 code implementation • ICCV 2021 • Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu

Crowd Counting Quantization

Paper
Code

Frequency Consistent Adaptation for Real World Super Resolution

no code implementations • 18 Dec 2020 • Xiaozhong Ji, Guangpin Tao, Yun Cao, Ying Tai, Tong Lu, Chengjie Wang, Jilin Li, Feiyue Huang

From this point of view, we design a novel Frequency Consistent Adaptation (FCA) that ensures the frequency domain consistency when applying existing SR methods to the real scene.

Super-Resolution

Paper
Add Code

They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning

no code implementations • 27 Nov 2020 • Zhuo Huang, Ying Tai, Chengjie Wang, Jian Yang, Chen Gong

Semi-Supervised Learning (SSL) with mismatched classes deals with the problem that the classes-of-interests in the limited labeled data is only a subset of the classes in massive unlabeled data.

Domain Adaptation

Paper
Add Code

Adversarial Refinement Network for Human Motion Prediction

no code implementations • 23 Nov 2020 • Xianjin Chao, Yanrui Bin, Wenqing Chu, Xuan Cao, Yanhao Ge, Chengjie Wang, Jilin Li, Feiyue Huang, Howard Leung

Specifically, we take both the historical motion sequences and coarse prediction as input of our cascaded refinement network to predict refined human motion and strengthen the refinement network with adversarial error augmentation.

Human motion prediction motion prediction

Paper
Add Code

Adversarial Semantic Data Augmentation for Human Pose Estimation

1 code implementation • ECCV 2020 • Yanrui Bin, Xuan Cao, Xinya Chen, Yanhao Ge, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Changxin Gao, Nong Sang

Human pose estimation is the task of localizing body keypoints from still images.

Data Augmentation Pose Estimation

Paper
Code

Dense Scene Multiple Object Tracking with Box-Plane Matching

no code implementations • 30 Jul 2020 • Jinlong Peng, Yueyang Gu, Yabiao Wang, Chengjie Wang, Jilin Li, Feiyue Huang

Multiple Object Tracking (MOT) is an important task in computer vision.

Multiple Object Tracking Object

Paper
Add Code

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking

1 code implementation • ECCV 2020 • Jinlong Peng, Changan Wang, Fangbin Wan, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu

Existing Multiple-Object Tracking (MOT) methods either follow the tracking-by-detection paradigm to conduct object detection, feature extraction and data association separately, or have two of the three subtasks integrated to form a partially end-to-end solution.

Multiple Object Tracking Object +3

245

Paper
Code

Temporal Distinct Representation Learning for Action Recognition

no code implementations • ECCV 2020 • Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.

Action Recognition Representation Learning

Paper
Add Code

ACFD: Asymmetric Cartoon Face Detector

2 code implementations • 2 Jul 2020 • Bin Zhang, Jian Li, Yabiao Wang, Zhipeng Cui, Yili Xia, Chengjie Wang, Jilin Li, Feiyue Huang

Cartoon face detection is a more challenging task than human face detection due to many difficult scenarios is involved.

Binary Classification Face Detection

Paper
Code

Real-World Super-Resolution via Kernel Estimation and Noise Injection

2 code implementations • CVPRW 2020 • Xiaozhong Ji, Yun Cao, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang

Recent state-of-the-art super-resolution methods have achieved impressive performance on ideal datasets regardless of blur and noise.

Ranked #1 on Video Super-Resolution on MSU Super-Resolution for Video Compression

Image Super-Resolution Video Super-Resolution

1,061

Paper
Code

Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation

2 code implementations • CVPR 2020 • Liang Liu, Jiangning Zhang, Ruifei He, Yong liu, Yabiao Wang, Ying Tai, Donghao Luo, Chengjie Wang, Jilin Li, Feiyue Huang

Unsupervised learning of optical flow, which leverages the supervision from view synthesis, has emerged as a promising alternative to supervised methods.

Ranked #2 on Optical Flow Estimation on KITTI 2012 unsupervised

Decoder Optical Flow Estimation +1

248

Paper
Code

ASFD: Automatic and Scalable Face Detector

no code implementations • 25 Mar 2020 • Bin Zhang, Jian Li, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yili Xia, Wenjiang Pei, Rongrong Ji

In this paper, we propose a novel Automatic and Scalable Face Detector (ASFD), which is based on a combination of neural architecture search techniques as well as a new loss design.

Neural Architecture Search

Paper
Add Code

Adversarial Domain Adaptation with Domain Mixup

1 code implementation • 4 Dec 2019 • Minghao Xu, Jian Zhang, Bingbing Ni, Teng Li, Chengjie Wang, Qi Tian, Wenjun Zhang

In this paper, we present adversarial domain adaptation with domain mixup (DM-ADA), which guarantees domain-invariance in a more continuous latent space and guides the domain discriminator in judging samples' difference relative to source and target domains.

Domain Adaptation

159

Paper
Code

TEINet: Towards an Efficient Architecture for Video Recognition

no code implementations • 21 Nov 2019 • Zhao-Yang Liu, Donghao Luo, Yabiao Wang, Li-Min Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu

To relieve this problem, we propose an efficient temporal module, termed as Temporal Enhancement-and-Interaction (TEI Module), which could be plugged into the existing 2D CNNs (denoted by TEINet).

Action Recognition Video Recognition

Paper
Add Code

Fast Learning of Temporal Action Proposal via Dense Boundary Generator

3 code implementations • 11 Nov 2019 • Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

In this paper, we propose an efficient and unified framework to generate temporal action proposals named Dense Boundary Generator (DBG), which draws inspiration from boundary-sensitive methods and implements boundary classification and action completeness regression for densely distributed proposals.

Ranked #7 on Temporal Action Localization on FineAction

General Classification Optical Flow Estimation +2

345

Paper
Code

Anti-Confusing: Region-Aware Network for Human Pose Estimation

no code implementations • 3 May 2019 • Xuan Cao, Yanhao Ge, Ying Tai, Wei zhang, Jian Li, Chengjie Wang, Jilin Li, Feiyue Huang

In this work, we propose a novel framework named Region-Aware Network (RANet), which learns the ability of anti-confusing in case of heavy occlusion, nearby person and symmetric appearance, for human pose estimation.

Data Augmentation Pose Estimation

Paper
Add Code

Aurora Guard: Real-Time Face Anti-Spoofing via Light Reflection

no code implementations • 27 Feb 2019 • Yao Liu, Ying Tai, Jilin Li, Shouhong Ding, Chengjie Wang, Feiyue Huang, Dongyang Li, Wenshuai Qi, Rongrong Ji

In this paper, we propose a light reflection based face anti-spoofing method named Aurora Guard (AG), which is fast, simple yet effective that has already been deployed in real-world systems serving for millions of users.

Face Anti-Spoofing General Classification

Paper
Add Code

Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos

1 code implementation • 1 Nov 2018 • Ying Tai, Yicong Liang, Xiaoming Liu, Lei Duan, Jilin Li, Chengjie Wang, Feiyue Huang, Yu Chen

In recent years, heatmap regression based models have shown their effectiveness in face alignment and pose estimation.

Face Alignment Pose Estimation +3

Paper
Code

DSFD: Dual Shot Face Detector

4 code implementations • CVPR 2019 • Jian Li, Yabiao Wang, Changan Wang, Ying Tai, Jianjun Qian, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang

In this paper, we propose a novel face detection network with three novel contributions that address three key aspects of face detection, including better feature learning, progressive loss design and anchor assign based data augmentation, respectively.

Ranked #1 on Face Detection on FDDB

Data Augmentation Occluded Face Detection

2,866

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.