Search Results for author: Liujuan Cao

Found 43 papers, 29 papers with code

API-Net: Robust Generative Classifier via a Single Discriminator

1 code implementation • ECCV 2020 • Xinshuai Dong, Hong Liu, Rongrong Ji, Liujuan Cao, Qixiang Ye, Jianzhuang Liu, Qi Tian

On the contrary, a discriminative classifier only models the conditional distribution of labels given inputs, but benefits from effective optimization owing to its succinct structure.

Robust classification

Paper
Code

Multi-Modal Prompt Learning on Blind Image Quality Assessment

no code implementations • 23 Apr 2024 • Wensheng Pan, Timin Gao, Yan Zhang, Runze Hu, Xiawu Zheng, Enwei Zhang, Yuting Gao, Yutao Liu, Yunhang Shen, Ke Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji

Image Quality Assessment (IQA) models benefit significantly from semantic information, which allows them to treat different types of objects distinctly.

Paper
Add Code

CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method

1 code implementation • 23 Apr 2024 • Mingbao Lin, Zhihang Lin, Wengyi Zhan, Liujuan Cao, Rongrong Ji

Transforming large pre-trained low-resolution diffusion models to cater to higher-resolution demands, i. e., diffusion extrapolation, significantly improves diffusion adaptability.

Paper
Code

NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation

no code implementations • 22 Apr 2024 • Chi Huang, Xinyang Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji

As a preliminary work, NeRF-Det unifies the tasks of novel view synthesis and 3D perception, demonstrating that perceptual tasks can benefit from novel view synthesis methods like NeRF, significantly improving the performance of indoor multi-view 3D object detection.

Paper
Add Code

DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis

1 code implementation • 27 Mar 2024 • Zhongxi Chen, Ke Sun, Ziyin Zhou, Xianming Lin, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji

The rapid progress in deep learning has given rise to hyper-realistic facial forgery methods, leading to concerns related to misinformation and security risks.

Image Generation Misinformation

Paper
Code

DMAD: Dual Memory Bank for Real-World Anomaly Detection

no code implementations • 19 Mar 2024 • Jianlong Hu, Xu Chen, Zhenye Gan, Jinlong Peng, Shengchuan Zhang, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Liujuan Cao, Rongrong Ji

To address the challenge of real-world anomaly detection, we propose a new framework named Dual Memory bank enhanced representation learning for Anomaly Detection (DMAD).

Anomaly Detection Representation Learning

Paper
Add Code

Weakly Supervised Open-Vocabulary Object Detection

no code implementations • 19 Dec 2023 • Jianghang Lin, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, Liujuan Cao

Despite weakly supervised object detection (WSOD) being a promising step toward evading strong instance-level annotations, its capability is confined to closed-set categories within a single training dataset.

Attribute Novel Concepts +6

Paper
Add Code

A Unified Framework for 3D Point Cloud Visual Grounding

1 code implementation • 23 Aug 2023 • Haojia Lin, Yongdong Luo, Xiawu Zheng, Lijiang Li, Fei Chao, Taisong Jin, Donghao Luo, Yan Wang, Liujuan Cao, Rongrong Ji

This elaborate design enables 3DRefTR to achieve both well-performing 3DRES and 3DREC capacities with only a 6% additional latency compared to the original 3DREC model.

Referring Expression Referring Expression Comprehension +1

Paper
Code

Pseudo-label Alignment for Semi-supervised Instance Segmentation

1 code implementation • ICCV 2023 • Jie Hu, Chen Chen, Liujuan Cao, Shengchuan Zhang, Annan Shu, Guannan Jiang, Rongrong Ji

Through extensive experiments conducted on the COCO and Cityscapes datasets, we demonstrate that PAIS is a promising framework for semi-supervised instance segmentation, particularly in cases where labeled data is severely limited.

Instance Segmentation Pseudo Label +3

Paper
Code

InterFormer: Real-time Interactive Image Segmentation

1 code implementation • ICCV 2023 • You Huang, Hao Yang, Ke Sun, Shengchuan Zhang, Liujuan Cao, Guannan Jiang, Rongrong Ji

Interactive image segmentation enables annotators to efficiently perform pixel-level annotation for segmentation tasks.

Computational Efficiency Image Segmentation +3

Paper
Code

Geometric-aware Pretraining for Vision-centric 3D Object Detection

1 code implementation • 6 Apr 2023 • Linyan Huang, Huijie Wang, Jia Zeng, Shengchuan Zhang, Liujuan Cao, Junchi Yan, Hongyang Li

We also conduct experiments on various image backbones and view transformations to validate the efficacy of our approach.

3D Object Detection Autonomous Driving +2

1,067

Paper
Code

You Only Segment Once: Towards Real-Time Panoptic Segmentation

2 code implementations • CVPR 2023 • Jie Hu, Linyan Huang, Tianhe Ren, Shengchuan Zhang, Rongrong Ji, Liujuan Cao

To reduce the computational overhead, we design a feature pyramid aggregator for the feature map extraction, and a separable dynamic decoder for the panoptic kernel generation.

Panoptic Segmentation Segmentation

227

Paper
Code

Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-identification

no code implementations • 20 Mar 2023 • Jiaer Xia, Lei Tan, Pingyang Dai, Mingbo Zhao, Yongjian Wu, Liujuan Cao

To address this issue, we propose a novel transformer-based Attention Disturbance and Dual-Path Constraint Network (ADP) to enhance the generalization of attention networks.

Person Re-Identification

Paper
Add Code

Active Teacher for Semi-Supervised Object Detection

1 code implementation • CVPR 2022 • Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji

In this paper, we study teacher-student learning from the perspective of data initialization and propose a novel algorithm called Active Teacher(Source code are available at: \url{https://github. com/HunterJ-Lin/ActiveTeacher}) for semi-supervised object detection (SSOD).

Object object-detection +2

Paper
Code

DistilPose: Tokenized Pose Regression with Heatmap Distillation

1 code implementation • CVPR 2023 • Suhang Ye, Yingyi Zhang, Jie Hu, Liujuan Cao, Shengchuan Zhang, Lei Shen, Jun Wang, Shouhong Ding, Rongrong Ji

Specifically, DistilPose maximizes the transfer of knowledge from the teacher model (heatmap-based) to the student model (regression-based) through Token-distilling Encoder (TDE) and Simulated Heatmaps.

Knowledge Distillation Pose Estimation +1

Paper
Code

Practical Cross-System Shilling Attacks with Limited Access to Data

1 code implementation • 14 Feb 2023 • Meifang Zeng, Ke Li, Bingchuan Jiang, Liujuan Cao, Hui Li

With the idea of Cross-system Attack, we design a Practical Cross-system Shilling Attack (PC-Attack) framework that requires little information about the victim RS model and the target RS data for conducting attacks.

Recommendation Systems

Paper
Code

Unsupervised Domain Adaptation on Person Re-Identification via Dual-level Asymmetric Mutual Learning

no code implementations • 29 Jan 2023 • Qiong Wu, Jiahan Li, Pingyang Dai, Qixiang Ye, Liujuan Cao, Yongjian Wu, Rongrong Ji

The knowledge transfer between two networks is based on an asymmetric mutual learning manner.

Person Re-Identification Pseudo Label +2

Paper
Add Code

Category-aware Allocation Transformer for Weakly Supervised Object Localization

no code implementations • ICCV 2023 • Zhiwei Chen, Jinren Ding, Liujuan Cao, Yunhang Shen, Shengchuan Zhang, Guannan Jiang, Rongrong Ji

Weakly supervised object localization (WSOL) aims to localize objects based on only image-level labels as supervision.

Object Weakly-Supervised Object Localization

Paper
Add Code

Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability

1 code implementation • 19 Jul 2022 • Xudong Mao, Liujuan Cao, Aurele T. Gnanha, Zhenguo Yang, Qing Li, Rongrong Ji

The recently proposed pivotal tuning model makes significant progress towards reconstruction and editability, by using a two-step approach that first inverts the input image into a latent code, called pivot code, and then alters the generator so that the input image can be accurately mapped into the pivot code.

Paper
Code

Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain

1 code implementation • 15 Jul 2022 • Jiazhen Ji, Huan Wang, Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding, Shengchuan Zhang, Liujuan Cao, Rongrong Ji

This paper proposes a privacy-preserving face recognition method using differential privacy in the frequency domain.

Face Recognition Privacy Preserving

1,223

Paper
Code

Knowledge Condensation Distillation

2 code implementations • 12 Jul 2022 • Chenxin Li, Mingbao Lin, Zhiyuan Ding, Nie Lin, Yihong Zhuang, Yue Huang, Xinghao Ding, Liujuan Cao

Knowledge Distillation (KD) transfers the knowledge from a high-capacity teacher network to strengthen a smaller student.

Knowledge Distillation

1,159

Paper
Code

Super Vision Transformer

1 code implementation • 23 May 2022 • Mingbao Lin, Mengzhao Chen, Yuxin Zhang, Chunhua Shen, Rongrong Ji, Liujuan Cao

Experimental results on ImageNet demonstrate that our SuperViT can considerably reduce the computational costs of ViT models with even performance increase.

Paper
Code

Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks

1 code implementation • 16 Apr 2022 • Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji

Despite the exciting performance, Transformer is criticized for its excessive parameters and computation cost.

Image Classification

Paper
Code

SeqTR: A Simple yet Universal Network for Visual Grounding

3 code implementations • 30 Mar 2022 • Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji

In this paper, we propose a simple yet universal network termed SeqTR for visual grounding tasks, e. g., phrase localization, referring expression comprehension (REC) and segmentation (RES).

Referring Expression Referring Expression Comprehension +1

122

Paper
Code

ARM: Any-Time Super-Resolution Method

1 code implementation • 21 Mar 2022 • Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji

To that effect, we construct an Edge-to-PSNR lookup table that maps the edge score of an image patch to the PSNR performance for each subnet, together with a set of computation costs for the subnets.

Image Super-Resolution

Paper
Code

Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest Filters

1 code implementation • 15 Feb 2022 • Mingbao Lin, Liujuan Cao, Yuxin Zhang, Ling Shao, Chia-Wen Lin, Rongrong Ji

Then, we introduce a recommendation-based filter selection scheme where each filter recommends a group of its closest filters.

Image Classification Network Pruning

Paper
Code

LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization

no code implementations • 10 Dec 2021 • Zhiwei Chen, Changan Wang, Yabiao Wang, Guannan Jiang, Yunhang Shen, Ying Tai, Chengjie Wang, Wei zhang, Liujuan Cao

In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which targets at enhancing the local perception capability of global features among long-range feature dependencies.

Inductive Bias Object +1

Paper
Add Code

Prioritized Subnet Sampling for Resource-Adaptive Supernet Training

1 code implementation • 12 Sep 2021 • Bohong Chen, Mingbao Lin, Rongrong Ji, Liujuan Cao

At the end of training, our PSS-Net retains the best subnet in each pool to entitle a fast switch of high-quality subnets for inference when the available resources vary.

Paper
Code

Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation

no code implementations • CVPR 2021 • Yunhang Shen, Liujuan Cao, Zhiwei Chen, Feihong Lian, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji

To date, learning weakly supervised panoptic segmentation (WSPS) with only image-level labels remains unexplored.

Instance Segmentation Multiple Instance Learning +6

Paper
Add Code

ISTR: End-to-End Instance Segmentation with Transformers

1 code implementation • 3 May 2021 • Jie Hu, Liujuan Cao, Yao Lu, Shengchuan Zhang, Yan Wang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji

However, such an upgrade is not applicable to instance segmentation, due to its significantly higher output dimensions compared to object detection.

Ranked #21 on Instance Segmentation on COCO test-dev

Instance Segmentation object-detection +3

200

Paper
Code

SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance

1 code implementation • CVPR 2021 • Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang

Thus, we propose a novel unsupervised FIQA method that incorporates Similarity Distribution Distance for Face Image Quality Assessment (SDD-FIQA).

Face Image Quality Face Image Quality Assessment +1

1,223

Paper
Code

Image-to-image Translation via Hierarchical Style Disentanglement

1 code implementation • CVPR 2021 • Xinyang Li, Shengchuan Zhang, Jie Hu, Liujuan Cao, Xiaopeng Hong, Xudong Mao, Feiyue Huang, Yongjian Wu, Rongrong Ji

Recently, image-to-image translation has made significant progress in achieving both multi-label (\ie, translation conditioned on different labels) and multi-style (\ie, generation with diverse styles) tasks.

Disentanglement Multimodal Unsupervised Image-To-Image Translation +1

383

Paper
Code

DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results

2 code implementations • 18 Feb 2021 • Liming Jiang, Zhengkui Guo, Wayne Wu, Zhaoyang Liu, Ziwei Liu, Chen Change Loy, Shuo Yang, Yuanjun Xiong, Wei Xia, Baoying Chen, Peiyu Zhuang, Sili Li, Shen Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Liujuan Cao, Rongrong Ji, Changlei Lu, Ganchao Tan

This paper reports methods and results in the DeeperForensics Challenge 2020 on real-world face forgery detection.

valid

524

Paper
Code

Dual-Level Collaborative Transformer for Image Captioning

1 code implementation • 16 Jan 2021 • Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

Descriptive region features extracted by object detection networks have played an important role in the recent advancements of image captioning.

Descriptive Image Captioning +2

193

Paper
Code

Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation

no code implementations • ICCV 2021 • Yunhang Shen, Liujuan Cao, Zhiwei Chen, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji

Weakly supervised instance segmentation (WSIS) with only image-level labels has recently drawn much attention.

Instance Segmentation object-detection +5

Paper
Add Code

EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS

no code implementations • ICCV 2021 • Qinqin Zhou, Xiawu Zheng, Liujuan Cao, Bineng Zhong, Teng Xi, Gang Zhang, Errui Ding, Mingliang Xu, Rongrong Ji

EC-DARTS decouples different operations based on their categories to optimize the operation weights so that the operation gap between them is shrinked.

Paper
Add Code

Architecture Disentanglement for Deep Neural Networks

1 code implementation • ICCV 2021 • Jie Hu, Liujuan Cao, Qixiang Ye, Tong Tong, Shengchuan Zhang, Ke Li, Feiyue Huang, Rongrong Ji, Ling Shao

Based on the experimental results, we present three new findings that provide fresh insights into the inner logic of DNNs.

AutoML Disentanglement

Paper
Code

Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

1 code implementation • CVPR 2020 • Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji

In addition, we address a key challenge in this multi-task setup, i. e., the prediction conflict, with two innovative designs namely, Consistency Energy Maximization (CEM) and Adaptive Soft Non-Located Suppression (ASNLS).

Ranked #4 on Generalized Referring Expression Comprehension on gRefCOCO

Generalized Referring Expression Comprehension Referring Expression +2

131

Paper
Code

Filter Sketch for Network Pruning

1 code implementation • 23 Jan 2020 • Mingbao Lin, Liujuan Cao, Shaojie Li, Qixiang Ye, Yonghong Tian, Jianzhuang Liu, Qi Tian, Rongrong Ji

Our approach, referred to as FilterSketch, encodes the second-order information of pre-trained weights, which enables the representation capacity of pruned networks to be recovered with a simple fine-tuning procedure.

Network Pruning

Paper
Code

Hadamard Codebook Based Deep Hashing

no code implementations • 21 Oct 2019 • Shen Chen, Liujuan Cao, Mingbao Lin, Yan Wang, Xiaoshuai Sun, Chenglin Wu, Jingfei Qiu, Rongrong Ji

Specifically, we utilize an off-the-shelf algorithm to generate a binary Hadamard codebook to satisfy the requirement of bit independence and bit balance, which subsequently serves as the desired outputs of the hash functions learning.

Deep Hashing Image Retrieval

Paper
Add Code

Semantic-aware Image Deblurring

no code implementations • 9 Oct 2019 • Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xiaoshuai Sun, Chia-Wen Lin, Jiayi Ji, Baochang Zhang, Feiyue Huang, Liujuan Cao

Specially, we propose a novel Structured-Spatial Semantic Embedding model for image deblurring (termed S3E-Deblur), which introduces a novel Structured-Spatial Semantic tree model (S3-tree) to bridge two basic tasks in computer vision: image deblurring (ImD) and image captioning (ImC).

Deblurring Image Captioning +1

Paper
Add Code

Supervised Online Hashing via Similarity Distribution Learning

no code implementations • 31 May 2019 • Mingbao Lin, Rongrong Ji, Shen Chen, Feng Zheng, Xiaoshuai Sun, Baochang Zhang, Liujuan Cao, Guodong Guo, Feiyue Huang

In this paper, we propose to model the similarity distributions between the input data and the hashing codes, upon which a novel supervised online hashing method, dubbed as Similarity Distribution based Online Hashing (SDOH), is proposed, to keep the intrinsic semantic relationship in the produced Hamming space.

Retrieval

Paper
Add Code

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning

1 code implementation • CVPR 2019 • Shaohui Lin, Rongrong Ji, Chenqian Yan, Baochang Zhang, Liujuan Cao, Qixiang Ye, Feiyue Huang, David Doermann

In this paper, we propose an effective structured pruning approach that jointly prunes filters as well as other structures in an end-to-end manner.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.