Search Results for author: Di Xie

Found 53 papers, 18 papers with code

Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

1 code implementation • 8 Mar 2024 • Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, ShiLiang Pu

Recently, arbitrary-scale point cloud upsampling mechanism became increasingly popular due to its efficiency and convenience for practical applications.

point cloud upsampling

Paper
Code

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model

no code implementations • 8 Mar 2024 • Pengwei Yin, Guanzhong Zeng, Jingjing Wang, Di Xie

To overcome these limitations, we propose a novel framework called CLIP-Gaze that utilizes a pre-trained vision-language model to leverage its transferable knowledge.

Domain Generalization Gaze Estimation +1

Paper
Add Code

Learning Expressive And Generalizable Motion Features For Face Forgery Detection

no code implementations • 8 Mar 2024 • Jingyi Zhang, Peng Zhang, Jingjing Wang, Di Xie, ShiLiang Pu

However, current sequence-based face forgery detection methods use general video classification networks directly, which discard the special and discriminative motion information for face manipulation detection.

Anomaly Detection Classification +1

Paper
Add Code

"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

1 code implementation • 1 Mar 2024 • Lingyu Gu, Yongqi Du, Yuan Zhang, Di Xie, ShiLiang Pu, Robert C. Qiu, Zhenyu Liao

Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging.

Model Compression Quantization

Paper
Code

Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models

no code implementations • 25 Oct 2023 • WeiJie Chen, Haoyu Wang, Shicai Yang, Lei Zhang, Wei Wei, Yanning Zhang, Luojun Lin, Di Xie, Yueting Zhuang

Such a one-for-all adaptation paradigm allows us to adapt anything in the world using only one text-to-image generator as well as the corresponding unlabeled target data.

Domain Adaptation Image Classification

Paper
Add Code

Single Domain Dynamic Generalization for Iris Presentation Attack Detection

no code implementations • 22 May 2023 • Yachun Li, Jingjing Wang, Yuhui Chen, Di Xie, ShiLiang Pu

To tackle the above issues, we propose a Single Domain Dynamic Generalization (SDDG) framework, which simultaneously exploits domain-invariant and domain-specific features on a per-sample basis and learns to generalize to various unseen domains with numerous natural images.

Domain Generalization Meta-Learning

Paper
Add Code

Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation

1 code implementation • CVPR 2023 • Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, ShiLiang Pu

Most existing approaches for point cloud normal estimation aim to locally fit a geometric surface and calculate the normal from the fitted surface.

Paper
Code

1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection Track

no code implementations • 12 Jan 2023 • Wei Zhao, Binbin Chen, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang

The domain adaptation part is implemented as a Source-Free Domain Adaptation paradigm, which only uses the pre-trained model and the unlabeled target data to further optimize in a self-supervised training manner.

Domain Generalization object-detection +3

Paper
Add Code

1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track

no code implementations • 12 Jan 2023 • Yilu Guo, Xingyue Shi, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang

In the test-time training stage, we use the pre-trained model to assign noisy label for the unlabeled target data, and propose a Label-Periodically-Updated DivideMix method for noisy label learning.

Data Augmentation Domain Generalization +2

Paper
Add Code

Unsupervised Prompt Tuning for Text-Driven Object Detection

no code implementations • ICCV 2023 • Weizhen He, WeiJie Chen, Binbin Chen, Shicai Yang, Di Xie, Luojun Lin, Donglian Qi, Yueting Zhuang

In this paper, we delve into this problem and propose an Unsupervised Prompt Tuning framework for text-driven object detection, which is composed of two novel mean teaching mechanisms.

Data Augmentation Object +4

Paper
Add Code

NeRF-Gaze: A Head-Eye Redirection Parametric Model for Gaze Estimation

no code implementations • 30 Dec 2022 • Pengwei Yin, Jiawu Dai, Jingjing Wang, Di Xie, ShiLiang Pu

Gaze estimation is the fundamental basis for many visual tasks.

Domain Generalization Gaze Estimation +1

Paper
Add Code

Attention Diversification for Domain Generalization

1 code implementation • 9 Oct 2022 • Rang Meng, Xianfeng Li, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, ShiLiang Pu

Under this guidance, a novel Attention Diversification framework is proposed, in which Intra-Model and Inter-Model Attention Diversification Regularization are collaborated to reassign appropriate attention to diverse task-related features.

Domain Generalization

Paper
Code

Multi-Scale Wavelet Transformer for Face Forgery Detection

no code implementations • 8 Oct 2022 • Jie Liu, Jingjing Wang, Peng Zhang, Chunmao Wang, Di Xie, ShiLiang Pu

To overcome these limitations, we propose a multi-scale wavelet transformer framework for face forgery detection.

Paper
Add Code

Point Cloud Upsampling via Cascaded Refinement Network

1 code implementation • 8 Oct 2022 • Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, ShiLiang Pu

In this manner, the proposed cascaded refinement network can be easily optimized without extra learning strategies.

point cloud upsampling

Paper
Code

FBNet: Feedback Network for Point Cloud Completion

1 code implementation • 8 Oct 2022 • Xuejun Yan, Hongyu Yan, Jingjing Wang, Hang Du, Zhihong Wu, Di Xie, ShiLiang Pu, Li Lu

The rapid development of point cloud learning has driven point cloud completion into a new era.

Point Cloud Completion

Paper
Code

Semi-supervised Ranking for Object Image Blur Assessment

1 code implementation • 13 Jul 2022 • Qiang Li, Zhaoliang Yao, Jingjing Wang, Ye Tian, Pengju Yang, Di Xie, ShiLiang Pu

Based on this dataset, we propose a method to obtain the blur scores only with the pairwise rank labels as supervision.

Object Object Recognition +1

Paper
Code

Label Matching Semi-Supervised Object Detection

3 code implementations • CVPR 2022 • Binbin Chen, WeiJie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, ShiLiang Pu, Mingli Song, Yueting Zhuang

To remedy this issue, we present a novel label assignment mechanism for self-training framework, namely proposal self-assignment, which injects the proposals from student into teacher and generates accurate pseudo labels to match each proposal in the student model accordingly.

Object object-detection +2

137

Paper
Code

Slimmable Domain Adaptation

1 code implementation • CVPR 2022 • Rang Meng, WeiJie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, ShiLiang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang

In this paper, we introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank, from which models of different capacities can be sampled to accommodate different accuracy-efficiency trade-offs.

Domain Generalization Unsupervised Domain Adaptation

Paper
Code

2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach

no code implementations • 13 Jun 2022 • Yilu Guo, Shicai Yang, WeiJie Chen, Liang Ma, Di Xie, ShiLiang Pu

Therefore, it is crucial to study how to learn more discriminative representations while avoiding over-fitting.

Data Augmentation Image Classification

Paper
Add Code

Learning Domain Adaptive Object Detection with Probabilistic Teacher

2 code implementations • 13 Jun 2022 • Meilin Chen, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Yunfeng Yan, Donglian Qi, Yueting Zhuang, Di Xie, ShiLiang Pu

In addition, we conduct anchor adaptation in parallel with localization adaptation, since anchor can be regarded as a learnable parameter.

Object object-detection +1

Paper
Code

Transductive CLIP with Class-Conditional Contrastive Learning

no code implementations • 13 Jun 2022 • Junchu Huang, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang

This framework can reduce the impact of noisy labels from CLIP model effectively by combining both techniques.

Contrastive Learning Pseudo Label +1

Paper
Add Code

Self-distilled Knowledge Delegator for Exemplar-free Class Incremental Learning

no code implementations • 23 May 2022 • Fanfan Ye, Liang Ma, Qiaoyong Zhong, Di Xie, ShiLiang Pu

The knowledge extracted by the delegator is then utilized to maintain the performance of the model on old tasks in incremental learning.

Class Incremental Learning Incremental Learning

Paper
Add Code

KRNet: Towards Efficient Knowledge Replay

no code implementations • 23 May 2022 • Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu

However, the number of stored latent codes in autoencoder increases linearly with the scale of data and the trained encoder is redundant for the replaying stage.

Continual Learning Domain Adaptation

Paper
Add Code

Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks

1 code implementation • 31 Mar 2022 • Da-Wei Zhou, Han-Jia Ye, Liang Ma, Di Xie, ShiLiang Pu, De-Chuan Zhan

In this work, we propose a new paradigm for FSCIL based on meta-learning by LearnIng Multi-phase Incremental Tasks (LIMIT), which synthesizes fake FSCIL tasks from the base dataset.

Ranked #5 on Few-Shot Class-Incremental Learning on CIFAR-100

Few-Shot Class-Incremental Learning Incremental Learning +1

Paper
Code

Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition

1 code implementation • 8 Dec 2021 • Kailin Xu, Fanfan Ye, Qiaoyong Zhong, Di Xie

In particular, we develop a novel cross-channel feature augmentation module, which is a combo of map-attend-group-map operations.

Action Recognition Skeleton Based Action Recognition

155

Paper
Code

Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning

no code implementations • 6 Sep 2021 • Ning Wei, Jiahua Liang, Di Xie, ShiLiang Pu

Designing optimal reward functions has been desired but extremely difficult in reinforcement learning (RL).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

TransForensics: Image Forgery Localization with Dense Self-Attention

no code implementations • ICCV 2021 • Jing Hao, Zhixin Zhang, Shicai Yang, Di Xie, ShiLiang Pu

Nowadays advanced image editing tools and technical skills produce tampered images more realistically, which can easily evade image forensic systems and make authenticity verification of images more difficult.

Paper
Add Code

Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly Detection

no code implementations • ICCV 2021 • Jinlei Hou, Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu, Hong Zhou

Surprisingly, by varying the granularity of division on feature maps, we are able to modulate the reconstruction capability of the model for both normal and abnormal samples.

Unsupervised Anomaly Detection

Paper
Add Code

Modulating Localization and Classification for Harmonized Object Detection

no code implementations • 16 Mar 2021 • Taiheng Zhang, Qiaoyong Zhong, ShiLiang Pu, Di Xie

Object detection involves two sub-tasks, i. e. localizing objects in an image and classifying them into various categories.

Classification General Classification +3

Paper
Add Code

Self-Supervised Noisy Label Learning for Source-Free Unsupervised Domain Adaptation

no code implementations • 23 Feb 2021 • WeiJie Chen, Luojun Lin, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang, Wenqi Ren

Usually, the given source domain pre-trained model is expected to optimize with only unlabeled target data, which is termed as source-free unsupervised domain adaptation.

Self-Supervised Learning Unsupervised Domain Adaptation

Paper
Add Code

Box Re-Ranking: Unsupervised False Positive Suppression for Domain Adaptive Pedestrian Detection

no code implementations • 1 Feb 2021 • WeiJie Chen, Yilu Guo, Shicai Yang, Zhaoyang Li, Zhenxin Ma, Binbin Chen, Long Zhao, Di Xie, ShiLiang Pu, Yueting Zhuang

Therefore, it yields our attention to suppress false positive in each target domain in an unsupervised way.

object-detection Object Detection +2

Paper
Add Code

A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data

no code implementations • 10 Dec 2020 • Xianfeng Li, WeiJie Chen, Di Xie, Shicai Yang, Peng Yuan, ShiLiang Pu, Yueting Zhuang

However, it is difficult to evaluate the quality of pseudo labels since no labels are available in target domain.

Data Augmentation Learning with noisy labels +4

Paper
Add Code

MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination

no code implementations • 2 Oct 2020 • Shengyu Zhang, Donghui Wang, Zhou Zhao, Siliang Tang, Di Xie, Fei Wu

In this paper, we investigate the problem of text-to-pedestrian synthesis, which has many potential applications in art, design, and video surveillance.

Generative Adversarial Network Image Generation

Paper
Add Code

Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

1 code implementation • 29 Jul 2020 • Fanfan Ye, ShiLiang Pu, Qiaoyong Zhong, Chao Li, Di Xie, Huiming Tang

The key lies in the design of the graph structure, which encodes skeleton topology information.

Ranked #10 on Skeleton Based Action Recognition on Kinetics-Skeleton dataset

Action Recognition Skeleton Based Action Recognition

155

Paper
Code

Unsupervised Image Classification for Deep Representation Learning

1 code implementation • 20 Jun 2020 • Wei-Jie Chen, ShiLiang Pu, Di Xie, Shicai Yang, Yilu Guo, Luojun Lin

Extensive experiments on ImageNet dataset have been conducted to prove the effectiveness of our method.

Classification Clustering +13

Paper
Code

IROS 2019 Lifelong Robotic Vision Challenge -- Lifelong Object Recognition Report

no code implementations • 26 Apr 2020 • Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, ShiLiang Pu, Debdoot Sheet, Soonyong Song, Youngsung Son, Zhengwei Wang, Tomas E. Ward, Jianwen Wu, Meiqing Wu, Di Xie, Yangsheng Xu, Lin Yang, Qiaoyong Zhong, Liguang Zhou

This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams).

Continual Learning Object +1

Paper
Add Code

Neural Inheritance Relation Guided One-Shot Layer Assignment Search

no code implementations • 28 Feb 2020 • Rang Meng, Wei-Jie Chen, Di Xie, Yuan Zhang, ShiLiang Pu

In this paper, for the first time, we systematically investigate the impact of different layer assignments to the network performance by building an architecture dataset of layer assignment on CIFAR-100.

Neural Architecture Search Relation

Paper
Add Code

An End-to-End Audio Classification System based on Raw Waveforms and Mix-Training Strategy

no code implementations • 21 Nov 2019 • Jiaxu Chen, Jing Hao, Kai Chen, Di Xie, Shicai Yang, ShiLiang Pu

This paper introduces an end-to-end audio classification system based on raw waveforms and mix-training strategy.

Audio Classification General Classification +1

Paper
Add Code

Fast Task Adaptation for Few-Shot Learning

no code implementations • 25 Sep 2019 • Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu

The key lies in generalization of prior knowledge learned from large-scale base classes and fast adaptation of the classifier to novel classes.

Few-Shot Learning

Paper
Add Code

All You Need is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification

3 code implementations • CVPR 2019 • Wei-Jie Chen, Di Xie, Yuan Zhang, ShiLiang Pu

In this family of architectures, the basic block is only composed by 1x1 convolutional layers with only a few shift operations applied to the intermediate feature maps.

General Classification Image Classification +1

Paper
Code

Collaborative Spatio-temporal Feature Learning for Video Action Recognition

1 code implementation • 4 Mar 2019 • Chao Li, Qiaoyong Zhong, Di Xie, ShiLiang Pu

By sharing the convolution kernels of different views, spatial and temporal features are collaboratively learned and thus benefit from each other.

Action Recognition In Videos Temporal Action Localization +1

Paper
Code

Learning Incremental Triplet Margin for Person Re-identification

no code implementations • 17 Dec 2018 • Yingying Zhang, Qiaoyong Zhong, Liang Ma, Di Xie, ShiLiang Pu

In particular, we propose a novel multi-stage training strategy which learns incremental triplet margin and improves triplet loss effectively.

Metric Learning Person Re-Identification

Paper
Add Code

A Layer Decomposition-Recomposition Framework for Neuron Pruning towards Accurate Lightweight Networks

no code implementations • 17 Dec 2018 • Wei-Jie Chen, Yuan Zhang, Di Xie, ShiLiang Pu

A better alternative is to propagate the entire useful information to reconstruct the pruned layer instead of directly discarding the less important neurons.

Paper
Add Code

Small-scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation

no code implementations • ECCV 2018 • Tao Song, Leiyu Sun, Di Xie, Haiming Sun, ShiLiang Pu

A critical issue in pedestrian detection is to detect small-scale objects that will introduce feeble contrast and motion blur in images and videos, which in our opinion should partially resort to deep-rooted annotation bias.

Pedestrian Detection

Paper
Add Code

Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions

no code implementations • 27 Aug 2018 • Ke Ning, Linchao Zhu, Ming Cai, Yi Yang, Di Xie, Fei Wu

We validate the effectiveness of our ASST on two large-scale datasets.

Translation Video Description

Paper
Add Code

Extreme Network Compression via Filter Group Approximation

no code implementations • ECCV 2018 • Bo Peng, Wenming Tan, Zheyang Li, Shun Zhang, Di Xie, ShiLiang Pu

In this paper we propose a novel decomposition method based on filter group approximation, which can significantly reduce the redundancy of deep convolutional neural networks (CNNs) while maintaining the majority of feature representation.

General Classification Image Classification