no code implementations • 22 May 2023 • Yachun Li, Jingjing Wang, Yuhui Chen, Di Xie, ShiLiang Pu
To tackle the above issues, we propose a Single Domain Dynamic Generalization (SDDG) framework, which simultaneously exploits domain-invariant and domain-specific features on a per-sample basis and learns to generalize to various unseen domains with numerous natural images.
1 code implementation • CVPR 2023 • Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, ShiLiang Pu
Most existing approaches for point cloud normal estimation aim to locally fit a geometric surface and calculate the normal from the fitted surface.
no code implementations • 12 Jan 2023 • Yilu Guo, Xingyue Shi, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang
In the test-time training stage, we use the pre-trained model to assign noisy label for the unlabeled target data, and propose a Label-Periodically-Updated DivideMix method for noisy label learning.
no code implementations • 12 Jan 2023 • Wei Zhao, Binbin Chen, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang
The domain adaptation part is implemented as a Source-Free Domain Adaptation paradigm, which only uses the pre-trained model and the unlabeled target data to further optimize in a self-supervised training manner.
no code implementations • 30 Dec 2022 • Pengwei Yin, Jiawu Dai, Jingjing Wang, Di Xie, ShiLiang Pu
Gaze estimation is the fundamental basis for many visual tasks.
1 code implementation • 9 Oct 2022 • Rang Meng, Xianfeng Li, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, ShiLiang Pu
Under this guidance, a novel Attention Diversification framework is proposed, in which Intra-Model and Inter-Model Attention Diversification Regularization are collaborated to reassign appropriate attention to diverse task-related features.
no code implementations • 8 Oct 2022 • Jie Liu, Jingjing Wang, Peng Zhang, Chunmao Wang, Di Xie, ShiLiang Pu
To overcome these limitations, we propose a multi-scale wavelet transformer framework for face forgery detection.
1 code implementation • 8 Oct 2022 • Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, ShiLiang Pu
In this manner, the proposed cascaded refinement network can be easily optimized without extra learning strategies.
1 code implementation • 8 Oct 2022 • Xuejun Yan, Hongyu Yan, Jingjing Wang, Hang Du, Zhihong Wu, Di Xie, ShiLiang Pu, Li Lu
The rapid development of point cloud learning has driven point cloud completion into a new era.
1 code implementation • 13 Jul 2022 • Qiang Li, Zhaoliang Yao, Jingjing Wang, Ye Tian, Pengju Yang, Di Xie, ShiLiang Pu
Based on this dataset, we propose a method to obtain the blur scores only with the pairwise rank labels as supervision.
1 code implementation • CVPR 2022 • Rang Meng, WeiJie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, ShiLiang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang
In this paper, we introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank, from which models of different capacities can be sampled to accommodate different accuracy-efficiency trade-offs.
3 code implementations • CVPR 2022 • Binbin Chen, WeiJie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, ShiLiang Pu, Mingli Song, Yueting Zhuang
To remedy this issue, we present a novel label assignment mechanism for self-training framework, namely proposal self-assignment, which injects the proposals from student into teacher and generates accurate pseudo labels to match each proposal in the student model accordingly.
no code implementations • 13 Jun 2022 • Yilu Guo, Shicai Yang, WeiJie Chen, Liang Ma, Di Xie, ShiLiang Pu
Therefore, it is crucial to study how to learn more discriminative representations while avoiding over-fitting.
no code implementations • 13 Jun 2022 • Junchu Huang, WeiJie Chen, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang
This framework can reduce the impact of noisy labels from CLIP model effectively by combining both techniques.
1 code implementation • 13 Jun 2022 • Meilin Chen, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Yunfeng Yan, Donglian Qi, Yueting Zhuang, Di Xie, ShiLiang Pu
In addition, we conduct anchor adaptation in parallel with localization adaptation, since anchor can be regarded as a learnable parameter.
no code implementations • 23 May 2022 • Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu
However, the number of stored latent codes in autoencoder increases linearly with the scale of data and the trained encoder is redundant for the replaying stage.
no code implementations • 23 May 2022 • Fanfan Ye, Liang Ma, Qiaoyong Zhong, Di Xie, ShiLiang Pu
The knowledge extracted by the delegator is then utilized to maintain the performance of the model on old tasks in incremental learning.
1 code implementation • 31 Mar 2022 • Da-Wei Zhou, Han-Jia Ye, Liang Ma, Di Xie, ShiLiang Pu, De-Chuan Zhan
In this work, we propose a new paradigm for FSCIL based on meta-learning by LearnIng Multi-phase Incremental Tasks (LIMIT), which synthesizes fake FSCIL tasks from the base dataset.
Ranked #4 on
Few-Shot Class-Incremental Learning
on CIFAR-100
class-incremental learning
Few-Shot Class-Incremental Learning
+2
1 code implementation • 8 Dec 2021 • Kailin Xu, Fanfan Ye, Qiaoyong Zhong, Di Xie
In particular, we develop a novel cross-channel feature augmentation module, which is a combo of map-attend-group-map operations.
no code implementations • 6 Sep 2021 • Ning Wei, Jiahua Liang, Di Xie, ShiLiang Pu
Designing optimal reward functions has been desired but extremely difficult in reinforcement learning (RL).
no code implementations • ICCV 2021 • Jing Hao, Zhixin Zhang, Shicai Yang, Di Xie, ShiLiang Pu
Nowadays advanced image editing tools and technical skills produce tampered images more realistically, which can easily evade image forensic systems and make authenticity verification of images more difficult.
no code implementations • ICCV 2021 • Jinlei Hou, Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu, Hong Zhou
Surprisingly, by varying the granularity of division on feature maps, we are able to modulate the reconstruction capability of the model for both normal and abnormal samples.
no code implementations • 16 Mar 2021 • Taiheng Zhang, Qiaoyong Zhong, ShiLiang Pu, Di Xie
Object detection involves two sub-tasks, i. e. localizing objects in an image and classifying them into various categories.
no code implementations • 23 Feb 2021 • WeiJie Chen, Luojun Lin, Shicai Yang, Di Xie, ShiLiang Pu, Yueting Zhuang, Wenqi Ren
Usually, the given source domain pre-trained model is expected to optimize with only unlabeled target data, which is termed as source-free unsupervised domain adaptation.
no code implementations • 1 Feb 2021 • WeiJie Chen, Yilu Guo, Shicai Yang, Zhaoyang Li, Zhenxin Ma, Binbin Chen, Long Zhao, Di Xie, ShiLiang Pu, Yueting Zhuang
Therefore, it yields our attention to suppress false positive in each target domain in an unsupervised way.
no code implementations • 10 Dec 2020 • Xianfeng Li, WeiJie Chen, Di Xie, Shicai Yang, Peng Yuan, ShiLiang Pu, Yueting Zhuang
However, it is difficult to evaluate the quality of pseudo labels since no labels are available in target domain.
no code implementations • 2 Oct 2020 • Shengyu Zhang, Donghui Wang, Zhou Zhao, Siliang Tang, Di Xie, Fei Wu
In this paper, we investigate the problem of text-to-pedestrian synthesis, which has many potential applications in art, design, and video surveillance.
1 code implementation • 29 Jul 2020 • Fanfan Ye, ShiLiang Pu, Qiaoyong Zhong, Chao Li, Di Xie, Huiming Tang
The key lies in the design of the graph structure, which encodes skeleton topology information.
1 code implementation • 20 Jun 2020 • Wei-Jie Chen, ShiLiang Pu, Di Xie, Shicai Yang, Yilu Guo, Luojun Lin
Extensive experiments on ImageNet dataset have been conducted to prove the effectiveness of our method.
no code implementations • 26 Apr 2020 • Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, ShiLiang Pu, Debdoot Sheet, Soonyong Song, Youngsung Son, Zhengwei Wang, Tomas E. Ward, Jianwen Wu, Meiqing Wu, Di Xie, Yangsheng Xu, Lin Yang, Qiaoyong Zhong, Liguang Zhou
This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams).
no code implementations • 28 Feb 2020 • Rang Meng, Wei-Jie Chen, Di Xie, Yuan Zhang, ShiLiang Pu
In this paper, for the first time, we systematically investigate the impact of different layer assignments to the network performance by building an architecture dataset of layer assignment on CIFAR-100.
no code implementations • 21 Nov 2019 • Jiaxu Chen, Jing Hao, Kai Chen, Di Xie, Shicai Yang, ShiLiang Pu
This paper introduces an end-to-end audio classification system based on raw waveforms and mix-training strategy.
no code implementations • 25 Sep 2019 • Yingying Zhang, Qiaoyong Zhong, Di Xie, ShiLiang Pu
The key lies in generalization of prior knowledge learned from large-scale base classes and fast adaptation of the classifier to novel classes.
3 code implementations • CVPR 2019 • Wei-Jie Chen, Di Xie, Yuan Zhang, ShiLiang Pu
In this family of architectures, the basic block is only composed by 1x1 convolutional layers with only a few shift operations applied to the intermediate feature maps.
1 code implementation • 4 Mar 2019 • Chao Li, Qiaoyong Zhong, Di Xie, ShiLiang Pu
By sharing the convolution kernels of different views, spatial and temporal features are collaboratively learned and thus benefit from each other.
Action Recognition In Videos
Temporal Action Localization
+1
no code implementations • 17 Dec 2018 • Wei-Jie Chen, Yuan Zhang, Di Xie, ShiLiang Pu
A better alternative is to propagate the entire useful information to reconstruct the pruned layer instead of directly discarding the less important neurons.
no code implementations • 17 Dec 2018 • Yingying Zhang, Qiaoyong Zhong, Liang Ma, Di Xie, ShiLiang Pu
In particular, we propose a novel multi-stage training strategy which learns incremental triplet margin and improves triplet loss effectively.
no code implementations • ECCV 2018 • Tao Song, Leiyu Sun, Di Xie, Haiming Sun, ShiLiang Pu
A critical issue in pedestrian detection is to detect small-scale objects that will introduce feeble contrast and motion blur in images and videos, which in our opinion should partially resort to deep-rooted annotation bias.
no code implementations • 27 Aug 2018 • Ke Ning, Linchao Zhu, Ming Cai, Yi Yang, Di Xie, Fei Wu
We validate the effectiveness of our ASST on two large-scale datasets.
no code implementations • ECCV 2018 • Bo Peng, Wenming Tan, Zheyang Li, Shun Zhang, Di Xie, ShiLiang Pu
In this paper we propose a novel decomposition method based on filter group approximation, which can significantly reduce the redundancy of deep convolutional neural networks (CNNs) while maintaining the majority of feature representation.
no code implementations • 4 Jul 2018 • Tao Song, Leiyu Sun, Di Xie, Haiming Sun, ShiLiang Pu
A critical issue in pedestrian detection is to detect small-scale objects that will introduce feeble contrast and motion blur in images and videos, which in our opinion should partially resort to deep-rooted annotation bias.
Ranked #17 on
Pedestrian Detection
on CityPersons
no code implementations • 16 May 2018 • Xiaodan Song, Jiabao Yao, Lulu Zhou, Li Wang, Xiaoyang Wu, Di Xie, ShiLiang Pu
It aims to design a single CNN model with low redundancy to adapt to decoded frames with different qualities and ensure consistency.
Multimedia
6 code implementations • 17 Apr 2018 • Chao Li, Qiaoyong Zhong, Di Xie, ShiLiang Pu
Skeleton-based human action recognition has recently drawn increasing attentions with the availability of large-scale skeleton datasets.
Ranked #2 on
Skeleton Based Action Recognition
on PKU-MMD
no code implementations • 30 Oct 2017 • Qiaoyong Zhong, Chao Li, Yingying Zhang, Di Xie, Shicai Yang, ShiLiang Pu
Deep region-based object detector consists of a region proposal step and a deep object recognition step.
1 code implementation • 25 Apr 2017 • Chao Li, Qiaoyong Zhong, Di Xie, ShiLiang Pu
Current state-of-the-art approaches to skeleton-based action recognition are mostly based on recurrent neural networks (RNN).
Ranked #3 on
Skeleton Based Action Recognition
on PKU-MMD
no code implementations • CVPR 2017 • Di Xie, Jiang Xiong, ShiLiang Pu
Moreover, we can successfully train plain CNNs to match the performance of the residual counterparts.
no code implementations • 19 Oct 2016 • Haiming Sun, Di Xie, ShiLiang Pu
Semantic segmentation is challenging as it requires both object-level information and pixel-level accuracy.