Search Results for author: Xingjian Li

Found 25 papers, 10 papers with code

Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

1 code implementation • 16 Apr 2023 • Wenke Xia, Xingjian Li, Andong Deng, Haoyi Xiong, Dejing Dou, Di Hu

However, such semantic consistency from the synchronization is hard to guarantee in unconstrained videos, due to the irrelevant modality noise and differentiated semantic correlation.

Action Recognition • Audio Tagging +3

Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources

1 code implementation • 14 Jul 2022 • Ji Liu, Daxiang Dong, Xi Wang, An Qin, Xingjian Li, Patrick Valduriez, Dejing Dou, Dianhai Yu

Although more layers and more parameters generally improve the accuracy of a model, such big models typically have high computational complexity and require large memory, which exceeds the capacity of small devices for inference and incurs long training times.

Knowledge Distillation

Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization

no code implementations • 12 Jun 2022 • Hang Hua, Xingjian Li, Dejing Dou, Cheng-Zhong Xu, Jiebo Luo

The advent of large-scale pre-trained language models has contributed greatly to the recent progress in natural language processing.

Domain Generalization • Language Modelling +3

Deep Active Learning with Noise Stability

no code implementations • 26 May 2022 • Xingjian Li, Pengkun Yang, Yangcheng Gu, Xueying Zhan, Tianyang Wang, Min Xu, Chengzhong Xu

We provide theoretical analyses by leveraging the small Gaussian noise theory and demonstrate that our method favors a subset with large and diverse gradients.

Active Learning
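
A rough illustration of the noise-stability idea in the entry above: perturb the model's parameters with small Gaussian noise and score each unlabeled sample by how much its output changes, then query the least stable samples. This is a hedged sketch of the general idea, not the paper's exact acquisition function; the function name and the hyperparameters `sigma` and `n_trials` are illustrative.

```python
# Hedged sketch of a noise-stability acquisition score: perturb the model's
# parameters with small Gaussian noise and measure how much each unlabeled
# sample's output changes. Larger score = less stable = more informative.
import copy
import torch

@torch.no_grad()
def noise_stability_scores(model, unlabeled_x, sigma=1e-3, n_trials=5):
    """Return a per-sample sensitivity score for a batch of unlabeled inputs."""
    model.eval()
    clean_out = model(unlabeled_x)                      # (N, C) reference outputs
    scores = torch.zeros(unlabeled_x.size(0))
    for _ in range(n_trials):
        noisy = copy.deepcopy(model)                    # perturbed copy of the model
        for p in noisy.parameters():
            p.add_(sigma * torch.randn_like(p))         # small Gaussian parameter noise
        diff = noisy(unlabeled_x) - clean_out
        scores += diff.flatten(1).norm(dim=1)           # per-sample output deviation
    return scores / n_trials
```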

Towards Inadequately Pre-trained Models in Transfer Learning

no code implementations • ICCV 2023 • Andong Deng, Xingjian Li, Di Hu, Tianyang Wang, Haoyi Xiong, Chengzhong Xu

Based on the contradictory phenomenon between FE and FT, in which a better feature extractor does not necessarily yield better fine-tuning accuracy, we conduct comprehensive analyses of the features before the softmax layer to provide insightful explanations.

Transfer Learning

Boosting Active Learning via Improving Test Performance

1 code implementation • 10 Dec 2021 • Tianyang Wang, Xingjian Li, Pengkun Yang, Guosheng Hu, Xiangrui Zeng, Siyu Huang, Cheng-Zhong Xu, Min Xu

In this work, we explore such an impact by theoretically proving that selecting unlabeled data with a higher gradient norm leads to a lower upper bound on the test loss, resulting in better test performance.

Active Learning • Electron Tomography +2
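
To make the selection rule in the entry above concrete, here is a simplified sketch (not the paper's exact algorithm): for each unlabeled sample, use the model's own prediction as a pseudo-label, compute the gradient norm of the loss with respect to the classifier head only, and query the samples with the largest norms. The names `backbone`, `head`, and `select_by_gradient_norm` are illustrative.

```python
# Simplified gradient-norm selection sketch: score each unlabeled sample by the
# norm of the pseudo-labeled loss gradient w.r.t. the classifier head.
import torch
import torch.nn.functional as F

def select_by_gradient_norm(backbone, head, unlabeled_inputs, k):
    """Return indices of the k unlabeled samples with the largest gradient norm.

    backbone: feature extractor; head: classifier layer (e.g. nn.Linear);
    unlabeled_inputs: tensor of shape (N, ...).
    """
    scores = []
    for xi in unlabeled_inputs:
        with torch.no_grad():                          # no gradients through the backbone
            feat = backbone(xi.unsqueeze(0))
        head.zero_grad()
        logits = head(feat)
        pseudo = logits.argmax(dim=1)                  # model prediction as pseudo-label
        loss = F.cross_entropy(logits, pseudo)
        loss.backward()                                # populates gradients of the head only
        grad_norm = torch.cat([p.grad.flatten() for p in head.parameters()]).norm()
        scores.append(grad_norm.item())
    return torch.tensor(scores).topk(k).indices
```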

SMILE: Self-Distilled MIxup for Efficient Transfer LEarning

no code implementations • 25 Mar 2021 • Xingjian Li, Haoyi Xiong, Chengzhong Xu, Dejing Dou

Performing mixup for transfer learning with pre-trained models, however, is not that simple: a high-capacity pre-trained model with a large fully-connected (FC) layer could easily overfit to the target dataset even with samples-to-labels mixed up.

Transfer Learning
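
For reference, a minimal sketch of standard input mixup, the operation the SMILE entry above builds on. This is vanilla mixup, not SMILE's self-distilled variant; the function name and the `alpha` default are illustrative.

```python
# Minimal input-mixup sketch: convex-combine pairs of inputs and their one-hot
# labels with a Beta-sampled mixing weight.
import torch

def mixup_batch(x, y, num_classes, alpha=0.2):
    """x: (B, ...) inputs, y: (B,) integer labels. Returns mixed inputs and soft labels."""
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    perm = torch.randperm(x.size(0))                       # random pairing within the batch
    x_mixed = lam * x + (1.0 - lam) * x[perm]
    y_onehot = torch.nn.functional.one_hot(y, num_classes).float()
    y_mixed = lam * y_onehot + (1.0 - lam) * y_onehot[perm]
    return x_mixed, y_mixed
```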

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

1 code implementation • 19 Mar 2021 • Xuhong LI, Haoyi Xiong, Xingjian Li, Xuanyu Wu, Xiao Zhang, Ji Liu, Jiang Bian, Dejing Dou

Then, to understand the interpretation results, we also survey the performance metrics for evaluating interpretation algorithms.

Adversarial Robustness

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

1 code implementation • CVPR 2021 • Abulikemu Abuduweili, Xingjian Li, Humphrey Shi, Cheng-Zhong Xu, Dejing Dou

To better exploit the value of both pre-trained weights and unlabeled target examples, we introduce adaptive consistency regularization that consists of two complementary components: Adaptive Knowledge Consistency (AKC) on the examples between the source and target model, and Adaptive Representation Consistency (ARC) on the target model between labeled and unlabeled examples.

Pseudo Label • Transfer Learning
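
A simplified sketch of the two consistency terms described in the entry above. The adaptive, example-wise weighting that gives AKC and ARC their names is omitted; the function names, the temperature, and the mean-feature alignment used for the ARC-style term are illustrative simplifications. In practice these terms would be added, with weights, to the supervised loss on labeled target data.

```python
# Simplified sketch of the two consistency terms (adaptive weighting omitted).
import torch
import torch.nn.functional as F

def knowledge_consistency(source_logits, target_logits, temperature=2.0):
    """AKC-style term: keep the target model's predictions close to the frozen source model's."""
    p_src = F.softmax(source_logits / temperature, dim=1)
    log_p_tgt = F.log_softmax(target_logits / temperature, dim=1)
    return F.kl_div(log_p_tgt, p_src, reduction="batchmean")

def representation_consistency(labeled_feats, unlabeled_feats):
    """ARC-style term: align the mean feature statistics of labeled and unlabeled target data."""
    return (labeled_feats.mean(dim=0) - unlabeled_feats.mean(dim=0)).pow(2).sum()
```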

Can We Use Gradient Norm as a Measure of Generalization Error for Model Selection in Practice?

no code implementations • 1 Jan 2021 • Haozhe An, Haoyi Xiong, Xuhong LI, Xingjian Li, Dejing Dou, Zhanxing Zhu

The recent theoretical investigation (Li et al., 2020) on the upper bound of generalization error of deep neural networks (DNNs) demonstrates the potential of using the gradient norm as a measure that complements validation accuracy for model selection in practice.

Model Selection

Model information as an analysis tool in deep learning

no code implementations • 1 Jan 2021 • Xiao Zhang, Di Hu, Xingjian Li, Dejing Dou, Ji Wu

We demonstrate using model information as a general analysis tool to gain insight into problems that arise in deep learning.

Temporal Relational Modeling with Self-Supervision for Action Segmentation

1 code implementation • 14 Dec 2020 • Dong Wang, Di Hu, Xingjian Li, Dejing Dou

The main reason is that the large number of nodes (i.e., video frames) makes it hard for GCNs to capture and model temporal relations in videos.

Action Recognition • Action Segmentation +1

A Neural Network Approach Applied to Multi-Agent Optimal Control

1 code implementation • 9 Nov 2020 • Derek Onken, Levon Nurbekyan, Xingjian Li, Samy Wu Fung, Stanley Osher, Lars Ruthotto

Our approach is grid-free and scales efficiently to dimensions where grids become impractical or infeasible.

Optimization and Control

Towards Accurate Knowledge Transfer via Target-awareness Representation Disentanglement

no code implementations • 16 Oct 2020 • Xingjian Li, Di Hu, Xuhong LI, Haoyi Xiong, Zhi Ye, Zhipeng Wang, Chengzhong Xu, Dejing Dou

Fine-tuning deep neural networks pre-trained on large-scale datasets is one of the most practical transfer learning paradigms given a limited quantity of training samples.

Disentanglement • Transfer Learning

Measuring Information Transfer in Neural Networks

no code implementations • 16 Sep 2020 • Xiao Zhang, Xingjian Li, Dejing Dou, Ji Wu

We propose a practical measure of the generalizable information in a neural network model based on prequential coding, which we term Information Transfer ($L_{IT}$).

Continual Learning • Transfer Learning
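
The prequential-coding idea behind the measure in the entry above can be sketched as follows: split the data into chunks, repeatedly train on a growing prefix, and accumulate the log-loss on each next chunk; the accumulated code length is the quantity of interest. This is a generic prequential code-length sketch, not the paper's exact definition of $L_{IT}$; `make_model` and `train_fn` are assumed helper functions.

```python
# Generic prequential code-length sketch: train on a growing prefix of the data
# and accumulate the log-loss (in nats) on each next chunk.
import torch
import torch.nn.functional as F

def prequential_code_length(make_model, train_fn, chunks):
    """chunks: list of (x, y) tensor pairs. make_model() builds a fresh model;
    train_fn(model, xs, ys) fits it on the given data."""
    total_nats = 0.0
    for i in range(1, len(chunks)):
        model = make_model()
        xs = torch.cat([c[0] for c in chunks[:i]])     # all data seen so far
        ys = torch.cat([c[1] for c in chunks[:i]])
        train_fn(model, xs, ys)
        with torch.no_grad():
            x_next, y_next = chunks[i]                 # code length of the next chunk
            total_nats += F.cross_entropy(model(x_next), y_next, reduction="sum").item()
    return total_nats
```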

XMixup: Efficient Transfer Learning with Auxiliary Samples by Cross-domain Mixup

no code implementations • 20 Jul 2020 • Xingjian Li, Haoyi Xiong, Haozhe An, Cheng-Zhong Xu, Dejing Dou

While existing multitask learning algorithms need to run backpropagation over both the source and target datasets and usually incur a higher gradient complexity, XMixup transfers knowledge from the source to the target task more efficiently: for every class of the target task, XMixup selects auxiliary samples from the source dataset and augments the training samples via a simple mixup strategy.

Transfer Learning
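
A hedged sketch of the cross-domain mixup step described in the XMixup entry above, assuming a precomputed mapping from each target class to an auxiliary source class and inputs of matching shape. The auxiliary-class selection, the label mixing, and the fixed `lam` are simplifications; all names here are illustrative.

```python
# Hedged cross-domain mixup sketch: mix each target sample with a source sample
# from its mapped auxiliary class.
import torch

def cross_domain_mixup(x_tgt, y_tgt, source_pool, class_map, lam=0.7):
    """x_tgt: (B, ...) target inputs; y_tgt: (B,) target labels.
    source_pool: dict of source class id -> tensor of source inputs (N_c, ...).
    class_map: dict of target class id -> auxiliary source class id.
    Assumes source and target inputs share the same shape.
    """
    mixed = []
    for xi, yi in zip(x_tgt, y_tgt):
        src_class = class_map[int(yi)]
        pool = source_pool[src_class]
        xj = pool[torch.randint(len(pool), (1,)).item()]   # random auxiliary sample
        mixed.append(lam * xi + (1.0 - lam) * xj)
    return torch.stack(mixed), y_tgt   # the full method also mixes the labels accordingly
```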

RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr

1 code implementation • ICML 2020 • Xingjian Li, Haoyi Xiong, Haozhe An, Cheng-Zhong Xu, Dejing Dou

RIFLE brings meaningful updates to the weights of deep CNN layers and improves low-level feature learning, while the effects of the randomization are easily absorbed over the course of the overall learning procedure.

Transfer Learning
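
A minimal sketch of the re-initialization schedule that the RIFLE entry above describes: fine-tune the whole network, but re-initialize the fully-connected output layer every few epochs. The optimizer, schedule, and hyperparameters are illustrative, not the paper's exact settings.

```python
# Minimal RIFLE-style fine-tuning sketch: periodically re-initialize the FC layer.
import torch
import torch.nn as nn
import torch.nn.functional as F

def finetune_with_rifle(backbone, fc: nn.Linear, loader, epochs=30, reinit_every=10, lr=1e-3):
    optimizer = torch.optim.SGD(
        list(backbone.parameters()) + list(fc.parameters()), lr=lr, momentum=0.9
    )
    for epoch in range(epochs):
        if epoch > 0 and epoch % reinit_every == 0:
            fc.reset_parameters()        # re-initialize the FC layer in place
        for x, y in loader:
            optimizer.zero_grad()
            loss = F.cross_entropy(fc(backbone(x)), y)
            loss.backward()
            optimizer.step()
    return backbone, fc
```

Because `reset_parameters()` modifies the existing parameter tensors in place, the optimizer keeps tracking the same parameters and does not need to be rebuilt.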

OT-Flow: Fast and Accurate Continuous Normalizing Flows via Optimal Transport

3 code implementations • 29 May 2020 • Derek Onken, Samy Wu Fung, Xingjian Li, Lars Ruthotto

On five high-dimensional density estimation and generative modeling tasks, OT-Flow performs competitively with state-of-the-art CNFs while on average requiring one-fourth as many weights, with an 8x speedup in training time and a 24x speedup in inference.

Density Estimation

Pay Attention to Features, Transfer Learn Faster CNNs

no code implementations • ICLR 2020 • Kafeng Wang, Xitong Gao, Yiren Zhao, Xingjian Li, Dejing Dou, Cheng-Zhong Xu

Deep convolutional neural networks are now widely deployed in vision applications, but a limited amount of training data can restrict their task performance.

Transfer Learning

COLAM: Co-Learning of Deep Neural Networks and Soft Labels via Alternating Minimization

no code implementations • 26 Apr 2020 • Xingjian Li, Haoyi Xiong, Haozhe An, Dejing Dou, Chengzhong Xu

Softening labels of training datasets with respect to data representations has been frequently used to improve the training of deep neural networks (DNNs).

General Classification

Towards Making Deep Transfer Learning Never Hurt

no code implementations • 18 Nov 2019 • Ruosi Wan, Haoyi Xiong, Xingjian Li, Zhanxing Zhu, Jun Huan

The empirical results show that the proposed descent direction estimation strategy DTNH can always improve the performance of deep transfer learning tasks based on all of the above regularizers, even when transferring pre-trained weights from inappropriate networks.

Knowledge Distillation • Transfer Learning

Improving Adversarial Robustness via Attention and Adversarial Logit Pairing

no code implementations • 23 Aug 2019 • Dou Goodman, Xingjian Li, Ji Liu, Dejing Dou, Tao Wei

Finally, we conduct extensive experiments using a wide range of datasets, and the results show that our AT+ALP achieves state-of-the-art defense performance.

Adversarial Robustness

An Empirical Study on Regularization of Deep Neural Networks by Local Rademacher Complexity

no code implementations • 3 Feb 2019 • Yingzhen Yang, Jiahui Yu, Xingjian Li, Jun Huan, Thomas S. Huang

In this paper, we investigate the role of Rademacher complexity in improving generalization of DNNs and propose a novel regularizer rooted in Local Rademacher Complexity (LRC).

Neural Architecture Search
