Search Results for author: Yulin Wang

Found 26 papers, 19 papers with code

Implicit Semantic Data Augmentation for Deep Networks

1 code implementation • NeurIPS 2019 • Yulin Wang, Xuran Pan, Shiji Song, Hong Zhang, Cheng Wu, Gao Huang

Our work is motivated by the intriguing property that deep networks are surprisingly good at linearizing features, such that certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., adding sunglasses or changing backgrounds.

Image Augmentation

575

Regularizing Deep Networks with Semantic Data Augmentation

1 code implementation • 21 Jul 2020 • Yulin Wang, Gao Huang, Shiji Song, Xuran Pan, Yitong Xia, Cheng Wu

The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object.

Data Augmentation

575

Deep Incubation: Training Large Models by Divide-and-Conquering

3 code implementations • ICCV 2023 • Zanlin Ni, Yulin Wang, Jiangwei Yu, Haojun Jiang, Yue Cao, Gao Huang

In this paper, we present Deep Incubation, a novel approach that enables the efficient and effective training of large models by dividing them into smaller sub-modules that can be trained separately and assembled seamlessly.

Image Segmentation object-detection +2

254

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition

2 code implementations • NeurIPS 2021 • Yulin Wang, Rui Huang, Shiji Song, Zeyi Huang, Gao Huang

Inspired by this phenomenon, we propose a Dynamic Transformer to automatically configure a proper number of tokens for each input image.

Ranked #29 on Image Classification on CIFAR-100 (using extra training data)

Computational Efficiency Image Classification

241

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification

1 code implementation • NeurIPS 2020 • Yulin Wang, Kangchen Lv, Rui Huang, Shiji Song, Le Yang, Gao Huang

The accuracy of deep convolutional neural networks (CNNs) generally improves when fueled with high-resolution images.

Computational Efficiency General Classification +1

180

Glance and Focus Networks for Dynamic Visual Recognition

1 code implementation • 9 Jan 2022 • Gao Huang, Yulin Wang, Kangchen Lv, Haojun Jiang, Wenhui Huang, Pengfei Qi, Shiji Song

Spatial redundancy widely exists in visual recognition tasks, i.e., discriminative features in an image or video frame usually correspond to only a subset of pixels, while the remaining regions are irrelevant to the task at hand.

Image Classification Video Recognition

180

Adaptive Focus for Efficient Video Recognition

1 code implementation • ICCV 2021 • Yulin Wang, Zhaoxi Chen, Haojun Jiang, Shiji Song, Yizeng Han, Gao Huang

In this paper, we explore the spatial redundancy in video recognition with the aim of improving computational efficiency.

Computational Efficiency Video Recognition

120

Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

1 code implementation • 26 Jan 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang

Due to the need to store the intermediate activations for back-propagation, end-to-end (E2E) training of deep networks usually suffers from a high GPU memory footprint.

Adaptive Rotated Convolution for Rotated Object Detection

1 code implementation • ICCV 2023 • Yifan Pu, Yiru Wang, Zhuofan Xia, Yizeng Han, Yulin Wang, Weihao Gan, Zidong Wang, Shiji Song, Gao Huang

In our ARC module, the convolution kernels rotate adaptively to extract object features with varying orientations in different images, and an efficient conditional computation mechanism is introduced to accommodate the large orientation variations of objects within an image.

Ranked #3 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +2

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

1 code implementation • CVPR 2022 • Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang

Recent works have shown that the computational efficiency of video recognition can be significantly improved by reducing the spatial redundancy.

Computational Efficiency Video Recognition

CondenseNet V2: Sparse Feature Reactivation for Deep Networks

1 code implementation • CVPR 2021 • Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian

Reusing features in deep networks through dense connectivity is an effective way to achieve high computational efficiency.

Computational Efficiency Image Classification +2

Transferable Semantic Augmentation for Domain Adaptation

1 code implementation • CVPR 2021 • Shuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li

To remedy this, we propose a Transferable Semantic Augmentation (TSA) approach to enhance the classifier adaptation ability through implicitly generating source features towards target semantics.

Domain Adaptation

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition

1 code implementation • CVPR 2021 • Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng

Real-world training data usually exhibits a long-tailed distribution, where several majority classes have a significantly larger number of samples than the remaining minority classes.

Ranked #2 on Long-tail Learning on CIFAR-100-LT (ρ=200)

Data Augmentation Image Classification +2

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

1 code implementation • ICCV 2023 • Yulin Wang, Yang Yue, Rui Lu, Tianjiao Liu, Zhao Zhong, Shiji Song, Gao Huang

It is also effective for self-supervised learning (e.g., MAE).

Data Augmentation Self-Supervised Learning

Dynamic Perceiver for Efficient Visual Recognition

1 code implementation • ICCV 2023 • Yizeng Han, Dongchen Han, Zeyu Liu, Yulin Wang, Xuran Pan, Yifan Pu, Chao Deng, Junlan Feng, Shiji Song, Gao Huang

Early exits are placed exclusively within the classification branch, thus eliminating the need for linear separability in low-level features.

Action Recognition Classification +4

Fine-grained Recognition with Learnable Semantic Data Augmentation

1 code implementation • 1 Sep 2023 • Yifan Pu, Yizeng Han, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang

Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories.

Data Augmentation Fine-Grained Image Recognition +2

Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm

1 code implementation • ICCV 2023 • Wenxuan Ma, Shuang Li, Jinming Zhang, Chi Harold Liu, Jingxuan Kang, Yulin Wang, Gao Huang

To address this issue, this paper presents a novel approach that seeks to leverage linguistic knowledge for data-efficient visual learning.

Domain Generalization Few-Shot Learning +1

Probabilistic Contrastive Learning for Long-Tailed Visual Recognition

1 code implementation • 11 Mar 2024 • Chaoqun Du, Yulin Wang, Shiji Song, Gao Huang

To overcome this obstacle, we propose a novel probabilistic contrastive (ProCo) learning algorithm that estimates the data distribution of the samples from each class in the feature space, and samples contrastive pairs accordingly.

Ranked #8 on Long-tail Learning on iNaturalist 2018

Long-tail Learning

Making the Best of Both Worlds: A Domain-Oriented Transformer for Unsupervised Domain Adaptation

1 code implementation • 2 Aug 2022 • Wenxuan Ma, Jinming Zhang, Shuang Li, Chi Harold Liu, Yulin Wang, Wei Li

To alleviate these issues, we propose to simultaneously conduct feature alignment in two individual spaces focusing on different domains, and create for each space a domain-oriented classifier tailored specifically for that domain.

Pseudo Label Unsupervised Domain Adaptation

Meta-Semi: A Meta-learning Approach for Semi-supervised Learning

no code implementations • 5 Jul 2020 • Yulin Wang, Jiayi Guo, Shiji Song, Gao Huang

In this paper, we propose a novel meta-learning based SSL algorithm (Meta-Semi) that requires tuning only one additional hyper-parameter, compared with a standard supervised deep learning algorithm, to achieve competitive performance under various conditions of SSL.

Meta-Learning

Revisiting Locally Supervised Training of Deep Neural Networks

no code implementations • ICLR 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang

As InfoPro loss is difficult to compute in its original form, we derive a feasible upper bound as a surrogate optimization objective, yielding a simple but effective algorithm.

Dynamic Neural Networks: A Survey

no code implementations • 9 Feb 2021 • Yizeng Han, Gao Huang, Shiji Song, Le Yang, Honghui Wang, Yulin Wang

Dynamic neural networks are an emerging research topic in deep learning.

Computational Efficiency Decision Making

Exploiting Both Domain-specific and Invariant Knowledge via a Win-win Transformer for Unsupervised Domain Adaptation

no code implementations • 25 Nov 2021 • Wenxuan Ma, Jinming Zhang, Shuang Li, Chi Harold Liu, Yulin Wang, Wei Li

Unsupervised Domain Adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain.

Transfer Learning Unsupervised Domain Adaptation

AdaFocusV3: On Unified Spatial-temporal Dynamic Video Recognition

no code implementations • 27 Sep 2022 • Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang

Recent research has revealed that reducing temporal redundancy and reducing spatial redundancy are both effective approaches towards efficient video recognition, e.g., allocating the majority of computation to a task-relevant subset of frames or the most valuable image regions of each frame.

Video Recognition

Computation-efficient Deep Learning for Computer Vision: A Survey

no code implementations • 27 Aug 2023 • Yulin Wang, Yizeng Han, Chaofei Wang, Shiji Song, Qi Tian, Gao Huang

Over the past decade, deep learning models have exhibited considerable advancements, reaching or even exceeding human-level performance in a range of visual perception tasks.

Autonomous Vehicles Edge-computing +1

DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase

no code implementations • 6 Nov 2023 • Dawei Li, Yaxuan Li, Dheeraj Mekala, Shuyao Li, Yulin Wang, Xueqi Wang, William Hogan, Jingbo Shang

DAIL leverages the intuition that large language models are more familiar with the content generated by themselves.

Data Augmentation In-Context Learning +1
