Data Augmentation

2509 papers with code • 2 benchmarks • 63 datasets

Data augmentation refers to techniques that enlarge a dataset by applying modifications to existing examples to create new ones. Besides growing the dataset, it also increases the dataset's diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion of words, among others.
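The transformations named above can be illustrated with a minimal sketch in plain Python. It is not taken from any of the libraries listed on this page: images are assumed to be 2D lists of pixel values, text is assumed to be already tokenized, and every function name is hypothetical. Random insertion here draws from a caller-supplied vocabulary, which is a simplification; practical systems often insert synonyms instead.

```python
import random

def hflip(image):
    """Mirror a 2D image (list of rows) left to right."""
    return [row[::-1] for row in image]

def rotate90(image):
    """Rotate a 2D image 90 degrees clockwise."""
    return [list(col) for col in zip(*image[::-1])]

def random_crop(image, height, width, rng):
    """Cut a random height x width window out of the image."""
    top = rng.randint(0, len(image) - height)
    left = rng.randint(0, len(image[0]) - width)
    return [row[left:left + width] for row in image[top:top + height]]

def random_swap(tokens, rng):
    """Swap the tokens at two randomly chosen positions."""
    tokens = list(tokens)
    if len(tokens) >= 2:
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, p, rng):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]

def random_insertion(tokens, vocabulary, rng):
    """Insert one word from an assumed vocabulary at a random position."""
    tokens = list(tokens)
    tokens.insert(rng.randint(0, len(tokens)), rng.choice(vocabulary))
    return tokens

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    sentence = "data augmentation helps reduce overfitting".split()
    print(hflip(image))
    print(rotate90(image))
    print(random_crop(image, 2, 2, rng))
    print(random_swap(sentence, rng))
    print(random_deletion(sentence, 0.2, rng))
    print(random_insertion(sentence, ["often", "usually"], rng))
```

Each function returns a new object and leaves its input untouched, so one original example can yield several augmented variants. Passing an explicit `random.Random` instance keeps the augmentations reproducible.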

Benchmarks

These leaderboards are used to track progress in Data Augmentation.

Dataset	Best Model
ImageNet	DeiT-B (+MixPro)
CIFAR-10	Shake-Shake (26 2×96d) (Faster AA)

Libraries

Use these libraries to find Data Augmentation models and implementations.

Westlake-AI/openmixup • 15 papers • 567 stars
faceonlive/ai-research • 8 papers • 124 stars
rwightman/pytorch-image-models • 7 papers • 29,648 stars
makcedward/nlpaug • 7 papers • 4,290 stars

See all 7 libraries.

Latest papers with no code

Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

no code yet • 16 Apr 2024

Distributed Distributional DrQ is a model-free, off-policy RL algorithm for continuous control tasks that operates on the agent's states and observations. It is an actor-critic method that combines data augmentation with a distributional view of the critic's value function.

Offline Trajectory Generalization for Offline Reinforcement Learning

no code yet • 16 Apr 2024

We then propose four strategies that use World Transformers to generate high-reward simulated trajectories by perturbing the offline data.

Clustering and Data Augmentation to Improve Accuracy of Sleep Assessment and Sleep Individuality Analysis

no code yet • 16 Apr 2024

With growing health awareness, novel methods now allow individuals to monitor their sleep at home.

Classification of Prostate Cancer in 3D Magnetic Resonance Imaging Data based on Convolutional Neural Networks

no code yet • 16 Apr 2024

Prostate cancer is a commonly diagnosed cancer among men worldwide.

Awareness of uncertainty in classification using a multivariate model and multi-views

no code yet • 16 Apr 2024

The proposed model regularizes uncertain predictions and is trained to compute both the predictions and their uncertainty estimates.

Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?

no code yet • 15 Apr 2024

Self-supervised learning (SSL) has emerged as a promising solution for addressing the challenge of limited labeled data in deep neural networks (DNNs), offering scalability potential.

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features

no code yet • 15 Apr 2024

The task of face reenactment is to transfer the head motion and facial expressions from a driving video to the appearance of a source image, which may be of a different person (cross-reenactment).

Accelerating Ensemble Error Bar Prediction with Single Models Fits

no code yet • 15 Apr 2024

Ensemble models can be used to estimate prediction uncertainties in machine learning models.

DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

no code yet • 14 Apr 2024

Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.

Improving Personalisation in Valence and Arousal Prediction using Data Augmentation

no code yet • 13 Apr 2024

This paper presents our work on an enhanced personalisation strategy that leverages data augmentation to develop tailored models for continuous valence and arousal prediction.
