Text Augmentation

33 papers with code • 0 benchmarks • 0 datasets

You can read these blog posts to get an overview of the approaches.

Libraries

Use these libraries to find Text Augmentation models and implementations
3 papers
4,288
2 papers
370

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

faceonlive/ai-research 7 Apr 2024

Acknowledging this limitation, our objective is to devise a framework capable of concurrently augmenting medical image and text data.

104
07 Apr 2024

EDDA: A Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection

szu-ddj/edda 23 Mar 2024

To address these issues, we propose an encoder-decoder data augmentation (EDDA) framework.

0
23 Mar 2024

A Survey on Data Augmentation in Large Model Era

mlgroup-jlu/llm-data-aug-survey 27 Jan 2024

Leveraging large models, these data augmentation techniques have outperformed traditional approaches.

71
27 Jan 2024

Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation

kinit-sk/llm-div-incts 12 Jan 2024

The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models.

0
12 Jan 2024

Teaching Specific Scientific Knowledge into Large Language Models through Additional Training

kanhatakeyama/Additional-training-Llama2 6 Dec 2023

Through additional training, we explore embedding specialized scientific knowledge into the Llama 2 Large Language Model (LLM).

3
06 Dec 2023

COVID-19 Vaccine Misinformation in Middle Income Countries

zzoliman/covid-vaccine-misinfo-mic 30 Nov 2023

This paper introduces a multilingual dataset of COVID-19 vaccine misinformation, consisting of annotated tweets from three middle-income countries: Brazil, Indonesia, and Nigeria.

0
30 Nov 2023

Pretraining Language Models with Text-Attributed Heterogeneous Graphs

hope-rita/thlm 19 Oct 2023

In many real-world scenarios (e. g., academic networks, social platforms), different types of entities are not only associated with texts but also connected by various relationships, which can be abstracted as Text-Attributed Heterogeneous Graphs (TAHGs).

10
19 Oct 2023

Distributional Data Augmentation Methods for Low Resource Language

mosh98/text_aug_low_res 9 Sep 2023

One of the current state-of-the-art text augmentation techniques is easy data augmentation (EDA), which augments the training data by injecting and replacing synonyms and randomly permuting sentences.

0
09 Sep 2023

Story Visualization by Online Text Augmentation with Context Memory

yonseivnl/cmota ICCV 2023

Story visualization (SV) is a challenging text-to-image generation task for the difficulty of not only rendering visual details from the text descriptions but also encoding a long-term context across multiple sentences.

7
15 Aug 2023