Search Results for author: Maria Minakova

Found 2 papers, 0 papers with code

Comprehensive Benchmarking of Entropy and Margin Based Scoring Metrics for Data Selection

no code implementations27 Nov 2023 Anusha Sabbineni, Nikhil Anand, Maria Minakova

While data selection methods have been studied extensively in active learning, data pruning, and data augmentation settings, there is little evidence for the efficacy of these methods in industry scale settings, particularly in low-resource languages.

Active Learning Benchmarking +2

Influence Scores at Scale for Efficient Language Data Sampling

no code implementations27 Nov 2023 Nikhil Anand, Joshua Tan, Maria Minakova

Modern ML systems ingest data aggregated from diverse sources, such as synthetic, human-annotated, and live customer traffic.

Cannot find the paper you are looking for? You can Submit a new open access paper.