Search Results for author: Apaar Shanker

Found 2 papers, 0 papers with code

Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs

no code implementations29 Sep 2024 Yung-Chieh Chan, George Pu, Apaar Shanker, Parth Suresh, Penn Jenks, John Heyer, Sam Denton

We provide a practical framework for selecting the appropriate augmentation method across settings, taking into account additional factors such as the scalability of each method, the importance of verifying synthetic data, and the use of different LLMs for synthetic data generation.

Synthetic Data Generation

Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding

no code implementations9 Jan 2024 Yatong Bai, Utsav Garg, Apaar Shanker, Haoming Zhang, Samyak Parajuli, Erhan Bas, Isidora Filipovic, Amelia N. Chu, Eugenia D Fomitcheva, Elliot Branson, Aerin Kim, Somayeh Sojoudi, Kyunghyun Cho

Vision and vision-language applications of neural networks, such as image classification and captioning, rely on large-scale annotated datasets that require non-trivial data-collecting processes.

Image Captioning Image Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.