Search Results for author: Aradhana Sinha

Found 3 papers, 0 papers with code

Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images

no code implementations • 25 Jan 2024 • Hansa Srinivasan, Candice Schumann, Aradhana Sinha, David Madras, Gbolahan Oluwafemi Olanubi, Alex Beutel, Susanna Ricco, Jilin Chen

First, a text-guided approach is used to extract a person-diversity representation from a pre-trained image-text model.

Attribute

Paper
Add Code

Improving Few-shot Generalization of Safety Classifiers via Data Augmented Parameter-Efficient Fine-Tuning

no code implementations • 25 Oct 2023 • Ananth Balashankar, Xiao Ma, Aradhana Sinha, Ahmad Beirami, Yao Qin, Jilin Chen, Alex Beutel

As large language models (LLMs) are widely adopted, new safety issues and policies emerge, to which existing safety classifiers do not generalize well.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks

no code implementations • 25 Oct 2023 • Aradhana Sinha, Ananth Balashankar, Ahmad Beirami, Thi Avrahami, Jilin Chen, Alex Beutel

We demonstrate the advantages of this system on the ANLI and hate speech detection benchmark datasets - both collected via an iterative, adversarial human-and-model-in-the-loop procedure.

Hate Speech Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.