Search Results for author: Swanand Ravindra Kadhe

Found 3 papers, 0 papers with code

FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs

no code implementations12 Dec 2023 Swanand Ravindra Kadhe, Anisa Halimi, Ambrish Rawat, Nathalie Baracaldo

We evaluate the performance-fairness trade-off for SISA, and empirically demsontrate that SISA can indeed reduce fairness in LLMs.

Fairness Unsupervised Pre-training

Forcing Generative Models to Degenerate Ones: The Power of Data Poisoning Attacks

no code implementations7 Dec 2023 Shuli Jiang, Swanand Ravindra Kadhe, Yi Zhou, Ling Cai, Nathalie Baracaldo

Growing applications of large language models (LLMs) trained by a third party raise serious concerns on the security vulnerability of LLMs. It has been demonstrated that malicious actors can covertly exploit these vulnerabilities in LLMs through poisoning attacks aimed at generating undesirable outputs.

Data Poisoning object-detection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.