Search Results for author: Jeffrey Wang

Found 1 papers, 0 papers with code

MoPe: Model Perturbation-based Privacy Attacks on Language Models

no code implementations • 22 Oct 2023 • Marvin Li, Jason Wang, Jeffrey Wang, Seth Neel

In this paper, we present Model Perturbations (MoPe), a new method to identify with high confidence if a given text is in the training data of a pre-trained language model, given white-box access to the models parameters.

Language Modelling Memorization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.