no code implementations • 19 Oct 2023 • Sergey Berezin, Reza Farahbakhsh, Noel Crespi
We introduce a simple yet efficient sentence-level attack on black-box toxicity detector models.
no code implementations • 3 Oct 2023 • Sergey Berezin, Reza Farahbakhsh, Noel Crespi
The fundamental problem in the toxicity detection task is that toxicity is ill-defined.
no code implementations • sdp (COLING) 2022 • Sergey Berezin, Tatiana Batura
After that, this model is used to mask named entities in the text, and a BART model is trained to reconstruct them.