no code implementations • 26 Feb 2024 • Massieh Kordi Boroujeny, Ya Jiang, Kai Zeng, Brian Mark
Methods for watermarking large language models have been proposed that distinguish AI-generated text from human-generated text by slightly altering the model output distribution, but they also distort the quality of the text, exposing the watermark to adversarial detection.