no code implementations • 27 Feb 2024 • Zhenhong Zhou, Jiuyang Xiang, Haopeng Chen, Quan Liu, Zherui Li, Sen Su
Large Language Models (LLMs) have been shown to generate illegal or unethical responses, particularly when subjected to "jailbreak" attacks.
no code implementations • 30 Aug 2023 • Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen, Sen Su
Quantifying language model memorization helps evaluate the potential privacy risks of LLMs.