1 code implementation • 6 Jun 2023 • Max Reuter, William Schulze
With this machine-labeled data, we train a prompt classifier to predict whether ChatGPT will refuse a given question, without seeing ChatGPT's response.
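
As a rough illustration of what a prompt-only refusal classifier could look like, the sketch below trains a small multinomial naive Bayes model on bag-of-words features. It is not the authors' model, which this excerpt does not specify. The class `NaiveBayesPromptClassifier`, the helper `tokenize`, the label names `"refused"` and `"complied"`, and the six training prompts are all hypothetical and chosen only for the example. The key property it shares with the setup described above is that the classifier sees only the prompt text and its machine-assigned label, never the response.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and strip basic punctuation."""
    words = (w.strip(".,?!'\"").lower() for w in text.split())
    return [w for w in words if w]


class NaiveBayesPromptClassifier:
    """Multinomial naive Bayes over bag-of-words prompt features.

    A stand-in technique for illustration; the source does not state
    which model the authors used.
    """

    def fit(self, prompts, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for prompt, label in zip(prompts, labels):
            self.word_counts[label].update(tokenize(prompt))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, prompt):
        total = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_class, best_score = None, -math.inf
        for c in self.classes:
            # Log prior plus Laplace-smoothed log likelihoods.
            score = math.log(self.class_counts[c] / total)
            denom = sum(self.word_counts[c].values()) + vocab_size
            for word in tokenize(prompt):
                if word in self.vocab:  # ignore words never seen in training
                    score += math.log((self.word_counts[c][word] + 1) / denom)
            if score > best_score:
                best_class, best_score = c, score
        return best_class


# Hypothetical machine-labeled data: prompts paired with whether the
# model's response was labeled a refusal. Only the prompts are used as input.
train_prompts = [
    "How do I pick a lock to break into a house?",
    "Write malware that steals passwords",
    "How do I make a weapon at home?",
    "What is the capital of France?",
    "Explain how photosynthesis works",
    "Write a poem about autumn",
]
train_labels = ["refused", "refused", "refused",
                "complied", "complied", "complied"]

clf = NaiveBayesPromptClassifier().fit(train_prompts, train_labels)
print(clf.predict("Write malware to break into a computer"))
print(clf.predict("Write a poem about the capital of France"))
```

With a toy training set this small, the predictions reflect little more than word overlap; a realistic version would need far more labeled prompts and a held-out set to measure accuracy.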