Search Results for author: Bob M. Jacobs

Found 1 papers, 0 papers with code

Social Choice for AI Alignment: Dealing with Diverse Human Feedback

no code implementations • 16 Apr 2024 • Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mossé, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker

Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, so that, for example, they refuse to comply with requests for help with committing crimes or with producing racist text.

Ethics

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.