1 code implementation • 1 Nov 2023 • Hugo Fry, Seamus Fallows, Ian Fan, Jamie Wright, Nandi Schoots
We investigate the optimization target of Contrast-Consistent Search (CCS), which aims to recover the internal representations of truth of a large language model.