no code implementations • 21 Dec 2023 • Andrea Wynn, Ilia Sucholutsky, Thomas L. Griffiths
We propose that this kind of representational alignment between machine learning (ML) models and humans can also support value alignment, allowing ML systems to conform to human values and societal norms.