1 code implementation • 2 Mar 2025 • Silas Alberti, Kenan Hasanaliyev, Manav Shah, Stefano Ermon
Existing concept unlearning techniques require an anchor prompt/class/distribution to guide unlearning, which is not available in the data unlearning setting.
no code implementations • 22 Dec 2024 • Alexander von Recum, Christoph Schnabl, Gabor Hollbeck, Silas Alberti, Philip Blinde, Marvin Von Hagen
Refusals - instances where large language models (LLMs) decline or fail to fully execute user instructions - are crucial for both AI safety and AI capabilities and the reduction of hallucinations in particular.
3 code implementations • 28 Feb 2024 • Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Ré
In this work, we explore whether we can improve language model efficiency (e. g. by reducing memory consumption) without compromising on recall.
no code implementations • 25 Jan 2024 • Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell
External audits of AI systems are increasingly recognized as a key mechanism for AI governance.
1 code implementation • 25 Oct 2023 • Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti
Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).
1 code implementation • CVPR 2024 • Lukas Haas, Michal Skreta, Silas Alberti, Chelsea Finn
We train two models for evaluations on street-level data and general-purpose image geolocalization; the first model, PIGEON, is trained on data from the game of Geoguessr and is capable of placing over 40% of its guesses within 25 kilometers of the target location globally.
Ranked #1 on
Photo geolocation estimation
on YFCC4k
no code implementations • 5 Jul 2023 • Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok
Natural language processing (NLP) made an impressive jump with the introduction of Transformers.
1 code implementation • 1 Feb 2023 • Lukas Haas, Silas Alberti, Michal Skreta
Image geolocalization is the challenging task of predicting the geographic coordinates of origin for a given photo.
Ranked #1 on
Photo geolocation estimation
on Im2GPS
(Training images metric)