Search Results for author: Silas Alberti

Found 8 papers, 5 papers with code

Data Unlearning in Diffusion Models

1 code implementation • 2 Mar 2025 • Silas Alberti, Kenan Hasanaliyev, Manav Shah, Stefano Ermon

Existing concept unlearning techniques require an anchor prompt/class/distribution to guide unlearning, which is not available in the data unlearning setting.

Machine Unlearning Memorization

Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs

no code implementations • 22 Dec 2024 • Alexander von Recum, Christoph Schnabl, Gabor Hollbeck, Silas Alberti, Philip Blinde, Marvin Von Hagen

Refusals - instances where large language models (LLMs) decline or fail to fully execute user instructions - are crucial for both AI safety and AI capabilities and the reduction of hallucinations in particular.

Simple linear attention language models balance the recall-throughput tradeoff

3 code implementations • 28 Feb 2024 • Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Ré

In this work, we explore whether we can improve language model efficiency (e.g., by reducing memory consumption) without compromising on recall.

Language Modelling Mamba +1

SuperHF: Supervised Iterative Learning from Human Feedback

1 code implementation • 25 Oct 2023 • Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).

Language Modelling Safety Alignment

PIGEON: Predicting Image Geolocations

1 code implementation • CVPR 2024 • Lukas Haas, Michal Skreta, Silas Alberti, Chelsea Finn

We train two models for evaluations on street-level data and general-purpose image geolocalization; the first model, PIGEON, is trained on data from the game of Geoguessr and is capable of placing over 40% of its guesses within 25 kilometers of the target location globally.

Photo geolocation estimation

Sumformer: Universal Approximation for Efficient Transformers

no code implementations • 5 Jul 2023 • Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok

Natural language processing (NLP) made an impressive jump with the introduction of Transformers.

Learning Generalized Zero-Shot Learners for Open-Domain Image Geolocalization

1 code implementation • 1 Feb 2023 • Lukas Haas, Silas Alberti, Michal Skreta

Image geolocalization is the challenging task of predicting the geographic coordinates of origin for a given photo.

Ranked #1 on Photo geolocation estimation on Im2GPS (Training images metric)

Generalized Zero-Shot Learning Meta-Learning +1
