1 code implementation • 24 Jun 2024 • Michal Golovanevsky, William Rudman, Vedant Palit, Ritambhara Singh, Carsten Eickhoff
We introduce NOTICE, the first Noise-free Text-Image Corruption and Evaluation pipeline for mechanistic interpretability in VLMs.
1 code implementation • 26 Oct 2023 • William Rudman, Catherine Chen, Carsten Eickhoff
Representations from large language models (LLMs) are known to be dominated by a small subset of dimensions with exceedingly high variance.
1 code implementation • 30 May 2023 • William Rudman, Carsten Eickhoff
Given the success of Large Language Models (LLMs), there has been considerable interest in studying the properties of model activations.
1 code implementation • 24 May 2022 • William Jurayj, William Rudman, Carsten Eickhoff
In recent years, large-scale transformer decoders such as the GPT-x family of models have become increasingly popular.
1 code implementation • Findings (ACL) 2022 • William Rudman, Nate Gillman, Taylor Rayne, Carsten Eickhoff
We propose IsoScore: a novel tool that quantifies the degree to which a point cloud uniformly utilizes the ambient vector space.