Search Results for author: Eric Winsor

Found 4 papers, 3 papers with code

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

1 code implementation • 13 Dec 2023 • Alexandre Variengien, Eric Winsor

We find that LMs internally decompose retrieval tasks in a modular way: middle layers at the last token position process the request, while late layers retrieve the correct entity from the context.

Attribute, Question Answering +1

Interpreting Neural Networks through the Polytope Lens

no code implementations • 22 Nov 2022 • Sid Black, Lee Sharkey, Leo Grinsztajn, Eric Winsor, Dan Braun, Jacob Merizian, Kip Parker, Carlos Ramón Guevara, Beren Millidge, Gabriel Alfour, Connor Leahy

Previous mechanistic descriptions have used individual neurons or their linear combinations to understand the representations a network has learned.

Scatterbrain: Unifying Sparse and Low-rank Attention Approximation

1 code implementation • NeurIPS 2021 • Beidi Chen, Tri Dao, Eric Winsor, Zhao Song, Atri Rudra, Christopher Ré

Recent advances in efficient Transformers have exploited either the sparsity or low-rank properties of attention matrices to reduce the computational and memory bottlenecks of modeling long sequences.

Image Generation, Language Modelling

173

Scatterbrain: Unifying Sparse and Low-rank Attention

1 code implementation • NeurIPS 2021 • Beidi Chen, Tri Dao, Eric Winsor, Zhao Song, Atri Rudra, Christopher Ré

Image Generation, Language Modelling

173
