Search Results for author: Katherine Harvey

Finding Neurons in a Haystack: Case Studies with Sparse Probing

Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood.

2,056

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.