Search Results for author: Dawn Lu

Found 1 papers, 0 papers with code

Investigating Bias Representations in Llama 2 Chat via Activation Steering

no code implementations1 Feb 2024 Dawn Lu, Nina Rimsky

We address the challenge of societal bias in Large Language Models (LLMs), focusing on the Llama 2 7B Chat model.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.