Search Results for author: Dawn Lu

Investigating Bias Representations in Llama 2 Chat via Activation Steering

We address the challenge of societal bias in Large Language Models (LLMs), focusing on the Llama 2 7B Chat model.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.