Search Results for author: Simon Lermen

Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability

There has been increasing interest in evaluations of language models for a variety of risks and characteristics.

Paper
Add Code

Our fine-tuning method retains general performance, which we validate by comparing our fine-tuned models against Llama 2-Chat across two benchmarks.

Paper
Add Code

Llama 2-Chat is a collection of large language models that Meta developed and released to the public.

Paper
Add Code

Recently, there has been an increase in interest in evaluating large language models for emergent and dangerous capabilities.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.