Search Results for author: Balázs Galambosi

Found 1 papers, 1 papers with code

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

1 code implementation6 Apr 2024 Yann Dubois, Balázs Galambosi, Percy Liang, Tatsunori B. Hashimoto

Even simple, known confounders such as preference for longer outputs remain in existing automated evaluation metrics.

Chatbot counterfactual

Cannot find the paper you are looking for? You can Submit a new open access paper.