Search Results for author: Aaron Ho

Found 1 papers, 0 papers with code

Evaluating Language-Model Agents on Realistic Autonomous Tasks

no code implementations18 Dec 2023 Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano

We find that these language model agents can only complete the easiest tasks from this list, although they make some progress on the more challenging tasks.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.