Search Results for author: Euan McLean

Found 5 papers, 3 papers with code

Can Go AIs be adversarially robust?

2 code implementations18 Jun 2024 Tom Tseng, Euan McLean, Kellin Pelrine, Tony T. Wang, Adam Gleave

Prior work found that superhuman Go AIs can be defeated by simple adversarial strategies, especially "cyclic" attacks.

Diversity

Exploiting Novel GPT-4 APIs

1 code implementation21 Dec 2023 Kellin Pelrine, Mohammad Taufeeque, Michał Zając, Euan McLean, Adam Gleave

Language model attacks typically assume one of two extreme threat models: full white-box access to model weights, or black-box access limited to a text generation API.

Language Modeling Language Modelling +2

Language models are better than humans at next-token prediction

1 code implementation21 Dec 2022 Buck Shlegeris, Fabien Roger, Lawrence Chan, Euan McLean

Current language models are considered to have sub-human capabilities at natural language tasks like question-answering or writing code.

Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.