Search Results for author: Hjalmar Wijk

Found 3 papers, 2 papers with code

Evaluating Language-Model Agents on Realistic Autonomous Tasks

no code implementations18 Dec 2023 Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano

We find that these language model agents can only complete the easiest tasks from this list, although they make some progress on the more challenging tasks.

Language Modelling

Robustness Guarantees for Credal Bayesian Networks via Constraint Relaxation over Probabilistic Circuits

1 code implementation11 May 2022 Hjalmar Wijk, Benjie Wang, Marta Kwiatkowska

In many domains, worst-case guarantees on the performance (e. g., prediction accuracy) of a decision function subject to distributional shifts and uncertainty about the environment are crucial.

Shielding Atari Games with Bounded Prescience

1 code implementation20 Jan 2021 Mirco Giacobbe, Mohammadhosein Hasanbeig, Daniel Kroening, Hjalmar Wijk

We present the first exact method for analysing and ensuring the safety of DRL agents for Atari games.

Atari Games Autonomous Driving

Cannot find the paper you are looking for? You can Submit a new open access paper.