no code implementations • 18 Dec 2023 • Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate
This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning.
no code implementations • 28 Nov 2023 • Jun Wang, Hosein Hasanbeig, Kaiyuan Tan, Zihe Sun, Yiannis Kantaros
We consider robots with unknown stochastic dynamics operating in environments with unknown geometric structure.
no code implementations • 30 Sep 2023 • Safoora Yousefi, Leo Betthauser, Hosein Hasanbeig, Raphaël Millière, Ida Momennejad
In this work, we investigate how LLM embeddings and attention representations change following in-context-learning, and how these changes mediate improvement in behavior.
no code implementations • 24 Sep 2023 • Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad
From grading papers to summarizing medical documents, large language models (LLMs) are evermore used for evaluation of text generated by humans and AI alike.
1 code implementation • 21 Sep 2022 • Hosein Hasanbeig, Daniel Kroening, Alessandro Abate
LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability.
1 code implementation • 2 Feb 2019 • Hosein Hasanbeig, Daniel Kroening, Alessandro Abate
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems.