Search Results for author: Hosein Hasanbeig

Found 6 papers, 2 papers with code

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

no code implementations18 Dec 2023 Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate

This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning.

Bayesian Inference Reinforcement Learning (RL)

Decoding In-Context Learning: Neuroscience-inspired Analysis of Representations in Large Language Models

no code implementations30 Sep 2023 Safoora Yousefi, Leo Betthauser, Hosein Hasanbeig, Raphaël Millière, Ida Momennejad

In this work, we investigate how LLM embeddings and attention representations change following in-context-learning, and how these changes mediate improvement in behavior.

In-Context Learning Reading Comprehension

ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning

no code implementations24 Sep 2023 Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad

From grading papers to summarizing medical documents, large language models (LLMs) are evermore used for evaluation of text generated by humans and AI alike.

In-Context Learning

LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning

1 code implementation21 Sep 2022 Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability.

reinforcement-learning Reinforcement Learning (RL)

Certified Reinforcement Learning with Logic Guidance

1 code implementation2 Feb 2019 Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems.

Decision Making Decision Making Under Uncertainty +4

Cannot find the paper you are looking for? You can Submit a new open access paper.