Search Results for author: Shahaf S. Shperberg

Found 3 papers, 1 papers with code

Enhancing Numeric-SAM for Learning with Few Observations

no code implementations17 Dec 2023 Argaman Mordoch, Shahaf S. Shperberg, Roni Stern, Berndan Juba

It runs in polynomial time and is guaranteed to output an action model that is safe, in the sense that plans generated by it are applicable and will achieve their intended goals.

A Formal Metareasoning Model of Concurrent Planning and Execution

1 code implementation5 Mar 2023 Amihay Elboher, Ava Bensoussan, Erez Karpas, Wheeler Ruml, Shahaf S. Shperberg, Solomon E. Shimony

When timing is tight, there may be insufficient time to complete the search for a plan before it is time to act.

Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake

no code implementations19 Feb 2022 Shahaf S. Shperberg, Bo Liu, Peter Stone

When humans make catastrophic mistakes, they are expected to learn never to repeat them, such as a toddler who touches a hot stove and immediately learns never to do so again.

Continual Learning Safe Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.