Search Results for author: Shahaf S. Shperberg

Found 3 papers, 1 papers with code

Enhancing Numeric-SAM for Learning with Few Observations

no code implementations • 17 Dec 2023 • Argaman Mordoch, Shahaf S. Shperberg, Roni Stern, Berndan Juba

It runs in polynomial time and is guaranteed to output an action model that is safe, in the sense that plans generated by it are applicable and will achieve their intended goals.

Paper
Add Code

A Formal Metareasoning Model of Concurrent Planning and Execution

1 code implementation • 5 Mar 2023 • Amihay Elboher, Ava Bensoussan, Erez Karpas, Wheeler Ruml, Shahaf S. Shperberg, Solomon E. Shimony

When timing is tight, there may be insufficient time to complete the search for a plan before it is time to act.

Paper
Code

Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake

no code implementations • 19 Feb 2022 • Shahaf S. Shperberg, Bo Liu, Peter Stone

When humans make catastrophic mistakes, they are expected to learn never to repeat them, such as a toddler who touches a hot stove and immediately learns never to do so again.

Continual Learning Safe Reinforcement Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.