Search Results for author: Siow Meng Low

Found 2 papers, 1 papers with code

Safe MDP Planning by Learning Temporal Patterns of Undesirable Trajectories and Averting Negative Side Effects

1 code implementation6 Apr 2023 Siow Meng Low, Akshat Kumar, Scott Sanner

In safe MDP planning, a cost function based on the current state and action is often used to specify safety aspects.

Sample-efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs

no code implementations23 Mar 2022 Siow Meng Low, Akshat Kumar, Scott Sanner

This novel formulation of DRP learning as iterative lower bound optimization (ILBO) is particularly appealing because (i) each step is structurally easier to optimize than the overall objective, (ii) it guarantees a monotonically improving objective under certain theoretical conditions, and (iii) it reuses samples between iterations thus lowering sample complexity.

Cannot find the paper you are looking for? You can Submit a new open access paper.