no code implementations • 30 Dec 2024 • HyunJi Nam, Allen Nie, Ge Gao, Vasilis Syrgkanis, Emma Brunskill
In two simulated healthcare examples (HIV and sepsis management), we show that our estimators can accurately predict the policy value after observing only 10% of the full-horizon data.
1 code implementation • NeurIPS 2021 • HyunJi Nam, Scott Fleming, Emma Brunskill
Many real-world problems that require making optimal sequences of decisions under uncertainty involve costs when the agent wishes to obtain information about its environment.
1 code implementation • 22 Oct 2020 • Annie S. Chen, HyunJi Nam, Suraj Nair, Chelsea Finn
Concretely, we propose an exploration technique, Batch Exploration with Examples (BEE), that explores relevant regions of the state space, guided by a modest number of human-provided images of important states.