no code implementations • 22 Feb 2022 • Sumedh A Sontakke, Buvaneswari Ramanan, Laurent Itti, Thomas Woo
Our work can be employed as a post-processing method whereby an inference-time ML system can convert a trained model into an OOD detector.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
no code implementations • 29 Oct 2021 • Sumedh A Sontakke, Stephen Iota, Zizhao Hu, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf
Extending the successes in supervised learning methods to the reinforcement learning (RL) setting, however, is difficult due to the data generating process - RL agents actively query their environment for data, and the data are a function of the policy followed by the agent.
Out of Distribution (OOD) Detection Reinforcement Learning (RL)
no code implementations • 8 Sep 2021 • Sumedh A Sontakke, Sumegh Roychowdhury, Mausoom Sarkar, Nikaash Puri, Balaji Krishnamurthy, Laurent Itti
Humans excel at learning long-horizon tasks from demonstrations augmented with textual commentary, as evidenced by the burgeoning popularity of tutorial videos online.