Search Results for author: Sumedh A Sontakke

Found 3 papers, 0 papers with code

Model2Detector:Widening the Information Bottleneck for Out-of-Distribution Detection using a Handful of Gradient Steps

no code implementations • 22 Feb 2022 • Sumedh A Sontakke, Buvaneswari Ramanan, Laurent Itti, Thomas Woo

Our work can be employed as a post-processing method whereby an inference-time ML system can convert a trained model into an OOD detector.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Add Code

GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL

no code implementations • 29 Oct 2021 • Sumedh A Sontakke, Stephen Iota, Zizhao Hu, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

Extending the successes in supervised learning methods to the reinforcement learning (RL) setting, however, is difficult due to the data generating process - RL agents actively query their environment for data, and the data are a function of the policy followed by the agent.

Out of Distribution (OOD) Detection Reinforcement Learning (RL)

Paper
Add Code

Video2Skill: Adapting Events in Demonstration Videos to Skills in an Environment using Cyclic MDP Homomorphisms

no code implementations • 8 Sep 2021 • Sumedh A Sontakke, Sumegh Roychowdhury, Mausoom Sarkar, Nikaash Puri, Balaji Krishnamurthy, Laurent Itti

Humans excel at learning long-horizon tasks from demonstrations augmented with textual commentary, as evidenced by the burgeoning popularity of tutorial videos online.

Decision Making

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.