no code implementations • 13 Dec 2023 • Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel, Joydeep Biswas, Swarat Chaudhuri, Isil Dillig, Alex Dimakis, P. Brighten Godfrey, Daehyeok Kim, Chris Rossbach, Gang Wang
This paper lays down the research agenda for a domain-specific foundation model for operating systems (OSes).
no code implementations • 7 Jul 2021 • Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai
The agent interacts with the environment over episodes, with each episode having different context distributions; this results in the "best expert" changing across episodes.
no code implementations • 19 Feb 2020 • Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai
In the stochastic case, we propose the non-optimistic Global Under-Explore (GLUE) algorithm, which employs the inferred subgaussian estimates to adapt the rate of exploration for the arms.
1 code implementation • 23 Feb 2018 • Rajat Sen, Karthikeyan Shanmugam, Nihal Sharma, Sanjay Shakkottai
We consider the problem of contextual bandits with stochastic experts, which is a variation of the traditional stochastic contextual bandit with experts problem.