Search Results for author: I-Hong Hou

Found 4 papers, 2 papers with code

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

1 code implementation NeurIPS 2021 Khaled Nakhleh, Santosh Ganji, Ping-Chun Hsieh, I-Hong Hou, Srinivas Shakkottai

This paper proposes NeurWIN, a neural Whittle index network that seeks to learn the Whittle indices for any restless bandits by leveraging mathematical properties of the Whittle indices.

DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs

1 code implementation18 Sep 2022 Khaled Nakhleh, I-Hong Hou

We consider the problem of learning the optimal threshold policy for control problems.

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making

no code implementations22 Mar 2024 Xin Chen, I-Hong Hou

This paper introduces a novel multi-armed bandits framework, termed Contextual Restless Bandits (CRB), for complex online decision-making.

Decision Making Multi-Armed Bandits

Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback

no code implementations6 Apr 2024 I-Hong Hou

In addition to the exploration-exploitation dilemma in the traditional multi-armed bandit problem, we show that the consideration of multiple stages introduces a third component, education, where an agent needs to choose its actions to facilitate the learning of agents in the next stage.

Cannot find the paper you are looking for? You can Submit a new open access paper.