Search Results for author: Hsu Kao

Found 2 papers, 0 papers with code

Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure

no code implementations1 Nov 2021 Hsu Kao, Chen-Yu Wei, Vijay Subramanian

For the bandit setting, we propose a hierarchical bandit algorithm that achieves a near-optimal gap-independent regret of $\widetilde{\mathcal{O}}(\sqrt{ABT})$ and a near-optimal gap-dependent regret of $\mathcal{O}(\log(T))$, where $A$ and $B$ are the numbers of actions of the leader and the follower, respectively, and $T$ is the number of steps.

Multi-agent Reinforcement Learning Multi-Armed Bandits +2

Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning

no code implementations25 Oct 2021 Hsu Kao, Vijay Subramanian

Due to information asymmetry, finding optimal policies for Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) is hard with the complexity growing doubly exponentially in the horizon length.

Multi-agent Reinforcement Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.