1 code implementation • 28 Nov 2017 • Shayegan Omidshafiei, Dong-Ki Kim, Jason Pazis, Jonathan P. How
This paper presents the Crossmodal Attentive Skill Learner (CASL), integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture [Harb et al., 2017] to enable hierarchical reinforcement learning across multiple sensory inputs.
no code implementations • ICML 2017 • Shayegan Omidshafiei, Jason Pazis, Christopher Amato, Jonathan P. How, John Vian
Many real-world tasks involve multiple agents with partial observability and limited communication.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • NeurIPS 2016 • Jason Pazis, Ronald E. Parr, Jonathan P. How
We present the first application of the median of means in a PAC exploration algorithm for MDPs.