no code implementations • ICML 2020 • Abhimanyu Dubey, Alex `Sandy' Pentland
We study the heavy-tailed stochastic bandit problem in the cooperative multiagent setting, where a group of agents interact with a common bandit problem, while communicating on a network with delays.
no code implementations • ICML 2020 • Abhimanyu Dubey, Alex `Sandy' Pentland
We propose Coop-KernelUCB that provides near-optimal bounds on the per-agent regret in this setting, and is both computationally and communicatively efficient.
1 code implementation • 15 Nov 2023 • Robert Mahari, Dominik Stammbach, Elliott Ash, Alex `Sandy' Pentland
We present the Legal Passage Retrieval Dataset LePaRD.
no code implementations • 19 Dec 2019 • Masahiro Kazama, Yoshihiko Suhara, Andrey Bogomolov, Alex `Sandy' Pentland
We also analyzed the differences between the expert and non-expert machine algorithms based on their neural representations to evaluate the performances, providing insight into the human experts' and non-experts' cognitive abilities.
no code implementations • 30 Nov 2018 • Dhaval Adjodah, Dan Calacci, Abhimanyu Dubey, Peter Krafft, Esteban Moro, Alex `Sandy' Pentland
This is an important problem because a common technique to improve speed and robustness of learning in deep reinforcement learning and many other machine learning algorithms is to run multiple learning agents in parallel.