no code implementations • 30 Apr 2015 • Orly Avner, Shie Mannor
Inspired by cognitive radio networks, we consider a setting where multiple users share several channels, which we model as a multi-user multi-armed bandit (MAB) problem.
no code implementations • 22 Apr 2014 • Orly Avner, Shie Mannor
Even the number of users may be unknown and can vary as users join or leave the network.
no code implementations • 14 Aug 2018 • Orly Avner, Shie Mannor
Sharing a communication network among many users is a widespread challenge nowadays.
no code implementations • 28 Mar 2023 • Ori Linial, Orly Avner, Dotan Di Castro
We introduce a method for inferring an explicit PDE from a data sample generated by previously unseen dynamics, based on a learned context.
no code implementations • 3 Feb 2024 • Nitsan Soffair, Dotan Di-Castro, Orly Avner, Shie Mannor
We implement SQT on top of TD3/TD7 code and test it against the state-of-the-art (SOTA) actor-critic algorithms DDPG, TD3, and TD7 on seven popular MuJoCo and Bullet tasks.