no code implementations • 17 Feb 2024 • Yaniv Cohen, Tomer Gafni, Ronen Greenberg, Kobi Cohen
We propose a novel multi-agent reinforcement learning (RL) framework for distributed DCA, named Channel Allocation RL To Overlapped Networks (CARLTON).
Multi-agent Reinforcement Learning Reinforcement Learning (RL)
no code implementations • 30 Mar 2023 • Tomer Gafni, Kobi Cohen, Yonina C. Eldar
To handle statistical heterogeneity of users data, which is a second major challenge in FL, we extend BAAF to allow for appropriate local updates by the users and develop the Controlled Bayesian Air Aggregation Federated-learning (COBAAF) algorithm.
no code implementations • 28 Feb 2022 • Tomer Gafni, Michal Yemini, Kobi Cohen
Motivated by recent studies on related RMAB settings, the regret is defined as the reward loss with respect to a player that knows the dynamics of the problem, and plays at each time t the arm that maximizes the expected immediate value.
no code implementations • 17 Dec 2021 • Tomer Gafni, Michal Yemini, Kobi Cohen
Motivated by recent studies on related RMAB settings, the regret is defined as the reward loss with respect to a player that knows the dynamics of the problem, and plays at each time $t$ the arm that maximizes the expected immediate value.
no code implementations • 31 Mar 2021 • Tomer Gafni, Nir Shlezinger, Kobi Cohen, Yonina C. Eldar, H. Vincent Poor
Learning in a federated manner differs from conventional centralized machine learning, and poses several core unique challenges and requirements, which are closely related to classical problems studied in the areas of signal processing and communications.
no code implementations • 27 Jan 2021 • Tomer Gafni, Kobi Cohen
By contrast, we consider a more general and practical model, where each channel yields a different expected rate for each user.
no code implementations • 19 Jun 2019 • Tomer Gafni, Kobi Cohen
Although existing methods have shown a logarithmic regret order with time in this RMAB setting, the theoretical analysis shows a significant improvement in the regret scaling with respect to the system parameters under ASR.