no code implementations • 14 May 2023 • Francesco Pase, Szymon Kobus, Deniz Gunduz, Michele Zorzi
The transmitter applies a learning algorithm to the available examples and extracts knowledge from the data by optimizing a probability distribution over a set of models, i.e., known functions, which can better describe the observed data, and so potentially the underlying concepts.
3 code implementations • 4 Feb 2021 • Mahdi Boloursaz Mashhadi, Mikolaj Jankowski, Tze-Yang Tung, Szymon Kobus, Deniz Gunduz
Efficient link configuration in millimeter wave (mmWave) communication systems is a crucial yet challenging task due to the overhead imposed by beam selection.
no code implementations • 2 Jan 2021 • Tze-Yang Tung, Szymon Kobus, Joan Roig Pujol, Deniz Gunduz
Specifically, we consider a multi-agent partially observable Markov decision process (MA-POMDP), in which the agents, in addition to interacting with the environment, can also communicate with each other over a noisy communication channel.
Multi-agent Reinforcement Learning, reinforcement-learning (+1 more)