1 code implementation • 18 Jul 2023 • Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi
Maximizing long-term rewards is the primary goal in sequential decision-making problems.
no code implementations • 18 Jul 2023 • Saeed Ghoorchian, Setareh Maghsudi
We develop a policy that learns the structural dependencies from delayed feedback and utilizes that to optimize the decision-making while adapting to drifts.
no code implementations • 25 Dec 2022 • Behzad Nourani-Koliji, Saeed Ghoorchian, Setareh Maghsudi
The objective is to maximize the long-term average payoff, which is a linear function of the base arms' rewards and depends strongly on the network topology.
1 code implementation • 7 Feb 2022 • Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi
Our proposed recommender system employs this policy to learn the users' item preferences online while minimizing runtime.
no code implementations • 12 Apr 2019 • Saeed Ghoorchian, Setareh Maghsudi
The full potential of edge computing becomes realized only if a smart device selects the most appropriate server in terms of the latency and energy consumption, among many available ones.