2 code implementations • 20 Jul 2024 • Manuel Eberhardinger, Florian Rupp, Johannes Maucher, Setareh Maghsudi
Despite tremendous progress, machine learning and deep learning still suffer from incomprehensible predictions.
no code implementations • 24 Jun 2024 • Mariam Yahya, Alexander Conzelmann, Setareh Maghsudi
We study the problem of decentralized task offloading and load-balancing in a dense network with numerous devices and a set of edge servers.
no code implementations • 29 May 2024 • Xiaotong Cheng, Ioannis Tsetis, Setareh Maghsudi
We model this problem as a bandit convex optimization problem with constraints that correspond to each node's limitations for energy production.
no code implementations • 19 May 2024 • Kweiguu Liu, Setareh Maghsudi
The delayed feedback of testing results, thus insufficient information for learning, degraded the efficiency of resource allocation.
1 code implementation • 19 Apr 2024 • Qiang He, Tianyi Zhou, Meng Fang, Setareh Maghsudi
We then leverage this upper bound to propose a novel regularizer, namely BEllman Equation-based automatic rank Regularizer (BEER).
no code implementations • 31 Mar 2024 • Steven Bilaj, Sofien Dhouib, Setareh Maghsudi
We study the problem of meta-learning several contextual stochastic bandits tasks by leveraging their concentration around a low-dimensional affine subspace, which we learn via online principal component analysis to reduce the expected regret over the encountered bandits.
1 code implementation • 7 Sep 2023 • Manuel Eberhardinger, Johannes Maucher, Setareh Maghsudi
Understanding the interactions of agents trained with deep reinforcement learning is crucial for deploying agents in games or the real world.
no code implementations • 31 Aug 2023 • Mariam Yahya, Setareh Maghsudi, Slawomir Stanczak
We then develop a model and solution based on the multi-objective multi-armed bandit theory to maximize the network coverage while minimizing the FL delay.
no code implementations • 26 Jul 2023 • Behzad Nourani-Koliji, Steven Bilaj, Amir Rezaei Balef, Setareh Maghsudi
In our nonstationary environment, variations in the base arms' distributions, causal relationships between rewards, or both, change the reward generation process.
1 code implementation • 18 Jul 2023 • Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi
Maximizing long-term rewards is the primary goal in sequential decision-making problems.
no code implementations • 18 Jul 2023 • Saeed Ghoorchian, Setareh Maghsudi
We develop a policy that learns the structural dependencies from delayed feedback and utilizes that to optimize the decision-making while adapting to drifts.
no code implementations • 29 Jun 2023 • Qiang He, Tianyi Zhou, Meng Fang, Setareh Maghsudi
In ERC, we propose a regularizer that guides the approximation error tending towards the 1-eigensubspace, resulting in a more efficient and stable path of value approximation.
no code implementations • 9 Jun 2023 • Xiaotong Cheng, Setareh Maghsudi
We study a structured multi-agent multi-armed bandit (MAMAB) problem in a dynamic environment.
no code implementations • 30 May 2023 • Haniyeh Barghi, Xiaotong Cheng, Setareh Maghsudi
We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension $d$ whereas the reward function depends on only a limited set of features - precisely $s_0 \ll d$.
no code implementations • 8 May 2023 • Yuwen Cao, Tomoaki Ohtsuki, Setareh Maghsudi, Tony Q. S. Quek
In this paper, we develop a deep learning (DL)-guided hybrid beam and power allocation approach for multiuser millimeter-wave (mmWave) networks, which facilitates swift beamforming at the base station (BS).
1 code implementation • 10 Feb 2023 • Amir Rezaei Balef, Setareh Maghsudi
Without this assumption, the regret bound of our algorithm is $\gamma_T\log(T)$.
2 code implementations • 7 Feb 2023 • Ahmad-Reza Ehyaei, Amir-Hossein Karimi, Bernhard Schölkopf, Setareh Maghsudi
Algorithmic recourse aims to disclose the inner workings of the black-box decision process in situations where decisions have significant consequences, by providing recommendations to empower beneficiaries to achieve a more favorable outcome.
no code implementations • 25 Dec 2022 • Behzad Nourani-Koliji, Saeed Ghoorchian, Setareh Maghsudi
The objective is to maximize the long-term average payoff, which is a linear function of the base arms' rewards and depends strongly on the network topology.
no code implementations • 14 Nov 2022 • Steven Bilaj, Sofien Dhouib, Setareh Maghsudi
We consider the problem of contextual multi-armed bandits in the setting of hypothesis transfer learning.
no code implementations • 19 May 2022 • Navneet Agrawal, Yuqin Qiu, Matthias Frey, Igor Bjelakovic, Setareh Maghsudi, Slawomir Stanczak, Jingge Zhu
Lagrange coded computation (LCC) is essential to solving problems about matrix polynomials in a coded distributed fashion; nevertheless, it can only solve the problems that are representable as matrix polynomials.
no code implementations • 28 Mar 2022 • Xiaotong Cheng, Setareh Maghsudi
One strategy, namely bandit gradient ascent with momentum, is an online convex optimization algorithm with bandit feedback.
no code implementations • 9 Mar 2022 • Sofien Dhouib, Setareh Maghsudi
Recent advances in domain adaptation establish that requiring a low risk on the source domain and equal feature marginals degrade the adaptation's performance.
1 code implementation • 7 Feb 2022 • Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi
Our proposed recommender system employs this policy to learn the users' item preferences online while minimizing runtime.
no code implementations • 19 Jan 2021 • Setareh Maghsudi, Andrew Lan, Jie Xu, Mihaela van der Schaar
The objective of personalized learning is to design an effective knowledge acquisition track that matches the learner's strengths and bypasses her weaknesses to ultimately meet her desired goal.
no code implementations • 14 Aug 2020 • Arash Bozorgchenani, Setareh Maghsudi, Daniele Tarchi, Ekram Hossain
Moreover, we demonstrate that the proposed joint base station selection and the relaying mechanism minimize the task loss in a vehicular environment.
no code implementations • 26 Jun 2020 • Firas Fredj, Yasser Al-Eryani, Setareh Maghsudi, Mohamed Akrout, Ekram Hossain
First, we propose a fully centralized beamforming method that uses the deep deterministic policy gradient algorithm (DDPG) with continuous space.
no code implementations • 13 Apr 2020 • Setareh Maghsudi, Mihaela van der Schaar
The former problem boils down to a stochastic knapsack problem, and we cast the latter as a multi-armed bandit problem.
no code implementations • 12 Apr 2019 • Saeed Ghoorchian, Setareh Maghsudi
The full potential of edge computing becomes realized only if a smart device selects the most appropriate server in terms of the latency and energy consumption, among many available ones.
no code implementations • 30 Apr 2016 • Setareh Maghsudi, Ekram Hossain
In this article, we study the distributed cell association problem for energy harvesting IoT devices in UD-SCNs.
no code implementations • 27 Jan 2016 • Setareh Maghsudi, Ekram Hossain
We consider a distributed downlink user association problem in a small cell network, where small cells obtain the required energy for providing wireless services to users through ambient energy harvesting.
no code implementations • 2 Oct 2015 • Setareh Maghsudi, Ekram Hossain
This requires the next generation wireless networks to move towards new networking paradigms that are able to efficiently support resource-demanding applications such as personalized mobile services.