Search Results for author: Atilla Eryilmaz

Found 10 papers, 0 papers with code

Non-Federated Multi-Task Split Learning for Heterogeneous Sources

no code implementations31 May 2024 Yilin Zheng, Atilla Eryilmaz

With the development of edge networks and mobile computing, the need to serve heterogeneous data sources at the network edge requires the design of new distributed machine learning mechanisms.

Federated Learning Image Classification +1

Recurrent Natural Policy Gradient for POMDPs

no code implementations28 May 2024 Semih Cayci, Atilla Eryilmaz

In this paper, we study a natural policy gradient method based on recurrent neural networks (RNNs) for partially-observable Markov decision processes, whereby RNNs are used for policy parameterization and policy evaluation to address curse of dimensionality in non-Markovian reinforcement learning.

Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis

no code implementations19 Feb 2024 Semih Cayci, Atilla Eryilmaz

We analyze recurrent neural networks trained with gradient descent in the supervised learning setting for dynamical systems, and prove that gradient descent can achieve optimality \emph{without} massive overparameterization.

A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback

no code implementations9 Jun 2021 Semih Cayci, Yilin Zheng, Atilla Eryilmaz

In a wide variety of applications including online advertising, contractual hiring, and wireless scheduling, the controller is constrained by a stringent budget constraint on the available resources, which are consumed in a random amount by each action, and a stochastic feasibility constraint that may impose important operational limitations on decision-making.

Decision Making Scheduling

Continuous-Time Multi-Armed Bandits with Controlled Restarts

no code implementations30 Jun 2020 Semih Cayci, Atilla Eryilmaz, R. Srikant

Time-constrained decision processes have been ubiquitous in many fundamental applications in physics, biology and computer science.

Multi-Armed Bandits

Group-Fair Online Allocation in Continuous Time

no code implementations NeurIPS 2020 Semih Cayci, Swati Gupta, Atilla Eryilmaz

Furthermore, as a consequence of certain ethical and economic concerns, the controller may impose deadlines on the completion of each task, and require fairness across different groups in the allocation of total time budget $B$.

Cloud Computing Decision Making +2

Budget-Constrained Bandits over General Cost and Reward Distributions

no code implementations29 Feb 2020 Semih Cayci, Atilla Eryilmaz, R. Srikant

We prove a regret lower bound for this problem, and show that the proposed algorithms achieve tight problem-dependent regret bounds, which are optimal up to a universal constant factor in the case of jointly Gaussian cost and reward pairs.

Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks

no code implementations27 Nov 2018 Semih Cayci, Atilla Eryilmaz

We study the problem of serving randomly arriving and delay-sensitive traffic over a multi-channel communication system with time-varying channel states and unknown statistics.

Thompson Sampling

Combinatorial Multi-Objective Multi-Armed Bandit Problem

no code implementations11 Mar 2018 Doruk Öner, Altuğ Karakurt, Atilla Eryilmaz, Cem Tekin

In this paper, we introduce the COmbinatorial Multi-Objective Multi-Armed Bandit (COMO-MAB) problem that captures the challenges of combinatorial and multi-objective online learning simultaneously.

Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks

no code implementations26 Apr 2017 Swapna Buccapatnam, Fang Liu, Atilla Eryilmaz, Ness B. Shroff

We study the stochastic multi-armed bandit (MAB) problem in the presence of side-observations across actions that occur as a result of an underlying network structure.

Cannot find the paper you are looking for? You can Submit a new open access paper.