Search Results for author: Arunselvan Ramaswamy

Found 15 papers, 1 papers with code

A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks

no code implementations • 20 May 2023 • Arunselvan Ramaswamy, Shalabh Bhatnagar, Naman Saxena

We show, in theory and through experiments, that our algorithm updates have low variance, and the training loss reduces in a smooth manner.

Q-Learning reinforcement-learning +1

Paper
Add Code

Stability and Convergence of Distributed Stochastic Approximations with large Unbounded Stochastic Information Delays

no code implementations • 11 May 2023 • Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We generalize the Borkar-Meyn stability Theorem (BMT) to distributed stochastic approximations (SAs) with information delays that possess an arbitrary moment bound.

Paper
Add Code

Distributed gradient-based optimization in the presence of dependent aperiodic communication

no code implementations • 27 Jan 2022 • Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We show: If for any $p \ge0$ the processes that describe the success of communication between agents in a SSC network are $\alpha$-mixing with $n^{p-1}\alpha(n)$ summable, then the associated AoI processes are stochastically dominated by a random variable with finite $p$-th moment.

Distributed Optimization

Paper
Add Code

3DPG: Distributed Deep Deterministic Policy Gradient Algorithms for Networked Multi-Agent Systems

no code implementations • 3 Jan 2022 • Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We prove the asymptotic convergence of 3DPG even in the presence of potentially unbounded Age of Information (AoI).

Paper
Add Code

Deep Q-Learning: Theoretical Insights from an Asymptotic Analysis

no code implementations • 25 Aug 2020 • Arunselvan Ramaswamy, Eyke Hüllermeier

Deep Q-Learning is an important reinforcement learning algorithm, which involves training a deep neural network, called Deep Q-Network (DQN), to approximate the well-known Q-function.

Decision Making Q-Learning

Paper
Add Code

Deep reinforcement learning for scheduling in large-scale networked control systems

no code implementations • 15 May 2019 • Adrian Redder, Arunselvan Ramaswamy, Daniel E. Quevedo

This work considers the problem of control and resource scheduling in networked systems.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

DSPG: Decentralized Simultaneous Perturbations Gradient Descent Scheme

no code implementations • 17 Mar 2019 • Arunselvan Ramaswamy

This algorithm is a two fold improvement over the classic Simultaneous Perturbation Stochastic Approximations (SPSA) algorithm.

Paper
Add Code

Multi-Stage Reinforcement Learning For Object Detection

1 code implementation • 15 Oct 2018 • Jonas Koenig, Simon Malberg, Martin Martens, Sebastian Niehaus, Artus Krohn-Grimberghe, Arunselvan Ramaswamy

We compare an approach that is based purely on zoom actions with one that is extended by a second refinement stage to fine-tune the bounding box after each zoom step.

Object object-detection +3

Paper
Code

DeepCAS: A Deep Reinforcement Learning Algorithm for Control-Aware Scheduling

no code implementations • 8 Mar 2018 • Burak Demirel, Arunselvan Ramaswamy, Daniel E. Quevedo, Holger Karl

The main contribution of this paper is to develop a deep reinforcement learning-based \emph{control-aware} scheduling (\textsc{DeepCAS}) algorithm to tackle these issues.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning

no code implementations • 22 Feb 2018 • Arunselvan Ramaswamy, Shalabh Bhatnagar, Daniel E. Quevedo

In this paper, we present verifiable sufficient conditions for stability and convergence of asynchronous SAs with biased approximation errors.

Multi-agent Reinforcement Learning Policy Gradient Methods

Paper
Add Code

Analyzing Approximate Value Iteration Algorithms

no code implementations • 14 Sep 2017 • Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper, we consider the stochastic iterative counterpart of the value iteration scheme wherein only noisy and possibly biased approximations of the Bellman operator are available.

Paper
Add Code

Analysis of gradient descent methods with non-diminishing, bounded errors

no code implementations • 1 Apr 2016 • Arunselvan Ramaswamy, Shalabh Bhatnagar

The main aim of this paper is to provide an analysis of gradient descent (GD) algorithms with gradient errors that do not necessarily vanish, asymptotically.

Paper
Add Code

Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning

no code implementations • 23 Apr 2015 • Arunselvan Ramaswamy, Shalabh Bhatnagar

Analyzing this class of algorithms is important, since many reinforcement learning (RL) algorithms can be cast as SAs driven by a `controlled Markov' process.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Generalization of the Borkar-Meyn Theorem for Stochastic Recursive Inclusions

no code implementations • 6 Feb 2015 • Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper the stability theorem of Borkar and Meyn is extended to include the case when the mean field is a differential inclusion.

Paper
Add Code

Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem

no code implementations • 6 Feb 2015 • Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper we present a framework to analyze the asymptotic behavior of two timescale stochastic approximation algorithms including those with set-valued mean fields.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.