Search Results for author: Arunselvan Ramaswamy

Found 15 papers, 1 papers with code

A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks

no code implementations20 May 2023 Arunselvan Ramaswamy, Shalabh Bhatnagar, Naman Saxena

We show, in theory and through experiments, that our algorithm updates have low variance, and the training loss reduces in a smooth manner.

Q-Learning reinforcement-learning +1

Stability and Convergence of Distributed Stochastic Approximations with large Unbounded Stochastic Information Delays

no code implementations11 May 2023 Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We generalize the Borkar-Meyn stability Theorem (BMT) to distributed stochastic approximations (SAs) with information delays that possess an arbitrary moment bound.

Distributed gradient-based optimization in the presence of dependent aperiodic communication

no code implementations27 Jan 2022 Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We show: If for any $p \ge0$ the processes that describe the success of communication between agents in a SSC network are $\alpha$-mixing with $n^{p-1}\alpha(n)$ summable, then the associated AoI processes are stochastically dominated by a random variable with finite $p$-th moment.

Distributed Optimization

3DPG: Distributed Deep Deterministic Policy Gradient Algorithms for Networked Multi-Agent Systems

no code implementations3 Jan 2022 Adrian Redder, Arunselvan Ramaswamy, Holger Karl

We prove the asymptotic convergence of 3DPG even in the presence of potentially unbounded Age of Information (AoI).

Deep Q-Learning: Theoretical Insights from an Asymptotic Analysis

no code implementations25 Aug 2020 Arunselvan Ramaswamy, Eyke Hüllermeier

Deep Q-Learning is an important reinforcement learning algorithm, which involves training a deep neural network, called Deep Q-Network (DQN), to approximate the well-known Q-function.

Decision Making Q-Learning

DSPG: Decentralized Simultaneous Perturbations Gradient Descent Scheme

no code implementations17 Mar 2019 Arunselvan Ramaswamy

This algorithm is a two fold improvement over the classic Simultaneous Perturbation Stochastic Approximations (SPSA) algorithm.

Multi-Stage Reinforcement Learning For Object Detection

1 code implementation15 Oct 2018 Jonas Koenig, Simon Malberg, Martin Martens, Sebastian Niehaus, Artus Krohn-Grimberghe, Arunselvan Ramaswamy

We compare an approach that is based purely on zoom actions with one that is extended by a second refinement stage to fine-tune the bounding box after each zoom step.

Object object-detection +3

DeepCAS: A Deep Reinforcement Learning Algorithm for Control-Aware Scheduling

no code implementations8 Mar 2018 Burak Demirel, Arunselvan Ramaswamy, Daniel E. Quevedo, Holger Karl

The main contribution of this paper is to develop a deep reinforcement learning-based \emph{control-aware} scheduling (\textsc{DeepCAS}) algorithm to tackle these issues.

reinforcement-learning Reinforcement Learning (RL) +1

Analyzing Approximate Value Iteration Algorithms

no code implementations14 Sep 2017 Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper, we consider the stochastic iterative counterpart of the value iteration scheme wherein only noisy and possibly biased approximations of the Bellman operator are available.

Analysis of gradient descent methods with non-diminishing, bounded errors

no code implementations1 Apr 2016 Arunselvan Ramaswamy, Shalabh Bhatnagar

The main aim of this paper is to provide an analysis of gradient descent (GD) algorithms with gradient errors that do not necessarily vanish, asymptotically.

Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning

no code implementations23 Apr 2015 Arunselvan Ramaswamy, Shalabh Bhatnagar

Analyzing this class of algorithms is important, since many reinforcement learning (RL) algorithms can be cast as SAs driven by a `controlled Markov' process.

reinforcement-learning Reinforcement Learning (RL)

A Generalization of the Borkar-Meyn Theorem for Stochastic Recursive Inclusions

no code implementations6 Feb 2015 Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper the stability theorem of Borkar and Meyn is extended to include the case when the mean field is a differential inclusion.

Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem

no code implementations6 Feb 2015 Arunselvan Ramaswamy, Shalabh Bhatnagar

In this paper we present a framework to analyze the asymptotic behavior of two timescale stochastic approximation algorithms including those with set-valued mean fields.

Cannot find the paper you are looking for? You can Submit a new open access paper.