Search Results for author: Srinivas Shakkottai

Found 17 papers, 7 papers with code

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

no code implementations7 Oct 2024 Guojun Xiong, Ujwal Dinesha, Debajoy Mukherjee, Jian Li, Srinivas Shakkottai

In this paper, we introduce Pref-RMAB, a new RMAB model in the presence of preference signals, where the decision maker only observes pairwise preference feedback rather than scalar reward from the activated arms at each decision epoch.

Multi-Armed Bandits

Multi-Provider Resource Scheduling in Massive MIMO Radio Access Networks

no code implementations12 Jul 2024 Qing An, Divyanshu Pandey, Rahman Doost-Mohammady, Ashutosh Sabharwal, Srinivas Shakkottai

In this work, we introduce a channel-aware and SLA-aware RAN slicing framework for massive multiple input multiple output (MIMO) networks where resource allocation extends to incorporate the spatial dimension available through beamforming.

Scheduling

CONGO: Compressive Online Gradient Optimization with Application to Microservices Management

no code implementations8 Jul 2024 Jeremy Carleton, Prathik Vijaykumar, Divyanshu Saxena, Dheeraj Narasimha, Srinivas Shakkottai, Aditya Akella

While the number of microservices is substantial, the latency function primarily reacts to resource changes in a few, rendering the gradient sparse.

Compressive Sensing Management

Windex: Realtime Neural Whittle Indexing for Scalable Service Guarantees in NextG Cellular Networks

no code implementations4 Jun 2024 Archana Bura, Ushasi Ghosh, Dinesh Bharadia, Srinivas Shakkottai

We address the resource allocation challenges in NextG cellular radio access networks (RAN), where heterogeneous user applications demand guarantees on throughput and service regularity.

Scheduling

Structured Reinforcement Learning for Media Streaming at the Wireless Edge

no code implementations10 Apr 2024 Archana Bura, Sarat Chandra Bobbili, Shreyas Rameshkumar, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

The goal of this work is to develop and demonstrate learning-based policies for optimal decision making to determine which clients to dynamically prioritize in a video streaming setting.

reinforcement-learning Reinforcement Learning

LLMZip: Lossless Text Compression using Large Language Models

2 code implementations6 Jun 2023 Chandra Shekhara Kaushik Valmeekam, Krishna Narayanan, Dileep Kalathil, Jean-Francois Chamberland, Srinivas Shakkottai

We provide new estimates of an asymptotic upper bound on the entropy of English using the large language model LLaMA-7B as a predictor for the next token given a window of past tokens.

Language Modelling Large Language Model +1

Federated Ensemble-Directed Offline Reinforcement Learning

1 code implementation4 May 2023 Desik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas Shakkottai

We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies.

Continuous Control Ensemble Learning +5

Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

no code implementations2 Nov 2022 Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui

The transition towards carbon-neutral electricity is one of the biggest game changers in addressing climate change since it addresses the dual challenges of removing carbon emissions from the two largest sectors of emitters: electricity and transportation.

Decision Making

Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

1 code implementation26 Sep 2022 Desik Rengarajan, Sapana Chaudhary, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai

Meta reinforcement learning (Meta-RL) is an approach wherein the experience gained from solving a variety of tasks is distilled into a meta-policy.

Meta Reinforcement Learning reinforcement-learning +2

OpenGridGym: An Open-Source AI-Friendly Toolkit for Distribution Market Simulation

1 code implementation6 Mar 2022 Rayan El Helou, Kiyeob Lee, Dongqi Wu, Le Xie, Srinivas Shakkottai, Vijay Subramanian

This paper presents OpenGridGym, an open-source Python-based package that allows for seamless integration of distribution market simulation with state-of-the-art artificial intelligence (AI) decision-making algorithms.

Decision Making

Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration

1 code implementation ICLR 2022 Desik Rengarajan, Gargi Vaidya, Akshay Sarvesh, Dileep Kalathil, Srinivas Shakkottai

We demonstrate the superior performance of our algorithm over state-of-the-art approaches on a number of benchmark environments with sparse rewards and censored state.

reinforcement-learning Reinforcement Learning +1

DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning

1 code implementation1 Dec 2021 Archana Bura, Aria HasanzadeZonuzy, Dileep Kalathil, Srinivas Shakkottai, Jean-Francois Chamberland

Safe reinforcement learning is extremely challenging--not only must the agent explore an unknown environment, it must do so while ensuring no safety constraint violations.

reinforcement-learning Reinforcement Learning +3

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

1 code implementation NeurIPS 2021 Khaled Nakhleh, Santosh Ganji, Ping-Chun Hsieh, I-Hong Hou, Srinivas Shakkottai

This paper proposes NeurWIN, a neural Whittle index network that seeks to learn the Whittle indices for any restless bandits by leveraging mathematical properties of the Whittle indices.

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

no code implementations1 Aug 2020 Aria HasanzadeZonuzy, Archana Bura, Dileep Kalathil, Srinivas Shakkottai

Many physical systems have underlying safety considerations that require that the policy employed ensures the satisfaction of a set of constraints.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning for Mean Field Games with Strategic Complementarities

no code implementations21 Jun 2020 Kiyeob Lee, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

We introduce a natural refinement to the equilibrium concept that we call Trembling-Hand-Perfect MFE (T-MFE), which allows agents to employ a measure of randomization while accounting for the impact of such randomization on their payoffs.

reinforcement-learning Reinforcement Learning +1

QFlow: A Learning Approach to High QoE Video Streaming at the Wireless Edge

no code implementations4 Jan 2019 Rajarshi Bhattacharyya, Archana Bura, Desik Rengarajan, Mason Rumuly, Bainan Xia, Srinivas Shakkottai, Dileep Kalathil, Ricky K. P. Mok, Amogh Dhamdhere

The predominant use of wireless access networks is for media streaming applications, which are only gaining popularity as ever more devices become available for this purpose.

Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.