Search Results for author: Sumitra Ganesh

Found 21 papers, 2 papers with code

In-Context Learning with Topological Information for Knowledge Graph Completion

no code implementations11 Dec 2024 Udari Madhushani Sehwag, Kassiani Papasotiriou, Jared Vann, Sumitra Ganesh

In-context learning has recently emerged as a promising approach for leveraging pretrained knowledge of LLMs across a range of natural language processing tasks and has been widely adopted in both academia and industry.

Approximate Equivariance in Reinforcement Learning

no code implementations6 Nov 2024 Jung Yeon Park, Sujay Bhatt, Sihan Zeng, Lawson L. S. Wong, Alec Koppel, Sumitra Ganesh, Robin Walters

Equivariant neural networks have shown great success in reinforcement learning, improving sample efficiency and generalization when there is symmetry in the task.

continuous-control Continuous Control +3

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

no code implementations10 Oct 2024 Yuancheng Xu, Udari Madhushani Sehwag, Alec Koppel, Sicheng Zhu, Bang An, Furong Huang, Sumitra Ganesh

Traditional training-time methods finetune LLMs using human preference datasets but incur significant training costs and require repeated training to handle diverse user preferences.

Text Generation

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning

no code implementations1 Feb 2024 Benjamin Patrick Evans, Sumitra Ganesh

Agent-based models (ABMs) have shown promise for modelling various real world phenomena incompatible with traditional equilibrium analysis.

Multi-agent Reinforcement Learning

Learning Payment-Free Resource Allocation Mechanisms

no code implementations18 Nov 2023 Sihan Zeng, Sujay Bhatt, Eleonora Kreacic, Parisa Hassanzadeh, Alec Koppel, Sumitra Ganesh

We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks.

Fairness

Sequential Fair Resource Allocation under a Markov Decision Process Framework

no code implementations10 Jan 2023 Parisa Hassanzadeh, Eleonora Kreacic, Sihan Zeng, Yuchen Xiao, Sumitra Ganesh

We propose a new algorithm, SAFFE, that makes fair allocations with respect to the entire demands revealed over the horizon by accounting for expected future demands at each arrival time.

Decision Making Fairness +1

Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

no code implementations28 Nov 2022 Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy.

reinforcement-learning Reinforcement Learning +2

Phantom -- A RL-driven multi-agent framework to model complex systems

1 code implementation12 Oct 2022 Leo Ardon, Jared Vann, Deepeka Garg, Tom Spooner, Sumitra Ganesh

Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions.

Decision Making Multi-agent Reinforcement Learning

Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems

no code implementations21 Jun 2022 Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, Furong Huang

Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to share information and make good decisions.

Multi-agent Reinforcement Learning

Mixture of basis for interpretable continual learning with distribution shifts

no code implementations5 Jan 2022 Mengda Xu, Sumitra Ganesh, Pranay Pasula

In this paper we consider settings in which the data distribution(task) shifts abruptly and the timing of these shifts are not known.

Continual Learning

Towards a fully RL-based Market Simulator

no code implementations13 Oct 2021 Leo Ardon, Nelson Vadori, Thomas Spooner, Mengda Xu, Jared Vann, Sumitra Ganesh

We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective.

Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures

no code implementations4 Jun 2021 Nelson Vadori, Rahul Savani, Thomas Spooner, Sumitra Ganesh

Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative.

Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs

no code implementations NeurIPS 2021 Thomas Spooner, Nelson Vadori, Sumitra Ganesh

In this paper, we address this problem through a factor baseline which exploits independence structure encoded in a novel action-target influence network.

Policy Gradient Methods

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

no code implementations NeurIPS 2020 Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems.

Reinforcement Learning for Market Making in a Multi-agent Dealer Market

1 code implementation14 Nov 2019 Sumitra Ganesh, Nelson Vadori, Mengda Xu, Hua Zheng, Prashant Reddy, Manuela Veloso

Market makers play an important role in providing liquidity to markets by continuously quoting prices at which they are willing to buy and sell, and managing inventory risk.

reinforcement-learning Reinforcement Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.