no code implementations • 11 Dec 2024 • Udari Madhushani Sehwag, Kassiani Papasotiriou, Jared Vann, Sumitra Ganesh
In-context learning has recently emerged as a promising approach for leveraging pretrained knowledge of LLMs across a range of natural language processing tasks and has been widely adopted in both academia and industry.
no code implementations • 6 Nov 2024 • Jung Yeon Park, Sujay Bhatt, Sihan Zeng, Lawson L. S. Wong, Alec Koppel, Sumitra Ganesh, Robin Walters
Equivariant neural networks have shown great success in reinforcement learning, improving sample efficiency and generalization when there is symmetry in the task.
no code implementations • 1 Nov 2024 • Leo Ardon, Benjamin Patrick Evans, Deepeka Garg, Annapoorani Lakshmi Narayanan, Makada Henry-Nickie, Sumitra Ganesh
We develop a novel two-layer approach for optimising mortgage relief products through a simulated multi-agent mortgage environment.
no code implementations • 10 Oct 2024 • Natraj Raman, Sumitra Ganesh, Manuela Veloso
Large language models (LLMs) are primarily designed to understand unstructured text.
no code implementations • 10 Oct 2024 • Yuancheng Xu, Udari Madhushani Sehwag, Alec Koppel, Sicheng Zhu, Bang An, Furong Huang, Sumitra Ganesh
Traditional training-time methods finetune LLMs using human preference datasets but incur significant training costs and require repeated training to handle diverse user preferences.
no code implementations • 17 Sep 2024 • Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh
The standard contextual bandit framework assumes fully observable and actionable contexts.
no code implementations • 1 Feb 2024 • Benjamin Patrick Evans, Sumitra Ganesh
Agent-based models (ABMs) have shown promise for modelling various real world phenomena incompatible with traditional equilibrium analysis.
no code implementations • 18 Nov 2023 • Sihan Zeng, Sujay Bhatt, Eleonora Kreacic, Parisa Hassanzadeh, Alec Koppel, Sumitra Ganesh
We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks.
no code implementations • 22 Oct 2023 • Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani, Jared Vann, Deepeka Garg, Sumitra Ganesh
Recent advancements in large language models (LLMs) have exhibited promising performance in solving sequential decision-making problems.
no code implementations • 10 Jan 2023 • Parisa Hassanzadeh, Eleonora Kreacic, Sihan Zeng, Yuchen Xiao, Sumitra Ganesh
We propose a new algorithm, SAFFE, that makes fair allocations with respect to the entire demands revealed over the horizon by accounting for expected future demands at each arrival time.
no code implementations • 28 Nov 2022 • Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh
Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy.
no code implementations • 13 Oct 2022 • Nelson Vadori, Leo Ardon, Sumitra Ganesh, Thomas Spooner, Selim Amrouni, Jared Vann, Mengda Xu, Zeyu Zheng, Tucker Balch, Manuela Veloso
We study a game between liquidity provider and liquidity taker agents interacting in an over-the-counter market, for which the typical example is foreign exchange.
Deep Reinforcement Learning Multi-agent Reinforcement Learning +2
1 code implementation • 12 Oct 2022 • Leo Ardon, Jared Vann, Deepeka Garg, Tom Spooner, Sumitra Ganesh
Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions.
no code implementations • 21 Jun 2022 • Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, Furong Huang
Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to share information and make good decisions.
no code implementations • 5 Jan 2022 • Mengda Xu, Sumitra Ganesh, Pranay Pasula
In this paper we consider settings in which the data distribution(task) shifts abruptly and the timing of these shifts are not known.
no code implementations • 13 Oct 2021 • Leo Ardon, Nelson Vadori, Thomas Spooner, Mengda Xu, Jared Vann, Sumitra Ganesh
We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective.
no code implementations • 4 Jun 2021 • Nelson Vadori, Rahul Savani, Thomas Spooner, Sumitra Ganesh
Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative.
no code implementations • NeurIPS 2021 • Thomas Spooner, Nelson Vadori, Sumitra Ganesh
In this paper, we address this problem through a factor baseline which exploits independence structure encoded in a novel action-target influence network.
no code implementations • NeurIPS 2020 • Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso
Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems.
no code implementations • 23 Jun 2020 • Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso
We introduce a novel framework to account for sensitivity to rewards uncertainty in sequential decision-making problems.
1 code implementation • 14 Nov 2019 • Sumitra Ganesh, Nelson Vadori, Mengda Xu, Hua Zheng, Prashant Reddy, Manuela Veloso
Market makers play an important role in providing liquidity to markets by continuously quoting prices at which they are willing to buy and sell, and managing inventory risk.