Search Results for author: Jayakumar Subramanian

Found 11 papers, 6 papers with code

Behavior Optimized Image Generation

no code implementations18 Nov 2023 Varun Khurana, Yaman K Singla, Jayakumar Subramanian, Rajiv Ratn Shah, Changyou Chen, Zhiqiang Xu, Balaji Krishnamurthy

We show that BoigLLM outperforms 13x larger models such as GPT-3. 5 and GPT-4 in this task, demonstrating that while these state-of-the-art models can understand images, they lack information on how these images perform in the real world.

Image Generation Marketing

Counterfactual Explanation Policies in RL

no code implementations25 Jul 2023 Shripad V. Deshmukh, Srivatsan R, Supriti Vijay, Jayakumar Subramanian, Chirag Agarwal

In this work, we present COUNTERPOL, the first framework to analyze RL policies using counterfactual explanations in the form of minimal changes to the policy that lead to the desired outcome.

counterfactual Counterfactual Explanation +2

SARC: Soft Actor Retrospective Critic

1 code implementation28 Jun 2023 Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two.

Explaining RL Decisions with Trajectories

1 code implementation6 May 2023 Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

To do so, we encode trajectories in offline training data individually as well as collectively (encoding a set of trajectories).

Attribute Continuous Control +3

Differentiable Agent-based Epidemiology

1 code implementation20 Jul 2022 Ayush Chopra, Alexander Rodríguez, Jayakumar Subramanian, Arnau Quera-Bofarull, Balaji Krishnamurthy, B. Aditya Prakash, Ramesh Raskar

Mechanistic simulators are an indispensable tool for epidemiology to explore the behavior of complex, dynamic infections under varying conditions and navigate uncertain environments.

Epidemiology Navigate

DeepABM: Scalable, efficient and differentiable agent-based simulations via graph neural networks

no code implementations9 Oct 2021 Ayush Chopra, Esma Gel, Jayakumar Subramanian, Balaji Krishnamurthy, Santiago Romero-Brufau, Kalyan S. Pasupathy, Thomas C. Kingsley, Ramesh Raskar

We introduce DeepABM, a framework for agent-based modeling that leverages geometric message passing of graph neural networks for simulating action and interactions over large agent populations.

Medical Dead-ends and Learning to Identify High-risk States and Treatments

1 code implementation NeurIPS 2021 Mehdi Fatemi, Taylor W. Killian, Jayakumar Subramanian, Marzyeh Ghassemi

Machine learning has successfully framed many sequential decision making problems as either supervised prediction, or optimal decision-making policy identification via reinforcement learning.

Decision Making

An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare

1 code implementation23 Nov 2020 Taylor W. Killian, Haoran Zhang, Jayakumar Subramanian, Mehdi Fatemi, Marzyeh Ghassemi

Reinforcement Learning (RL) has recently been applied to sequential estimation and prediction problems identifying and developing hypothetical treatment strategies for septic patients, with a particular focus on offline learning with observational data.

Open-Ended Question Answering reinforcement-learning +2

Approximate information state for approximate planning and reinforcement learning in partially observed systems

1 code implementation17 Oct 2020 Jayakumar Subramanian, Amit Sinha, Raihan Seraj, Aditya Mahajan

Our key result is to show that if a function of the history (called approximate information state (AIS)) approximately satisfies the properties of the information state, then there is a corresponding approximate dynamic program.

reinforcement-learning Reinforcement Learning (RL)

Renewal Monte Carlo: Renewal theory based reinforcement learning

no code implementations3 Apr 2018 Jayakumar Subramanian, Aditya Mahajan

We generalize the RMC algorithm to post-decision state models and also present a variant that converges faster to an approximately optimal policy.

Management reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.