Search Results for author: Piyush Gupta

Found 15 papers, 4 papers with code

Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations

no code implementations13 Mar 2025 Piyush Gupta, Sangjae Bae, David Isele

To overcome these limitations, we propose Graph-Grounded LLMs, a system that improves LLM performance on graph-related tasks by integrating a graph library through function calls.

Autonomous Vehicles Knowledge Graphs +2

GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks

no code implementations CVPR 2025 Haoqiang Kang, Enna Sachdeva, Piyush Gupta, Sangjae Bae, Kwonjoon Lee

To address these challenges, we introduce a novel framework, GFlowVLM, a framework that fine-tune VLMs using Generative Flow Networks (GFlowNets) to promote generation of diverse solutions for complex reasoning tasks.

Card Games Diversity +2

Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees

no code implementations27 Jan 2025 Piyush Gupta, David Isele, Enna Sachdeva, Pin-Hao Huang, Behzad Dariush, Kwonjoon Lee, Sangjae Bae

We present a novel mission-planning strategy for heterogeneous multi-robot teams, taking into account the specific constraints and capabilities of each robot.

Fast Inventory for 3GPP Ambient IoT Considering Device Unavailability due to Energy Harvesting

no code implementations25 Jan 2025 Zhikun Wu, Kazuk Takeda, Piyush Gupta, Ruiming Zheng, Luanxia Yang, Chengjin Zhang, Zhifei Fan, Hao Xu, Kiran Mukkavilli, Tingfang Ji

With the growing demand for massive internet of things (IoT), new IoT technology, namely ambient IoT (A-IoT), has been studied in the 3rd Generation Partnership Project (3GPP).

SARC: Soft Actor Retrospective Critic

1 code implementation28 Jun 2023 Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two.

Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning

no code implementations12 Sep 2022 Piyush Gupta, Vaibhav Srivastava

During exploration, DSEE explores the environment and updates the estimates for expected reward and transition probabilities.

Efficient Exploration reinforcement-learning +2

Information-theoretic Evolution of Model Agnostic Global Explanations

no code implementations14 May 2021 Sukriti Verma, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

Our approach builds on top of existing local model explanation methods to extract conditions important for explaining model behavior for specific instances followed by an evolutionary algorithm that optimizes an information theory based fitness function to construct rules that explain global model behavior.

Marketing model

MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

no code implementations3 Sep 2020 Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji K

Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem.

Data Augmentation Fraud Detection +1

Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks

no code implementations24 Jun 2020 Surgan Jandial, Ayush Chopra, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy, Vineeth Balasubramanian

Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains.

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

1 code implementation ICLR 2020 Piyush Gupta, Nikaash Puri, Sukriti Verma, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

We show through illustrative examples (Chess, Atari, Go), human studies (Chess), and automated evaluation methods (Chess) that our approach generates saliency maps that are more interpretable for humans than existing approaches.

Atari Games Board Games +3

Towards Safer Self-Driving Through Great PAIN (Physically Adversarial Intelligent Networks)

1 code implementation24 Mar 2020 Piyush Gupta, Demetris Coleman, Joshua E. Siegel

Automated vehicles' neural networks suffer from overfit, poor generalizability, and untrained edge cases due to limited data availability.

ShapeVis: High-dimensional Data Visualization at Scale

no code implementations15 Jan 2020 Nupur Kumari, Siddarth R., Akash Rupela, Piyush Gupta, Balaji Krishnamurthy

This graph captures the structural characteristics of the point cloud, and its weights are determined using a Finite Markov Chain.

Community Detection Data Visualization +3

Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

2 code implementations23 Dec 2019 Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

We show through illustrative examples (Chess, Atari, Go), human studies (Chess), and automated evaluation methods (Chess) that SARFA generates saliency maps that are more interpretable for humans than existing approaches.

Atari Games Board Games +3

Cannot find the paper you are looking for? You can Submit a new open access paper.