Search Results for author: Asim Munawar

Found 25 papers, 6 papers with code

Neuro-Symbolic Approaches for Text-Based Policy Learning

1 code implementation EMNLP 2021 Subhajit Chaudhury, Prithviraj Sen, Masaki Ono, Daiki Kimura, Michiaki Tatsubori, Asim Munawar

We outline a method for end-to-end differentiable symbolic rule learning and show that such symbolic policies outperform previous state-of-the-art methods in text-based RL for the coin collector environment from 5-10x fewer training games.

Reinforcement Learning (RL) text-based games

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

no code implementations23 Feb 2024 Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras

There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks.

Benchmarking slot-filling +2

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

no code implementations4 Feb 2024 Gaurav Pandey, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo

Following the success of Proximal Policy Optimization (PPO) for Reinforcement Learning from Human Feedback (RLHF), new techniques such as Sequence Likelihood Calibration (SLiC) and Direct Policy Optimization (DPO) have been proposed that are offline in nature and use rewards in an indirect manner.

Text Generation

Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

1 code implementation5 Jul 2023 Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar, Alexander Gray

Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games.

reinforcement-learning Representation Learning

LOA: Logical Optimal Actions for Text-based Interaction Games

1 code implementation ACL 2021 Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray

We present Logical Optimal Actions (LOA), an action decision architecture of reinforcement learning applications with a neuro-symbolic framework which is a combination of neural network and symbolic knowledge acquisition approach for natural language interaction games.

reinforcement-learning Reinforcement Learning (RL) +1

Reinforcement Learning with External Knowledge by using Logical Neural Networks

no code implementations3 Mar 2021 Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray

Specifically, we propose an integrated method that enables model-free reinforcement learning from external knowledge sources in an LNNs-based logical constrained framework such as action shielding and guide.

reinforcement-learning Reinforcement Learning (RL)

Ensemble of Discriminators for Domain Adaptation in Multiple Sound Source 2D Localization

no code implementations10 Dec 2020 Guillaume Le Moing, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Phongtharin Vinayavekhin

This paper introduces an ensemble of discriminators that improves the accuracy of a domain adaptation technique for the localization of multiple sound sources.

Domain Adaptation

Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization

no code implementations10 Dec 2020 Guillaume Le Moing, Phongtharin Vinayavekhin, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana

Moreover, learning for different microphone array layouts makes the task more complicated due to the infinite number of possible layouts.

Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

1 code implementation EMNLP 2020 Subhajit Chaudhury, Daiki Kimura, Kartik Talamadupula, Michiaki Tatsubori, Asim Munawar, Ryuki Tachibana

Our bootstrapped agent shows improved generalization in solving unseen TextWorld games, using 10x-20x fewer training games compared to previous state-of-the-art methods despite requiring less number of training episodes.

Q-Learning Reinforcement Learning (RL) +1

Design and Implementation of Linked Planning Domain Definition Language

no code implementations17 Dec 2019 Michiaki Tatsubori, Asim Munawar, Takao Moriyama

Planning is a critical component of any artificial intelligence system that concerns the realization of strategies or action sequences typically for intelligent agents and autonomous robots.

Common Sense Reasoning

Spatially-weighted Anomaly Detection with Regression Model

no code implementations23 Mar 2019 Daiki Kimura, Minori Narita, Asim Munawar, Ryuki Tachibana

Visual anomaly detection is common in several applications including medical screening and production quality check.

Anomaly Detection regression

Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning

no code implementations2 Oct 2018 Subhajit Chaudhury, Daiki Kimura, Asim Munawar, Ryuki Tachibana

Experimental results show that the proposed adversarial learning method from raw videos produces a similar performance to state-of-the-art imitation learning techniques while frequently outperforming existing hand-crafted video imitation methods.

Imitation Learning

Constrained Exploration and Recovery from Experience Shaping

1 code implementation21 Sep 2018 Tu-Hoa Pham, Giovanni De Magistris, Don Joven Agravante, Subhajit Chaudhury, Asim Munawar, Ryuki Tachibana

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding undesirable actions or states, associated to lower rewards, or penalties.

reinforcement-learning Reinforcement Learning (RL)

Safe Exploration in Markov Decision Processes with Time-Variant Safety using Spatio-Temporal Gaussian Process

no code implementations12 Sep 2018 Akifumi Wachi, Hiroshi Kajino, Asim Munawar

This paper presents a learning algorithm called ST-SafeMDP for exploring Markov decision processes (MDPs) that is based on the assumption that the safety features are a priori unknown and time-variant.

Robot Navigation Safe Exploration

Reinforcement Learning Testbed for Power-Consumption Optimization

1 code implementation21 Aug 2018 Takao Moriyama, Giovanni De Magistris, Michiaki Tatsubori, Tu-Hoa Pham, Asim Munawar, Ryuki Tachibana

Common approaches to control a data-center cooling system rely on approximated system/environment models that are built upon the knowledge of mechanical cooling and electrical and thermal management.

Systems and Control

Limiting the Reconstruction Capability of Generative Neural Network using Negative Learning

no code implementations16 Aug 2017 Asim Munawar, Phongtharin Vinayavekhin, Giovanni De Magistris

In the results section we demonstrate the features of the algorithm using MNIST handwritten digit dataset and latter apply the technique to a real-world obstacle detection problem.

Anomaly Detection Data Compression

Conditional generation of multi-modal data using constrained embedding space mapping

no code implementations4 Jul 2017 Subhajit Chaudhury, Sakyasingha Dasgupta, Asim Munawar, Md. A. Salam Khan, Ryuki Tachibana

We present a conditional generative model that maps low-dimensional embeddings of multiple modalities of data to a common latent space hence extracting semantic relationships between them.

Cannot find the paper you are looking for? You can Submit a new open access paper.