Search Results for author: Dinesh Raghu

Found 17 papers, 7 papers with code

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

no code implementations4 Feb 2024 Gaurav Pandey, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo

Following the success of Proximal Policy Optimization (PPO) for Reinforcement Learning from Human Feedback (RLHF), new techniques such as Sequence Likelihood Calibration (SLiC) and Direct Policy Optimization (DPO) have been proposed that are offline in nature and use rewards in an indirect manner.

Text Generation

DKAF: KB Arbitration for Learning Task-Oriented Dialog Systems with Dialog-KB Inconsistencies

1 code implementation26 May 2023 Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam

In this work, we define the novel problem of learning a TOD agent with dialog-KB inconsistencies in the training data.

Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs

1 code implementation20 May 2023 Yatin Nandwani, Vineet Kumar, Dinesh Raghu, Sachindra Joshi, Luis A. Lastras

PMI quantifies the extent to which the document influences the generated response -- with a higher PMI indicating a more faithful response.

Response Generation

Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog

1 code implementation13 Oct 2022 Mayank Mishra, Danish Contractor, Dinesh Raghu

Traditional systems designed for task oriented dialog utilize knowledge present only in structured knowledge sources to generate responses.

Response Generation

Matching Papers and Reviewers at Large Conferences

1 code implementation24 Feb 2022 Kevin Leyton-Brown, Mausam, Yatin Nandwani, Hedayat Zarkoob, Chris Cameron, Neil Newman, Dinesh Raghu

Peer-reviewed conferences, the main publication venues in CS, rely critically on matching highly qualified reviewers for each paper.

Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs

no code implementations Findings (ACL) 2021 Dinesh Raghu, Atishya Jain, Mausam, Sachindra Joshi

In this paper, we propose a novel filtering technique that consists of (1) a pairwise similarity based filter that identifies relevant information by respecting the n-ary structure in a KB record.

Response Generation Task-Oriented Dialogue Systems

End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs

1 code implementation EMNLP 2021 Dinesh Raghu, Shantanu Agarwal, Sachindra Joshi, Mausam

We propose a novel problem within end-to-end learning of task-oriented dialogs (TOD), in which the dialog system mimics a troubleshooting agent who helps a user by diagnosing their problem (e. g., car not starting).

Flowchart Grounded Dialog Response Generation Retrieval +1

Unsupervised Learning of KB Queries in Task-Oriented Dialogs

no code implementations30 Apr 2020 Dinesh Raghu, Nikhil Gupta, Mausam

Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses.

Position Reinforcement Learning (RL)

Mask & Focus: Conversation Modelling by Learning Concepts

no code implementations11 Feb 2020 Gaurav Pandey, Dinesh Raghu, Sachindra Joshi

The proposed model, referred to as Mask \& Focus maps the input context to a sequence of concepts which are then used to generate the response concepts.

Machine Translation Response Generation

Unsupervised Learning of Interpretable Dialog Models

no code implementations2 Nov 2018 Dhiraj Madan, Dinesh Raghu, Gaurav Pandey, Sachindra Joshi

However these states need to be handcrafted and annotated in the data.

Multi-level Memory for Task Oriented Dialogs

1 code implementation NAACL 2019 Revanth Reddy, Danish Contractor, Dinesh Raghu, Sachindra Joshi

Instead of using triples to store KB results, we introduce a novel multi-level memory architecture consisting of cells for each query and their corresponding results.

Disentangling Language and Knowledge in Task-Oriented Dialogs

1 code implementation NAACL 2019 Dinesh Raghu, Nikhil Gupta, Mausam

We also systematically modify existing datasets to measure disentanglement and show BoSsNet to be robust to KB modifications.

Disentanglement Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.