Search Results for author: Pranjal Aggarwal

Found 6 papers, 4 papers with code

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

no code implementations12 Apr 2024 Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hallucinations.

Language Modelling reinforcement-learning

GEO: Generative Engine Optimization

no code implementations16 Nov 2023 Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik R Narasimhan, Ameet Deshpande

We facilitate systematic evaluation in this new paradigm by introducing GEO-bench, a benchmark of diverse user queries across multiple domains, coupled with sources required to answer these queries.

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs

1 code implementation19 May 2023 Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam

A popular approach for improving the correctness of output from large language models (LLMs) is Self-Consistency - poll the LLM multiple times and output the most frequent solution.

Code Generation

SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

1 code implementation26 Jan 2023 Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan

In this paper, we develop SemSup-XC, a model that achieves state-of-the-art zero-shot and few-shot performance on three XC datasets derived from legal, e-commerce, and Wikipedia data.

Contrastive Learning

Hope Speech Detection on Social Media Platforms

1 code implementation14 Nov 2022 Pranjal Aggarwal, Pasupuleti Chandana, Jagrut Nemade, Shubham Sharma, Sunil Saumya, Shankar Biradar

Since personal computers became widely available in the consumer market, the amount of harmful content on the internet has significantly expanded.

Hope Speech Detection Sentence

Cannot find the paper you are looking for? You can Submit a new open access paper.