Search Results for author: Makesh Narsimhan Sreedhar

Found 8 papers, 5 papers with code

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

no code implementations4 Apr 2024 Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Christopher Parisien

Recent advancements in instruction-tuning datasets have predominantly focused on specific tasks like mathematical or logical reasoning.

Chatbot Instruction Following +1

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

1 code implementation16 Nov 2023 Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev

To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various aspects that make responses helpful.

Attribute

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

no code implementations16 Nov 2023 Yun-Shiuan Chuang, Yi Wu, Dhruv Gupta, Rheeya Uppaal, Ananya Kumar, Luhang Sun, Makesh Narsimhan Sreedhar, Sijia Yang, Timothy T. Rogers, Junjie Hu

Adapting pre-trained language models (PLMs) for time-series text classification amidst evolving domain shifts (EDS) is critical for maintaining accuracy in applications like stance detection.

Domain Adaptation Stance Detection +3

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

1 code implementation9 Oct 2023 Yi Dong, Zhilin Wang, Makesh Narsimhan Sreedhar, Xianchao Wu, Oleksii Kuchaiev

Model alignment with human preferences is an essential step in making Large Language Models (LLMs) helpful and consistent with human values.

Attribute

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

no code implementations10 Nov 2022 Makesh Narsimhan Sreedhar, Christopher Parisien

We show that canonical forms offer a promising alternative to traditional methods for intent classification.

Domain Adaptation intent-classification +4

Towards Lifelong Self-Supervision For Unpaired Image-to-Image Translation

1 code implementation31 Mar 2020 Victor Schmidt, Makesh Narsimhan Sreedhar, Mostafa ElAraby, Irina Rish

Unpaired Image-to-Image Translation (I2IT) tasks often suffer from lack of data, a problem which self-supervised learning (SSL) has recently been very popular and successful at tackling.

Colorization Continual Learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.