Search Results for author: Makesh Narsimhan Sreedhar

Found 8 papers, 5 papers with code

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

no code implementations • 4 Apr 2024 • Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Christopher Parisien

Recent advancements in instruction-tuning datasets have predominantly focused on specific tasks like mathematical or logical reasoning.

Chatbot Instruction Following +1

Paper
Add Code

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

1 code implementation • 16 Nov 2023 • Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev

To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various aspects that make responses helpful.

Attribute

175

Paper
Code

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

no code implementations • 16 Nov 2023 • Yun-Shiuan Chuang, Yi Wu, Dhruv Gupta, Rheeya Uppaal, Ananya Kumar, Luhang Sun, Makesh Narsimhan Sreedhar, Sijia Yang, Timothy T. Rogers, Junjie Hu

Adapting pre-trained language models (PLMs) for time-series text classification amidst evolving domain shifts (EDS) is critical for maintaining accuracy in applications like stance detection.

Domain Adaptation Stance Detection +3

Paper
Add Code

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

1 code implementation • 9 Oct 2023 • Yi Dong, Zhilin Wang, Makesh Narsimhan Sreedhar, Xianchao Wu, Oleksii Kuchaiev

Model alignment with human preferences is an essential step in making Large Language Models (LLMs) helpful and consistent with human values.

Attribute

175

Paper
Code

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

no code implementations • 10 Nov 2022 • Makesh Narsimhan Sreedhar, Christopher Parisien

We show that canonical forms offer a promising alternative to traditional methods for intent classification.

Domain Adaptation intent-classification +4

Paper
Add Code

Local Byte Fusion for Neural Machine Translation

1 code implementation • 23 May 2022 • Makesh Narsimhan Sreedhar, Xiangpeng Wan, Yu Cheng, Junjie Hu

Subword tokenization schemes are the dominant technique used in current NLP models.

Domain Adaptation Machine Translation +2

Paper
Code

Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Makesh Narsimhan Sreedhar, Kun Ni, Siva Reddy

The ubiquitous nature of chatbots and their interaction with users generate an enormous amount of data.

Chatbot

Paper
Code

Towards Lifelong Self-Supervision For Unpaired Image-to-Image Translation

1 code implementation • 31 Mar 2020 • Victor Schmidt, Makesh Narsimhan Sreedhar, Mostafa ElAraby, Irina Rish

Unpaired Image-to-Image Translation (I2IT) tasks often suffer from lack of data, a problem which self-supervised learning (SSL) has recently been very popular and successful at tackling.

Colorization Continual Learning +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.