no code implementations • 4 Apr 2024 • Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Christopher Parisien
Recent advancements in instruction-tuning datasets have predominantly focused on specific tasks like mathematical or logical reasoning.
1 code implementation • 16 Nov 2023 • Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev
To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various aspects that make responses helpful.
no code implementations • 16 Nov 2023 • Yun-Shiuan Chuang, Yi Wu, Dhruv Gupta, Rheeya Uppaal, Ananya Kumar, Luhang Sun, Makesh Narsimhan Sreedhar, Sijia Yang, Timothy T. Rogers, Junjie Hu
Adapting pre-trained language models (PLMs) for time-series text classification amidst evolving domain shifts (EDS) is critical for maintaining accuracy in applications like stance detection.
1 code implementation • 9 Oct 2023 • Yi Dong, Zhilin Wang, Makesh Narsimhan Sreedhar, Xianchao Wu, Oleksii Kuchaiev
Model alignment with human preferences is an essential step in making Large Language Models (LLMs) helpful and consistent with human values.
no code implementations • 10 Nov 2022 • Makesh Narsimhan Sreedhar, Christopher Parisien
We show that canonical forms offer a promising alternative to traditional methods for intent classification.
1 code implementation • 23 May 2022 • Makesh Narsimhan Sreedhar, Xiangpeng Wan, Yu Cheng, Junjie Hu
Subword tokenization schemes are the dominant technique used in current NLP models.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Makesh Narsimhan Sreedhar, Kun Ni, Siva Reddy
The ubiquitous nature of chatbots and their interaction with users generate an enormous amount of data.
1 code implementation • 31 Mar 2020 • Victor Schmidt, Makesh Narsimhan Sreedhar, Mostafa ElAraby, Irina Rish
Unpaired Image-to-Image Translation (I2IT) tasks often suffer from lack of data, a problem which self-supervised learning (SSL) has recently been very popular and successful at tackling.