Search Results for author: Sridhar Thiagarajan

Found 4 papers, 0 papers with code

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

no code implementations • 18 Dec 2024 • Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Sridhar Thiagarajan, Craig Boutilier, Rishabh Agarwal, Aviral Kumar, Aleksandra Faust

Recent studies have indicated that effectively utilizing inference-time compute is crucial for attaining better performance from large language models (LLMs).

HumanEval, Imitation Learning +2

Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

no code implementations • 18 Sep 2024 • Arslan Chaudhry, Sridhar Thiagarajan, Dilan Gorur

In this work, we explore supervised finetuning on uncertainty-augmented predictions as a method to develop models that produce linguistic expressions of uncertainty.

Decision Making, Question Answering
