Search Results for author: Kousha Kalantari

Found 2 papers, 0 papers with code

Optimal Design for Human Feedback

no code implementations • 22 Apr 2024 • Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton

Learning of preference models from human feedback has been central to recent advances in artificial intelligence.


Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

no code implementations • 13 Jun 2023 • Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton

Our algorithms rely on non-uniform budget allocations among the arms where the arms with higher reward variances are pulled more often than those with lower variances.
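As a rough illustration of what a variance-aware budget split can look like, the sketch below divides a fixed pull budget among arms in proportion to their reward variances. This is one simple rule assumed for illustration, not necessarily the allocation used in the paper; the function name `allocate_budget` and the assumption that the variances are known in advance are both hypothetical.

```python
def allocate_budget(variances, budget):
    """Split `budget` pulls among arms in proportion to their reward variances.

    Assumption (illustrative only): variances are known and non-negative.
    Rounding uses the largest-remainder rule so the pulls sum to `budget`.
    """
    total = sum(variances)
    if total <= 0:
        raise ValueError("at least one arm must have positive variance")

    # Ideal (fractional) share of the budget for each arm.
    raw = [budget * v / total for v in variances]

    # Start from the floor of each share.
    pulls = [int(r) for r in raw]

    # Hand out the remaining pulls to the arms with the largest fractional parts.
    leftover = budget - sum(pulls)
    by_remainder = sorted(
        range(len(variances)), key=lambda i: raw[i] - pulls[i], reverse=True
    )
    for i in by_remainder[:leftover]:
        pulls[i] += 1
    return pulls


if __name__ == "__main__":
    # An arm with three times the variance receives three times the pulls.
    print(allocate_budget([1.0, 3.0], 100))
```

With this rule, higher-variance arms are sampled more often, which matches the qualitative behavior described in the abstract; a uniform split would instead give every arm the same number of pulls regardless of noise.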

