Search Results for author: Kousha Kalantari

Found 2 papers, 0 papers with code

Optimal Design for Human Feedback

no code implementations22 Apr 2024 Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton

Learning of preference models from human feedback has been central to recent advances in artificial intelligence.

Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

no code implementations13 Jun 2023 Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton

Our algorithms rely on non-uniform budget allocations among the arms where the arms with higher reward variances are pulled more often than those with lower variances.

Cannot find the paper you are looking for? You can Submit a new open access paper.