Search Results for author: Masrour Zoghi

Found 10 papers, 4 papers with code

Ever Evolving Evaluator (EV3): Towards Flexible and Reliable Meta-Optimization for Knowledge Distillation

1 code implementation29 Oct 2023 Li Ding, Masrour Zoghi, Guy Tennenholtz, Maryam Karimzadehgan

We introduce EV3, a novel meta-optimization framework designed to efficiently train scalable machine learning models through an intuitive explore-assess-adapt protocol.

Evolutionary Algorithms Knowledge Distillation +2

Overcoming Prior Misspecification in Online Learning to Rank

1 code implementation25 Jan 2023 Javad Azizi, Ofer Meshi, Masrour Zoghi, Maryam Karimzadehgan

The recent literature on online learning to rank (LTR) has established the utility of prior knowledge to Bayesian ranking bandit algorithms.

Learning-To-Rank

MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation

1 code implementation11 Dec 2018 Chang Li, Ilya Markov, Maarten de Rijke, Masrour Zoghi

Our main finding is that for large-scale Condorcet ranker evaluation problems, MergeDTS outperforms the state-of-the-art dueling bandit algorithms.

Information Retrieval Online Ranker Evaluation +2

BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback

no code implementations15 Jun 2018 Chang Li, Branislav Kveton, Tor Lattimore, Ilya Markov, Maarten de Rijke, Csaba Szepesvari, Masrour Zoghi

In this paper, we study the problem of safe online learning to re-rank, where user feedback is used to improve the quality of displayed lists.

Learning-To-Rank Re-Ranking +1

Copeland Dueling Bandits

no code implementations NeurIPS 2015 Masrour Zoghi, Zohar Karnin, Shimon Whiteson, Maarten de Rijke

A version of the dueling bandit problem is addressed in which a Condorcet winner may not exist.

Contextual Dueling Bandits

no code implementations23 Feb 2015 Miroslav Dudík, Katja Hofmann, Robert E. Schapire, Aleksandrs Slivkins, Masrour Zoghi

The first of these algorithms achieves particularly low regret, even when data is adversarial, although its time and space requirements are linear in the size of the policy space.

Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem

no code implementations12 Dec 2013 Masrour Zoghi, Shimon Whiteson, Remi Munos, Maarten de Rijke

This paper proposes a new method for the K-armed dueling bandit problem, a variation on the regular K-armed bandit problem that offers only relative feedback about pairs of arms.

Information Retrieval Retrieval

Bayesian Optimization in a Billion Dimensions via Random Embeddings

1 code implementation9 Jan 2013 Ziyu Wang, Frank Hutter, Masrour Zoghi, David Matheson, Nando de Freitas

Bayesian optimization techniques have been successfully applied to robotics, planning, sensor placement, recommendation, advertising, intelligent user interfaces and automatic algorithm configuration.

Bayesian Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.