no code implementations • 28 Aug 2023 • Uri Sherman, Alon Cohen, Tomer Koren, Yishay Mansour
We study regret minimization in online episodic linear Markov Decision Processes, and obtain rate-optimal $\widetilde O (\sqrt K)$ regret where $K$ denotes the number of episodes.
no code implementations • 12 Mar 2023 • Dana Azouri, Oz Granit, Michael Alburquerque, Yishay Mansour, Tal Pupko, Itay Mayrose
Our proposed method does not require likelihood calculation with every step, nor is it limited to greedy uphill moves in the likelihood space.
no code implementations • 2 Mar 2023 • Orin Levy, Alon Cohen, Asaf Cassel, Yishay Mansour
To the best of our knowledge, our algorithm is the first efficient rate optimal regret minimization algorithm for adversarial CMDPs that operates under the minimal standard assumption of online function approximation.
no code implementations • 27 Feb 2023 • Haim Kaplan, Yishay Mansour, Shay Moran, Kobbi Nissim, Uri Stemmer
In this work we introduce an interactive variant of joint differential privacy towards handling online processes in which existing privacy definitions seem too restrictive.
no code implementations • 3 Feb 2023 • Christoph Dann, Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan
We then use that to show, modulo mild normalization assumptions, that there exists an $\ell_\infty$-approachability algorithm whose convergence is independent of the dimension of the original vectorial payoff.
1 code implementation • 1 Feb 2023 • Yogev Bar-On, Yishay Mansour
Decentralized Exchanges (DEXs) are new types of marketplaces leveraging Blockchain technology.
no code implementations • 30 Jan 2023 • Uri Sherman, Tomer Koren, Yishay Mansour
We study reinforcement learning with linear function approximation and adversarially changing cost functions, a setup that has mostly been considered under simplifying assumptions such as full information feedback or exploratory conditions. We present a computationally efficient policy optimization algorithm for the challenging general setting of unknown dynamics and bandit feedback, featuring a combination of mirror-descent and least squares policy evaluation in an auxiliary MDP used to compute exploration bonuses. Our algorithm obtains an $\widetilde O(K^{6/7})$ regret bound, improving significantly over previous state-of-the-art of $\widetilde O (K^{14/15})$ in this setting.
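As a rough illustration of the mirror-descent component mentioned above, the sketch below performs one exponential-weights policy update from bonus-adjusted loss estimates. The tabular setting and all names here are assumptions for illustration; the paper's actual algorithm, including the least-squares policy evaluation in the auxiliary MDP, is not reproduced.

```python
import numpy as np

def mirror_descent_policy_step(policy, q_hat, bonus, eta=0.1):
    """One exponential-weights (mirror descent) update of a tabular policy.

    policy: (num_states, num_actions) array whose rows sum to 1.
    q_hat:  estimated Q-values of the current episode's losses (hypothetical estimates).
    bonus:  exploration bonuses subtracted from the loss estimates.
    eta:    mirror-descent step size.
    Illustrative sketch only, not the paper's algorithm.
    """
    logits = np.log(policy + 1e-12) - eta * (q_hat - bonus)  # smaller estimated loss -> larger weight
    logits -= logits.max(axis=1, keepdims=True)               # numerical stability
    new_policy = np.exp(logits)
    return new_policy / new_policy.sum(axis=1, keepdims=True)
```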
no code implementations • 29 Jan 2023 • Jay Tenenbaum, Haim Kaplan, Yishay Mansour, Uri Stemmer
the counter problem) and show that the concurrent shuffle model allows for significantly improved error compared to a standard (single) shuffle model.
no code implementations • 8 Dec 2022 • Olivier Bousquet, Haim Kaplan, Aryeh Kontorovich, Yishay Mansour, Shay Moran, Menachem Sadigurschi, Uri Stemmer
We construct a universally Bayes consistent learning rule that satisfies differential privacy (DP).
no code implementations • 27 Nov 2022 • Orin Levy, Asaf Cassel, Alon Cohen, Yishay Mansour
To the best of our knowledge, our algorithm is the first efficient and rate-optimal regret minimization algorithm for CMDPs that operates under the general offline function approximation setting.
no code implementations • 27 Sep 2022 • Aadirupa Saha, Tomer Koren, Yishay Mansour
We address the problem of convex optimization with dueling feedback, where the goal is to minimize a convex function given a weaker form of dueling feedback.
no code implementations • 28 Jul 2022 • Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour
Our key observation is that online learning via policy optimization in Markov games essentially reduces to a form of weighted regret minimization, with unknown weights determined by the path length of the agents' policy sequence.
no code implementations • 22 Jul 2022 • Orin Levy, Yishay Mansour
For the latter, our algorithm obtains regret bound of $\widetilde{O}( (H+{1}/{p_{min}})H|S|^{3/2}\sqrt{|A|T\log(\max\{|\mathcal{G}|,|\mathcal{P}|\}/\delta)})$ with probability $1-\delta$, where $\mathcal{P}$ and $\mathcal{G}$ are finite and realizable function classes used to approximate the dynamics and rewards respectively, $p_{min}$ is the minimum reachability parameter, $S$ is the set of states, $A$ the set of actions, $H$ the horizon, and $T$ the number of episodes.
no code implementations • 19 Jun 2022 • Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan
This paper presents a theoretical analysis of such policies and provides the first regret and sample-complexity bounds for reinforcement learning with myopic exploration.
no code implementations • 9 Jun 2022 • Yishay Mansour, Michal Moshkovitz, Cynthia Rudin
Interpretability is an essential building block for trustworthiness in reinforcement learning systems.
no code implementations • 19 May 2022 • Yishay Mansour, Richard Nock, Robert C. Williamson
A landmark negative result of Long and Servedio established a worst-case spectacular failure of a supervised learning trio (loss, algorithm, model) otherwise praised for its high precision machinery.
no code implementations • 17 May 2022 • Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan
We study repeated two-player games where one of the players, the learner, employs a no-regret learning strategy, while the other, the optimizer, is a rational utility maximizer.
no code implementations • 25 Mar 2022 • Omer Ben-Porat, Lee Cohen, Liu Leqi, Zachary C. Lipton, Yishay Mansour
We first address the case where all users share the same type, demonstrating that a recent UCB-based algorithm is optimal.
no code implementations • 2 Mar 2022 • Orin Levy, Yishay Mansour
We study learning contextual MDPs using a function approximation for both the rewards and the dynamics.
no code implementations • 27 Feb 2022 • Tomer Koren, Roi Livni, Yishay Mansour, Uri Sherman
We study to what extent may stochastic gradient descent (SGD) be understood as a "conventional" learning rule that achieves generalization performance by obtaining a good fit to training data.
no code implementations • 23 Feb 2022 • Lee Cohen, Yishay Mansour, Michal Moshkovitz
Given a policy of a Markov Decision Process, we define a SafeZone as a subset of states, such that most of the policy's trajectories are confined to this subset.
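A minimal sketch of the quantity this definition controls: given sampled trajectories and a candidate subset of states, estimate how often the policy escapes it. The function and argument names are hypothetical, and finding a small SafeZone, which is the paper's actual problem, is not addressed here.

```python
def escape_fraction(trajectories, safe_zone):
    """Fraction of trajectories that leave a candidate SafeZone.

    trajectories: list of state sequences sampled from the policy.
    safe_zone:    set of states.
    Illustrative only; it checks a given subset rather than finding one.
    """
    escaped = sum(any(s not in safe_zone for s in traj) for traj in trajectories)
    return escaped / len(trajectories)
```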
no code implementations • 12 Feb 2022 • Eitan-Hai Mashiah, Idan Attias, Yishay Mansour
Following this, we show how to compute both the optimal pure and mixed strategies.
no code implementations • 11 Feb 2022 • Idan Attias, Steve Hanneke, Yishay Mansour
This shows that there is a significant benefit in semi-supervised robust learning even in the worst-case distribution-free model, and establishes a gap between the supervised and semi-supervised label complexities which is known not to hold in standard non-robust PAC learning.
no code implementations • 10 Feb 2022 • Olivier Bousquet, Amit Daniely, Haim Kaplan, Yishay Mansour, Shay Moran, Uri Stemmer
Our transformation readily implies monotone learners in a variety of contexts: for example it extends Pestov's result to classification tasks with an arbitrary number of labels.
no code implementations • 31 Jan 2022 • Tal Lancewicki, Aviv Rosenberg, Yishay Mansour
We study cooperative online learning in stochastic and adversarial Markov decision processes (MDPs).
1 code implementation • 31 Jan 2022 • Alexander Soen, Ibrahim Alabdulmohsin, Sanmi Koyejo, Yishay Mansour, Nyalleng Moorosi, Richard Nock, Ke Sun, Lexing Xie
We introduce a new family of techniques to post-process ("wrap") a black-box classifier in order to reduce its bias.
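To make the "wrapping" idea concrete, here is a toy post-processing wrapper that leaves the black box untouched and only adjusts decision thresholds per group. This is an assumed, simplified stand-in, not the family of wrappers introduced in the paper.

```python
def wrap_classifier(predict_proba, thresholds):
    """Wrap a black-box scorer with group-specific decision thresholds.

    predict_proba: callable x -> score in [0, 1] (the unchanged black box).
    thresholds:    dict mapping a sensitive group to its decision threshold.
    Toy illustration of post-processing; not the paper's actual technique.
    """
    def wrapped(x, group):
        return int(predict_proba(x) >= thresholds[group])
    return wrapped
```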
no code implementations • 31 Jan 2022 • Tiancheng Jin, Tal Lancewicki, Haipeng Luo, Yishay Mansour, Aviv Rosenberg
The standard assumption in reinforcement learning (RL) is that agents observe feedback for their actions immediately.
no code implementations • 29 Dec 2021 • Edith Cohen, Haim Kaplan, Yishay Mansour, Uri Stemmer, Eliad Tsfadia
Clustering is a fundamental problem in data analysis.
no code implementations • 6 Dec 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Claudio Gentile, Yishay Mansour
We investigate a nonstochastic bandit setting in which the loss of an action is not immediately charged to the player, but rather spread over the subsequent rounds in an adversarial way.
no code implementations • 19 Oct 2021 • Eliad Tsfadia, Edith Cohen, Haim Kaplan, Yishay Mansour, Uri Stemmer
Differentially private algorithms for common metric aggregation tasks, such as clustering or averaging, often have limited practicality due to their complexity or to the large number of data points that is required for accurate results.
no code implementations • NeurIPS 2021 • Lee Cohen, Ulrike Schmidt-Kraepelin, Yishay Mansour
We introduce the dueling teams problem, a new online-learning setting in which the learner observes noisy comparisons of disjoint pairs of $k$-sized teams from a universe of $n$ players.
no code implementations • NeurIPS 2021 • Uri Sherman, Tomer Koren, Yishay Mansour
We study online convex optimization in the random order model, recently proposed by Garber et al. (2020), where the loss functions may be chosen by an adversary, but are then presented to the online algorithm in a uniformly random order.
no code implementations • NeurIPS 2021 • Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan
In this work, we consider the more realistic setting of agnostic RL with rich observation spaces and a fixed class of policies $\Pi$ that may not contain any near-optimal policy.
no code implementations • NeurIPS 2021 • Jay Tenenbaum, Haim Kaplan, Yishay Mansour, Uri Stemmer
We give an $(\varepsilon,\delta)$-differentially private algorithm for the multi-armed bandit (MAB) problem in the shuffle model with a distribution-dependent regret of $O\left(\left(\sum_{a\in [k]:\Delta_a>0}\frac{\log T}{\Delta_a}\right)+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, and a distribution-independent regret of $O\left(\sqrt{kT\log T}+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, where $T$ is the number of rounds, $\Delta_a$ is the suboptimality gap of the arm $a$, and $k$ is the total number of arms.
no code implementations • 4 Jun 2021 • Tal Lancewicki, Shahar Segal, Tomer Koren, Yishay Mansour
We study the stochastic Multi-Armed Bandit (MAB) problem with random delays in the feedback received by the algorithm.
no code implementations • NeurIPS 2021 • Alon Cohen, Yonathan Efroni, Yishay Mansour, Aviv Rosenberg
In this work we show that the minimax regret for this setting is $\widetilde O(\sqrt{ (B_\star^2 + B_\star) |S| |A| K})$ where $B_\star$ is a bound on the expected cost of the optimal policy from any state, $S$ is the state space, and $A$ is the action space.
no code implementations • 31 Jan 2021 • Alon Cohen, Haim Kaplan, Tomer Koren, Yishay Mansour
We study a novel variant of online finite-horizon Markov Decision Processes with adversarially changing loss functions and initially unknown dynamics.
no code implementations • 26 Jan 2021 • Haim Kaplan, Yishay Mansour, Kobbi Nissim, Uri Stemmer
We present a streaming problem for which every adversarially-robust streaming algorithm must use polynomial space, while there exists a classical (oblivious) streaming algorithm that uses only polylogarithmic space.
Data Structures and Algorithms
no code implementations • 29 Dec 2020 • Tal Lancewicki, Aviv Rosenberg, Yishay Mansour
We present novel algorithms based on policy optimization that achieve near-optimal high-probability regret of $\widetilde O ( \sqrt{K} + \sqrt{D} )$ under full-information feedback, where $K$ is the number of episodes and $D = \sum_{k} d^k$ is the total delay.
no code implementations • 27 Oct 2020 • Aadirupa Saha, Tomer Koren, Yishay Mansour
We introduce the problem of regret minimization in Adversarial Dueling Bandits.
no code implementations • 2 Oct 2020 • Haim Kaplan, Yishay Mansour, Uri Stemmer
This simple algorithm privately tests whether the value of a given query on a database is close to what we expect it to be.
1 code implementation • NeurIPS 2021 • Aviv Rosenberg, Yishay Mansour
We study regret minimization in non-episodic factored Markov decision processes (FMDPs), where all existing algorithms make the strong assumption that the factored structure of the FMDP is known to the learner in advance.
no code implementations • 21 Aug 2020 • Pranjal Awasthi, Corinna Cortes, Yishay Mansour, Mehryar Mohri
In the adversarial setting, we design efficient algorithms with competitive ratio guarantees.
no code implementations • ICLR 2020 • Raphael Fettaya, Yishay Mansour
We show, using a data set of 90000 files downloadable online, that our approach maintains a high detection rate (94%) of PDF malware and even detects new malicious files, still undetected by most antiviruses.
no code implementations • 20 Jul 2020 • Guy Aridor, Yishay Mansour, Aleksandrs Slivkins, Zhiwei Steven Wu
Users arrive one by one and choose between the two firms, so that each firm makes progress on its bandit problem only if it is chosen.
no code implementations • 19 Jul 2020 • Yishay Mansour, Mehryar Mohri, Jae Ro, Ananda Theertha Suresh, Ke Wu
We present a theoretical and algorithmic study of the multiple-source domain adaptation problem in the common scenario where the learner has access only to a limited amount of labeled target data, but has at its disposal a large amount of labeled data from multiple source domains.
no code implementations • 20 Jun 2020 • Aviv Rosenberg, Yishay Mansour
Stochastic shortest path (SSP) is a well-known problem in planning and control, in which an agent has to reach a goal state in minimum total expected cost.
no code implementations • NeurIPS 2020 • Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan
We study episodic reinforcement learning in Markov decision processes when the agent receives additional feedback per step in the form of several transition observations.
no code implementations • NeurIPS 2020 • Eliran Shabat, Lee Cohen, Yishay Mansour
There is a growing interest in societal concerns in machine learning systems, especially in fairness.
no code implementations • NeurIPS 2020 • Haim Kaplan, Yishay Mansour, Uri Stemmer, Eliad Tsfadia
We present a differentially private learner for halfspaces over a finite grid $G$ in $\mathbb{R}^d$ with sample complexity $\approx d^{2.5}\cdot 2^{\log^*|G|}$, which improves the state-of-the-art result of [Beimel et al., COLT 2019] by a $d^2$ factor.
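For readers unfamiliar with the iterated logarithm appearing in this bound, a small helper (hypothetical, for illustration only) makes the point that the $2^{\log^*|G|}$ factor is effectively a constant:

```python
import math

def log_star(x, base=2.0):
    """Iterated logarithm: number of times log must be applied until x <= 1.

    Illustrates why the 2^{log*|G|} factor is tiny in practice:
    log_star(2**65536) == 5, so the factor is at most 32 for any
    conceivable grid size.
    """
    count = 0
    while x > 1:
        x = math.log(x, base)
        count += 1
    return count
```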
no code implementations • NeurIPS 2020 • Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Yossi Matias, Uri Stemmer
A streaming algorithm is said to be adversarially robust if its accuracy guarantees are maintained even when the data stream is chosen maliciously, by an adaptive adversary.
1 code implementation • 25 Feb 2020 • Yishay Mansour, Mehryar Mohri, Jae Ro, Ananda Theertha Suresh
The standard objective in machine learning is to train a single model for all users.
no code implementations • NeurIPS 2020 • Idan Amir, Idan Attias, Tomer Koren, Roi Livni, Yishay Mansour
We revisit the fundamental problem of prediction with expert advice, in a setting where the environment is benign and generates losses stochastically, but the feedback observed by the learner is subject to a moderate adversarial corruption.
no code implementations • ICML 2020 • Alon Cohen, Haim Kaplan, Yishay Mansour, Aviv Rosenberg
In this work we remove this dependence on the minimum cost---we give an algorithm that guarantees a regret bound of $\widetilde{O}(B_\star |S| \sqrt{|A| K})$, where $B_\star$ is an upper bound on the expected cost of the optimal policy, $S$ is the set of states, $A$ is the set of actions and $K$ is the number of episodes.
no code implementations • NeurIPS 2019 • Aviv Rosenberg, Yishay Mansour
We consider online learning in episodic loop-free Markov decision processes (MDPs), where the loss function can change arbitrarily between episodes.
no code implementations • 22 Nov 2019 • Haim Kaplan, Katrina Ligett, Yishay Mansour, Moni Naor, Uri Stemmer
This problem has received much attention recently; unlike the non-private case, where the sample complexity is independent of the domain size and just depends on the desired accuracy and confidence, for private learning the sample complexity must depend on the domain size $X$ (even for approximate differential privacy).
no code implementations • 5 Nov 2019 • Tom Zahavy, Alon Cohen, Haim Kaplan, Yishay Mansour
Specifically, we show that a variation of the FW method that is based on taking "away steps" achieves a linear rate of convergence when applied to AL and that a stochastic version of the FW algorithm can be used to avoid precise estimation of feature expectations.
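The sketch below shows the generic "away step" rule the excerpt alludes to, under the assumption that the feasible set is the convex hull of finitely many atoms; the apprenticeship-learning specifics (feature expectations, stochastic gradients) are not modeled.

```python
import numpy as np

def away_step_frank_wolfe_step(grad, vertices, active_weights):
    """Choose between a standard Frank-Wolfe step and an "away step".

    grad:           gradient of the objective at the current iterate.
    vertices:       array (m, d) of feasible-set vertices (atoms).
    active_weights: dict vertex_index -> weight in the current convex combination.
    Returns (direction, kind). Generic sketch only, not the paper's algorithm.
    """
    x = sum(w * vertices[i] for i, w in active_weights.items())   # current iterate
    fw_idx = int(np.argmin(vertices @ grad))                      # best Frank-Wolfe vertex
    away_idx = max(active_weights, key=lambda i: vertices[i] @ grad)  # worst active vertex
    d_fw = vertices[fw_idx] - x
    d_away = x - vertices[away_idx]
    if grad @ d_fw <= grad @ d_away:      # pick the direction with the smaller directional derivative
        return d_fw, "frank-wolfe"
    return d_away, "away"
```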
no code implementations • NeurIPS 2019 • Yogev Bar-On, Yishay Mansour
We study agents communicating over an underlying network by exchanging messages, in order to optimize their individual regret in a common nonstochastic multi-armed bandit problem.
no code implementations • 21 Jun 2019 • Yuval Lewi, Haim Kaplan, Yishay Mansour
We also bound the regret of those sequences: the worst-case sequences have regret $O(\sqrt{T})$ and the best-case sequences have regret $O(1)$.
no code implementations • NeurIPS 2019 • Roi Livni, Yishay Mansour
A function $g\in \mathcal{G}$ distinguishes between two distributions, if the expected value of $g$, on a $k$-tuple of i.i.d. examples, on the two distributions is (significantly) different.
no code implementations • NeurIPS 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Yishay Mansour, Vianney Perchet
We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making.
no code implementations • 28 May 2019 • Idan Rejwan, Yishay Mansour
Top-k Combinatorial Bandits generalize multi-armed bandits, where at each round any subset of $k$ out of $n$ arms may be chosen and the sum of the rewards is gained.
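As a simple reference point for this setting, a generic UCB-style selection of $k$ arms per round might look as follows. The names and index form are assumptions; the paper's algorithms and guarantees are more refined.

```python
import numpy as np

def top_k_ucb_round(counts, means, t, k):
    """Select k arms by UCB index in a Top-k combinatorial bandit round.

    counts: number of pulls per arm.
    means:  empirical mean reward per arm.
    t:      current round (1-indexed).
    k:      subset size.
    Generic UCB sketch only, not the paper's algorithm.
    """
    ucb = means + np.sqrt(2 * np.log(max(t, 2)) / np.maximum(counts, 1))
    ucb[counts == 0] = np.inf                 # force initial exploration of unplayed arms
    return np.argsort(-ucb)[:k]               # the k arms with the largest UCB indices
```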
no code implementations • 27 May 2019 • Lee Cohen, Zachary C. Lipton, Yishay Mansour
We analyze the optimal employer policy both when the employer sets a fixed number of tests per candidate and when the employer can set a dynamic policy, assigning further tests adaptively based on results from the previous tests.
no code implementations • 23 May 2019 • Tom Zahavy, Alon Cohen, Haim Kaplan, Yishay Mansour
We derive and analyze learning algorithms for apprenticeship learning, policy evaluation, and policy gradient for average reward criteria.
no code implementations • 19 May 2019 • Aviv Rosenberg, Yishay Mansour
We consider online learning in episodic loop-free Markov decision processes (MDPs), where the loss function can change arbitrarily between episodes, and the transition function is not known to the learner.
no code implementations • 7 Apr 2019 • Amit Daniely, Yishay Mansour
Our end result is an online algorithm that can combine a "base" online algorithm, having a guaranteed competitive ratio, with a range of online algorithms that guarantee a small regret over any interval of time.
no code implementations • 26 Feb 2019 • Tom Zahavy, Avinatan Hassidim, Haim Kaplan, Yishay Mansour
We consider a setting of hierarchical reinforcement learning in which the reward is a sum of components.
Hierarchical Reinforcement Learning
reinforcement-learning
no code implementations • 17 Feb 2019 • Alon Cohen, Tomer Koren, Yishay Mansour
We present the first computationally-efficient algorithm with $\widetilde O(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics.
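For context, the certainty-equivalence building block that such regret analyses revolve around is the Riccati solution of the estimated system. The sketch below computes that controller under assumed names; it is not the paper's exploration scheme.

```python
import numpy as np

def lqr_gain(A, B, Q, R, iters=500):
    """Infinite-horizon LQR gain via Riccati iteration.

    Given estimated dynamics x_{t+1} = A x_t + B u_t and quadratic costs
    x'Qx + u'Ru, returns K such that u_t = -K x_t.
    Certainty-equivalence sketch only; the paper's algorithm is not reproduced.
    """
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)  # gain for current value matrix
        P = Q + A.T @ P @ (A - B @ K)                       # Riccati recursion
    return K
```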
no code implementations • NeurIPS 2019 • Alon Cohen, Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Shay Moran
(ii) In the second variant it is assumed that before the process starts, the algorithm has access to a training set of $n$ items drawn independently from the same unknown distribution (e.g., data of candidates from previous recruitment seasons).
no code implementations • 13 Feb 2019 • Haim Kaplan, Yishay Mansour, Yossi Matias, Uri Stemmer
We present differentially private efficient algorithms for learning union of polygons in the plane (which are not necessarily convex).
no code implementations • 22 Oct 2018 • Alon Resler, Yishay Mansour
We present and study models of adversarial online learning where the feedback observed by the learner is noisy, and the feedback is either full information feedback or bandit feedback.
no code implementations • 4 Oct 2018 • Idan Attias, Aryeh Kontorovich, Yishay Mansour
For binary classification, the algorithm of Feige et al. (2015) uses a regret minimization algorithm and an ERM oracle as a black box; we adapt it for the multiclass and regression settings.
no code implementations • ICML 2018 • Alon Cohen, Avinatan Hassidim, Tomer Koren, Nevena Lazic, Yishay Mansour, Kunal Talwar
We study the problem of controlling linear time-invariant systems with known noisy dynamics and adversarially chosen quadratic losses.
no code implementations • 13 May 2018 • Amir Ban, Yishay Mansour
We note that this would make it possible to aggregate multiple predictions into a result that is more accurate than their consensus average, and that the improvement prospects grow with the amount of differentiation.
no code implementations • 7 May 2018 • Craig Boutilier, Alon Cohen, Amit Daniely, Avinatan Hassidim, Yishay Mansour, Ofer Meshi, Martin Mladenov, Dale Schuurmans
From an RL perspective, we show that Q-learning with sampled action sets is sound.
no code implementations • 13 Mar 2018 • Tom Zahavy, Avinatan Hasidim, Haim Kaplan, Yishay Mansour
In this work, we provide theoretical guarantees for reward decomposition in deterministic MDPs.
Hierarchical Reinforcement Learning
reinforcement-learning
no code implementations • NeurIPS 2017 • Tomer Koren, Roi Livni, Yishay Mansour
We consider the non-stochastic Multi-Armed Bandit problem in a setting where there is a fixed and known metric on the action space that determines a cost for switching between any pair of actions.
no code implementations • 16 Oct 2017 • Nir Rosenfeld, Yishay Mansour, Elad Yom-Tov
Most current methods for constructing prediction intervals offer guarantees for a single new test point.
no code implementations • NeurIPS 2017 • Noga Alon, Moshe Babaioff, Yannai A. Gonczarowski, Yishay Mansour, Shay Moran, Amir Yehudayoff
In this work we derive a variant of the classic Glivenko-Cantelli Theorem, which asserts uniform convergence of the empirical Cumulative Distribution Function (CDF) to the CDF of the underlying distribution.
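The quantity whose uniform convergence is at stake is the sup-gap between the empirical and true CDFs. A small sketch for computing it on a sample (standard Kolmogorov-Smirnov bookkeeping, not the paper's new variant) is:

```python
import numpy as np

def ecdf_sup_deviation(samples, cdf):
    """Largest gap between the empirical CDF and a reference CDF.

    samples: 1-D array of draws.
    cdf:     callable giving the reference CDF at a point (vectorized).
    Computes sup_x |F_n(x) - F(x)|, the quantity the Glivenko-Cantelli
    theorem controls.  Illustrative helper only.
    """
    x = np.sort(samples)
    n = len(x)
    F = cdf(x)
    upper = np.max(np.arange(1, n + 1) / n - F)   # empirical CDF from above
    lower = np.max(F - np.arange(0, n) / n)       # empirical CDF from below
    return max(upper, lower)
```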
no code implementations • 21 Mar 2017 • Pranjal Awasthi, Avrim Blum, Nika Haghtalab, Yishay Mansour
When a noticeable fraction of the labelers are perfect, and the rest behave arbitrarily, we show that any $\mathcal{F}$ that can be efficiently learned in the traditional realizable PAC model can be learned in a computationally efficient manner by querying the crowd, despite high amounts of noise in the responses.
no code implementations • 27 Feb 2017 • Yishay Mansour, Aleksandrs Slivkins, Zhiwei Steven Wu
Most modern systems strive to learn from interactions with users, and many engage in exploration: making potentially suboptimal choices for the sake of acquiring new information.
no code implementations • 24 Feb 2017 • Tomer Koren, Roi Livni, Yishay Mansour
In this setting, we give a new algorithm that establishes a regret of $\widetilde{O}(\sqrt{kT} + T/k)$, where $k$ is the number of actions and $T$ is the time horizon.
no code implementations • 23 Feb 2017 • Assaf Hallak, Yishay Mansour, Elad Yom-Tov
The LTV approach considers the future implications of the item recommendation, and seeks to maximize the cumulative gain over time.
no code implementations • NeurIPS 2016 • Michal Feldman, Tomer Koren, Roi Livni, Yishay Mansour, Aviv Zohar
We consider a seller with an unlimited supply of a single good, who is faced with a stream of $T$ buyers.
no code implementations • 24 Oct 2016 • Nir Rosenfeld, Yishay Mansour, Elad Yom-Tov
The conventional way to answer this counterfactual question is to estimate the effect of the new treatment in comparison to that of the conventional treatment by running a controlled, randomized experiment.
no code implementations • 21 Mar 2016 • Elad Hazan, Tomer Koren, Roi Livni, Yishay Mansour
We consider the problem of prediction with expert advice when the losses of the experts have low-dimensional structure: they are restricted to an unknown $d$-dimensional subspace.
no code implementations • 24 Feb 2016 • Yishay Mansour, Aleksandrs Slivkins, Vasilis Syrgkanis, Zhiwei Steven Wu
As a key technical tool, we introduce the concept of explorable actions, the actions which some incentive-compatible policy can recommend with non-zero probability.
no code implementations • 15 Feb 2016 • Nicolò Cesa-Bianchi, Claudio Gentile, Yishay Mansour, Alberto Minora
We introduce Exp3-Coop, a cooperative version of the Exp3 algorithm, and prove that with $K$ actions and $N$ agents the average per-agent regret after $T$ rounds is at most of order $\sqrt{\bigl(d+1 + \tfrac{K}{N}\alpha_{\le d}\bigr)(T\ln K)}$, where $\alpha_{\le d}$ is the independence number of the $d$-th power of the connected communication graph $G$.
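For reference, the per-agent Exp3 update that Exp3-Coop builds on can be sketched as below; the cooperative sharing of loss estimates over the communication graph, which is the paper's contribution, is omitted.

```python
import numpy as np

def exp3_update(weights, action, loss, prob, eta):
    """One Exp3 step with an importance-weighted loss estimate.

    weights: current nonnegative weights over the K actions.
    action:  index of the action that was played.
    loss:    observed loss of that action, in [0, 1].
    prob:    probability with which the action was played.
    eta:     learning rate.
    Single-agent sketch only; the cooperative averaging of Exp3-Coop is omitted.
    """
    loss_hat = np.zeros_like(weights)
    loss_hat[action] = loss / prob            # unbiased estimate of the full loss vector
    weights = weights * np.exp(-eta * loss_hat)
    return weights / weights.sum()
```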
no code implementations • 10 Nov 2015 • Maria Florina Balcan, Travis Dick, Yishay Mansour
We present a new perspective on the popular multi-class algorithmic techniques of one-vs-all and error correcting output codes.
no code implementations • 14 Jan 2015 • Elad Hazan, Roi Livni, Yishay Mansour
We consider classification and regression tasks where we have missing data and assume that the (clean) data resides in a low rank subspace.
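A classical baseline consistent with the low-rank assumption is iterative SVD imputation. The sketch below is only that baseline, under assumed names, and not the estimator analyzed in the paper.

```python
import numpy as np

def svd_impute(X, mask, rank, iters=50):
    """Fill missing entries by iterative low-rank (truncated SVD) projection.

    X:    data matrix with arbitrary values at missing positions.
    mask: boolean matrix, True where an entry is observed.
    rank: assumed dimension of the clean low-rank subspace.
    Classical baseline sketch only, not the paper's method.
    """
    Z = np.where(mask, X, 0.0)
    for _ in range(iters):
        U, s, Vt = np.linalg.svd(Z, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank]   # best rank-r approximation
        Z = np.where(mask, X, low_rank)                   # keep observed entries fixed
    return Z
```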
no code implementations • 5 Nov 2014 • Nicolò Cesa-Bianchi, Yishay Mansour, Ohad Shamir
In this paper, we study lower bounds on the error attainable by such methods as a function of the number of entries observed in the kernel matrix or the rank of an approximate kernel matrix.
no code implementations • 30 Sep 2014 • Noga Alon, Nicolò Cesa-Bianchi, Claudio Gentile, Shie Mannor, Yishay Mansour, Ohad Shamir
This naturally models several situations where the losses of different actions are related, and knowing the loss of one action provides information on the loss of other actions.
no code implementations • 3 Nov 2013 • Aditya Gopalan, Shie Mannor, Yishay Mansour
We consider stochastic multi-armed bandit problems with complex actions over a set of basic arms, where the decision maker plays a complex action rather than a basic arm in each round.
no code implementations • NeurIPS 2013 • Noga Alon, Nicolò Cesa-Bianchi, Claudio Gentile, Yishay Mansour
We consider the partial observability model for multi-armed bandits, introduced by Mannor and Shamir.
no code implementations • NeurIPS 2012 • Koby Crammer, Yishay Mansour
In this work we consider a setting where we have a very large number of related tasks with few examples from each individual task.
no code implementations • NeurIPS 2010 • Corinna Cortes, Yishay Mansour, Mehryar Mohri
This paper presents an analysis of importance weighting for learning from finite samples and gives a series of theoretical and algorithmic results.
no code implementations • 19 Feb 2009 • Yishay Mansour, Mehryar Mohri, Afshin Rostamizadeh
This motivates our analysis of the problem of minimizing the empirical discrepancy for various loss functions for which we also give novel algorithms.
no code implementations • NeurIPS 2008 • Yishay Mansour, Mehryar Mohri, Afshin Rostamizadeh
The problem consists of combining these hypotheses to derive a hypothesis with small error with respect to the target domain.