no code implementations • 9 Feb 2024 • Vinod Raman, Unique Subedi, Ambuj Tewari
We study the problem of learning to predict the next state of a dynamical system when the underlying evolution function is unknown.
no code implementations • 29 Oct 2023 • Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari
In particular, we show that in the realizable setting, the expected number of mistakes of any learner, under apple tasting feedback, can be $\Theta(1), \Theta(\sqrt{T})$, or $\Theta(T)$.
no code implementations • 8 Sep 2023 • Vinod Raman, Unique Subedi, Ambuj Tewari
Finally, we prove that the impossibility result and the separation between uniform convergence and learnability also hold in the batch setting.
no code implementations • 8 Aug 2023 • Ananth Raman, Vinod Raman, Unique Subedi, Idan Mehalel, Ambuj Tewari
We study online multiclass classification under bandit feedback.
no code implementations • 7 Jul 2023 • Vinod Raman, Unique Subedi, Ambuj Tewari
We study the online learnability of hypothesis classes with respect to arbitrary, but bounded loss functions.
no code implementations • 9 Jun 2023 • Vinod Raman, Unique Subedi, Ambuj Tewari
We study a variant of online multiclass classification where the learner predicts a single label but receives a \textit{set of labels} as feedback.
no code implementations • 30 Mar 2023 • Steve Hanneke, Shay Moran, Vinod Raman, Unique Subedi, Ambuj Tewari
We argue that the best expert has regret at most Littlestone dimension relative to the best concept in the class.
no code implementations • 6 Jan 2023 • Vinod Raman, Unique Subedi, Ambuj Tewari
This provides a complete characterization of the learnability of multilabel classification and multioutput regression in both batch and online settings.
1 code implementation • 30 May 2022 • Vinod Raman, Ambuj Tewari
In this way, boosting algorithms convert weak learners into strong ones.
no code implementations • 24 Oct 2019 • Vinod Raman, Daniel T. Zhang, Young Hun Jung, Ambuj Tewari
We present online boosting algorithms for multilabel ranking with top-k feedback, where the learner only receives information about the top k items from the ranking it provides.