no code implementations • 13 Mar 2023 • Anh Nguyen, Nikos Karampatziakis, Weizhu Chen
Most language models (LMs) are trained and applied in an autoregressive left-to-right fashion, assuming that the next token only depends on the preceding ones.
Ranked #7 on Code Generation on HumanEval
1 code implementation • 19 Oct 2022 • Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro
Importantly, our methods can be employed while the original experiment is still running (that is, not necessarily post-hoc), when the logging policy may itself be changing (due to learning), and even if the context distributions form a highly dependent time series (for example, if they drift over time).
no code implementations • 6 Dec 2021 • Sandra Sajeev, Jade Huang, Nikos Karampatziakis, Matthew Hall, Sebastian Kochman, Weizhu Chen
We do, however, have access to partial feedback provided by the user (clicks, surveys, and other events) which can be leveraged to improve the user experience.
no code implementations • 18 Feb 2021 • Nikos Karampatziakis, Paul Mineiro, Aaditya Ramdas
We develop confidence bounds that hold uniformly over time for off-policy evaluation in the contextual bandit setting.
1 code implementation • NeurIPS 2020 • Nikos Karampatziakis, John Langford, Paul Mineiro
We propose an estimator and confidence interval for computing the value of a policy from off-policy data in the contextual bandit setting.
no code implementations • 6 May 2019 • Nikos Karampatziakis, Sebastian Kochman, Jade Huang, Paul Mineiro, Kathy Osborne, Weizhu Chen
In this work, we describe practical lessons we have learned from successfully using contextual bandits (CBs) to improve key business metrics of the Microsoft Virtual Agent for customer support.
no code implementations • ICML 2017 • Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis
We propose a novel coding theoretic framework for mitigating stragglers in distributed learning.
2 code implementations • 10 Dec 2016 • Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis
We propose a novel coding theoretic framework for mitigating stragglers in distributed learning.
1 code implementation • 7 Nov 2016 • Kalina Jasinska, Nikos Karampatziakis
We present LTLS, a technique for multiclass and multilabel prediction that can perform training and inference in logarithmic time and space.
no code implementations • ICML 2017 • Hal Daume III, Nikos Karampatziakis, John Langford, Paul Mineiro
Compared to previous approaches, we obtain substantially better statistical performance for two reasons: first, we prove a tighter and more complete boosting theorem, and second, we translate the results more directly into an algorithm.
no code implementations • 5 Feb 2016 • He He, Paul Mineiro, Nikos Karampatziakis
We propose a general framework for sequential and dynamic acquisition of useful information in order to solve a particular task.
General Reinforcement Learning, Reinforcement Learning (RL)
no code implementations • 10 Nov 2015 • Paul Mineiro, Nikos Karampatziakis
Extreme classification problems are multiclass and multilabel classification problems where the number of outputs is so large that straightforward strategies are neither statistically nor computationally viable.
no code implementations • 30 Mar 2015 • Paul Mineiro, Nikos Karampatziakis
Many modern multiclass and multilabel problems are characterized by increasingly large output spaces.
1 code implementation • 9 Feb 2015 • Nikos Karampatziakis, Paul Mineiro
In this work we show that a generic regularized nonlinearity mapping independent predictions to joint predictions is sufficient to achieve state-of-the-art performance on a variety of benchmark problems.
no code implementations • 19 Dec 2014 • Paul Mineiro, Nikos Karampatziakis
Many modern multiclass and multilabel problems are characterized by increasingly large output spaces.
no code implementations • 13 Nov 2014 • Paul Mineiro, Nikos Karampatziakis
We present RandomizedCCA, a randomized algorithm for computing canonical analysis, suitable for large datasets stored either out of core or on a distributed file system.
no code implementations • 18 Dec 2013 • Zhen Qin, Vaclav Petricek, Nikos Karampatziakis, Lihong Li, John Langford
Bootstrapping is a useful technique for estimating the uncertainty of a predictor, for example, confidence intervals for prediction.
no code implementations • 23 Oct 2013 • Nikos Karampatziakis, Paul Mineiro
Principal Component Analysis (PCA) is a ubiquitous tool with many applications in machine learning including feature construction, subspace embedding, and outlier detection.
no code implementations • 7 Oct 2013 • Alekh Agarwal, Sham M. Kakade, Nikos Karampatziakis, Le Song, Gregory Valiant
This work provides simple algorithms for multi-class (and multi-label) prediction in settings where both the number of examples n and the data dimension d are relatively large.
no code implementations • 7 Oct 2013 • Nikos Karampatziakis, Paul Mineiro
Representing examples in a way that is compatible with the underlying classifier can greatly enhance the performance of a learning system.
no code implementations • 7 Jun 2013 • Paul Mineiro, Nikos Karampatziakis
We propose a sampling scheme suitable for reducing a data set prior to selecting a hypothesis with minimum empirical risk.
no code implementations • NeurIPS 2010 • Nikos Karampatziakis
We cast the problem of identifying basic blocks of code in a binary executable as learning a mapping from a byte sequence to a segmentation of the sequence.