no code implementations • 7 Mar 2025 • Greg d'Eon, Hala Murad, Kevin Leyton-Brown, James R. Wright
Models of human behavior in game-theoretic settings often distinguish between strategic behavior, in which a player both reasons about how others will act and best responds to these beliefs, and "level-0" non-strategic behavior, in which they do not respond to explicit beliefs about others.
no code implementations • 18 Feb 2025 • Narun Raman, Taylor Lundy, Thiago Amin, Jesse Perla, Kevin Leyton-Brown
How should one judge whether a given large language model (LLM) can reliably perform economic reasoning?
no code implementations • 16 Jun 2024 • Kevin Leyton-Brown, Yoav Shoham
Motivated by the rapid ascent of Large Language Models (LLMs) and debates about the extent to which they possess human-level qualities, we propose a framework for testing whether any agent (be it a machine or a human) understands a subject matter.
no code implementations • 28 May 2024 • Devon Graham, Kevin Leyton-Brown
Utilitarian algorithm configuration is a general-purpose technique for automatically searching the parameter space of a given algorithm to optimize its performance, as measured by a given utility function, on a given set of inputs.
1 code implementation • 10 May 2024 • Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, Tom Hope, Dirk Hovy, Jonathan K. Kummerfeld, Anne Lauscher, Kevin Leyton-Brown, Sheng Lu, Mausam, Margot Mieskes, Aurélie Névéol, Danish Pruthi, Lizhen Qu, Roy Schwartz, Noah A. Smith, Thamar Solorio, Jingyan Wang, Xiaodan Zhu, Anna Rogers, Nihar B. Shah, Iryna Gurevych
We hope that our work will help set the agenda for research in machine-assisted scientific quality control in the age of AI, within the NLP community and beyond.
1 code implementation • 29 Feb 2024 • Greg d'Eon, Neil Newman, Kevin Leyton-Brown
Iterative combinatorial auctions are widely used in high stakes settings such as spectrum auctions.
no code implementations • 14 Feb 2024 • Narun Raman, Taylor Lundy, Samuel Amouyal, Yoav Levine, Kevin Leyton-Brown, Moshe Tennenholtz
We begin by surveying the economic literature on rational decision making, taxonomizing a large set of fine-grained "elements" that an agent should exhibit, along with dependencies between them.
1 code implementation • NeurIPS 2023 • Devon R. Graham, Kevin Leyton-Brown, Tim Roughgarden
We prove upper bounds on the runtime of these procedures that are similar to theoretical lower bounds, while also demonstrating their performance empirically.
2 code implementations • 13 Jul 2023 • Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham
FACTOR automatically transforms a factual corpus of interest into a benchmark evaluating an LM's propensity to generate true facts from the corpus vs. similar but incorrect statements.
no code implementations • 7 Jun 2023 • Greg d'Eon, Sophie Greenwood, Kevin Leyton-Brown, James R. Wright
Researchers building behavioral models, such as behavioral game theorists, use experimental data to evaluate predictive models of human behavior.
2 code implementations • 31 Jan 2023 • Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham
Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance.
1 code implementation • 21 Dec 2022 • Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham
We present Parallel Context Windows (PCW), a method that alleviates the context window restriction for any off-the-shelf LLM without further training.
1 code implementation • 22 Nov 2022 • Chris Cameron, Jason Hartford, Taylor Lundy, Tuan Truong, Alan Milligan, Rex Chen, Kevin Leyton-Brown
We introduce Monte Carlo Forest Search (MCFS), a class of reinforcement learning (RL) algorithms for learning policies in {tree MDPs}, for which policy execution involves traversing an exponential-sized tree.
no code implementations • 31 Oct 2022 • Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram Kalyanakrishnan, Ece Kamar, Sarit Kraus, Kevin Leyton-Brown, David Parkes, William Press, AnnaLee Saxenian, Julie Shah, Milind Tambe, Astro Teller
In September 2016, Stanford's "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the first report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society.
1 code implementation • 2 Sep 2022 • Hedayat Zarkoob, Greg d'Eon, Lena Podina, Kevin Leyton-Brown
Peer grading systems aggregate noisy reports from multiple students to approximate a true grade as closely as possible.
1 code implementation • 25 May 2022 • Devon R. Graham, Kevin Leyton-Brown, Tim Roughgarden
We propose a principled alternative, taking a utility-theoretic approach to characterize the scoring functions that describe preferences over algorithms.
no code implementations • 1 May 2022 • Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tenenholtz
Huge language models (LMs) have ushered in a new era for AI, serving as a gateway to natural-language-based knowledge tasks.
no code implementations • 21 Apr 2022 • Yoav Levine, Itay Dalmedigos, Ori Ram, Yoel Zeldes, Daniel Jannai, Dor Muhlgay, Yoni Osin, Opher Lieber, Barak Lenz, Shai Shalev-Shwartz, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham
To demonstrate this, we introduce three novel methods for leveraging frozen models: input-dependent prompt tuning, frozen readers, and recursive LMs, each of which vastly improves on current frozen-model approaches.
1 code implementation • 24 Feb 2022 • Kevin Leyton-Brown, Mausam, Yatin Nandwani, Hedayat Zarkoob, Chris Cameron, Neil Newman, Dinesh Raghu
Peer-reviewed conferences, the main publication venues in CS, rely critically on matching highly qualified reviewers for each paper.
2 code implementations • 1 Jul 2021 • Greg d'Eon, Jason d'Eon, James R. Wright, Kevin Leyton-Brown
Supervised learning models often make systematic errors on rare subsets of the data.
no code implementations • 18 Jun 2021 • Chris Cameron, Jason Hartford, Taylor Lundy, Kevin Leyton-Brown
Typically, learning the prediction model used to generate the optimization problem and solving that problem are performed in two separate stages.
no code implementations • NeurIPS 2020 • Gellert Weisz, András György, Wei-I Lin, Devon Graham, Kevin Leyton-Brown, Csaba Szepesvari, Brendan Lucier
Algorithm configuration procedures optimize parameters of a given algorithm to perform well over a distribution of inputs.
no code implementations • 1 Dec 2020 • Natalie Collina, Nicole Immorlica, Kevin Leyton-Brown, Brendan Lucier, Neil Newman
The value of a match is determined by the types of the matched agents.
Computer Science and Game Theory Data Structures and Algorithms
no code implementations • NeurIPS 2020 • Jason Hartford, Kevin Leyton-Brown, Hadas Raviv, Dan Padnos, Shahar Lev, Barak Lenz
The challenge is that we are not informed which labels are common and which are rare, and the true label distribution may exhibit extreme skew.
1 code implementation • ICLR 2021 • Yoav Levine, Barak Lenz, Opher Lieber, Omri Abend, Kevin Leyton-Brown, Moshe Tennenholtz, Yoav Shoham
Specifically, we show experimentally that PMI-Masking reaches the performance of prior masking approaches in half the training time, and consistently improves performance at the end of training.
no code implementations • 19 Jun 2020 • Jason Hartford, Victor Veitch, Dhanya Sridhar, Kevin Leyton-Brown
The technique is simple to apply and is "black-box" in the sense that it may be used with any instrumental variable estimator as long as the treatment effect is identified for each valid instrument independently.
no code implementations • 8 Jun 2020 • Gal Bahar, Omer Ben-Porat, Kevin Leyton-Brown, Moshe Tennenholtz
A recent body of work addresses safety constraints in explore-and-exploit systems.
no code implementations • 21 Mar 2020 • Devon Graham, Satish Kumar Sarraf, Taylor Lundy, Ali MohammadMehr, Sara Uppal, Tae Yoon Lee, Hedayat Zarkoob, Scott Duke Kominers, Kevin Leyton-Brown
To see where this might be true in downtown Vancouver, we used artificial intelligence techniques to estimate the amount of time it would take drivers to both park on and off street for destinations throughout the city.
no code implementations • ICML 2020 • Gal Bahar, Omer Ben-Porat, Kevin Leyton-Brown, Moshe Tennenholtz
Recommendation systems often face exploration-exploitation tradeoffs: the system can only learn about the desirability of new options by recommending them to some user.
1 code implementation • NeurIPS 2019 • Robert Kleinberg, Kevin Leyton-Brown, Brendan Lucier, Devon Graham
Unfortunately, Structured Procrastination is not $\textit{adaptive}$ to characteristics of the parameterized algorithm: it treats every input like the worst case.
1 code implementation • ICML 2018 • Jason Hartford, Devon R Graham, Kevin Leyton-Brown, Siamak Ravanbakhsh
In experiments, our models achieved surprisingly good generalization performance on this matrix extrapolation task, both within domains (e. g., new users and new movies drawn from the same distribution used for training) and even across domains (e. g., predicting music ratings after training on movies).
Ranked #3 on
Recommendation Systems
on YahooMusic Monti
1 code implementation • ICML 2017 • Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy
Counterfactual prediction requires understanding causal relationships between so-called treatment and outcome variables.
no code implementations • 11 Jun 2017 • Neil Newman, Alexandre Fréchette, Kevin Leyton-Brown
A crucial element of the auction design was the construction of a solver, dubbed SATFC, that determined whether sets of stations could be "repacked" in this way; it needed to run every time a station was given a price quote.
no code implementations • 30 Mar 2017 • Katharina Eggensperger, Marius Lindauer, Holger H. Hoos, Frank Hutter, Kevin Leyton-Brown
In our experiments, we construct and evaluate surrogate benchmarks for hyperparameter optimization as well as for AC problems that involve performance optimization of solvers for hard combinatorial problems, drawing training data from the runs of existing AC procedures.
no code implementations • 30 Dec 2016 • Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy
We are in the middle of a remarkable rise in the use and capability of artificial intelligence.
no code implementations • NeurIPS 2016 • Jason S. Hartford, James R. Wright, Kevin Leyton-Brown
Predicting the behavior of human participants in strategic settings is an important problem in many domains.
2 code implementations • 8 Jun 2015 • Bernd Bischl, Pascal Kerschke, Lars Kotthoff, Marius Lindauer, Yuri Malitsky, Alexandre Frechette, Holger Hoos, Frank Hutter, Kevin Leyton-Brown, Kevin Tierney, Joaquin Vanschoren
To address this problem, we introduce a standardized format for representing algorithm selection scenarios and a repository that contains a growing number of data sets from the literature.
no code implementations • 5 May 2015 • Frank Hutter, Marius Lindauer, Adrian Balint, Sam Bayless, Holger Hoos, Kevin Leyton-Brown
It is well known that different solution strategies work well for different types of instances of hard combinatorial problems.
no code implementations • 4 Aug 2014 • David R. M Thompson, Kevin Leyton-Brown
After experimentation with other designs, the major search engines converged on the weighted, generalized second-price auction (wGSP) for selling keyword advertisements.
no code implementations • 31 Jan 2014 • Erik Zawadzki, Asher Lipson, Kevin Leyton-Brown
Most such claims in the literature are based on small experiments, which has hampered understanding as well as the development of new multiagent learning (MAL) algorithms.
no code implementations • 15 Jan 2014 • Frank Hutter, Thomas Stuetzle, Kevin Leyton-Brown, Holger H. Hoos
The identification of performance-optimizing parameter settings is an important part of the development and application of algorithms.
no code implementations • 7 Oct 2013 • Frank Hutter, Holger Hoos, Kevin Leyton-Brown
Bayesian optimization (BO) aims to minimize a given blackbox function using a model that is updated whenever new evidence about the function becomes available.
no code implementations • 5 Nov 2012 • Frank Hutter, Lin Xu, Holger H. Hoos, Kevin Leyton-Brown
We also comprehensively describe new and existing features for predicting algorithm runtime for propositional satisfiability (SAT), travelling salesperson (TSP) and mixed integer programming (MIP) problems.
1 code implementation • 18 Aug 2012 • Chris Thornton, Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown
Many different machine learning algorithms exist; taking into account each algorithm's hyperparameters, there is a staggeringly large number of possible alternatives overall.
1 code implementation • LION 2011 2011 • Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown
State-of-the-art algorithms for hard computational problems often expose many parameters that can be modified to improve empirical performance.
no code implementations • NeurIPS 2010 • Albert X. Jiang, Kevin Leyton-Brown
Games of incomplete information, or Bayesian games, are an important game-theoretic model and have many applications in economics.