Search Results for author: Vineet Goyal

Found 10 papers, 0 papers with code

Last Switch Dependent Bandits with Monotone Payoff Functions

no code implementations1 Jun 2023 Ayoub Foussoul, Vineet Goyal, Orestis Papadigenopoulos, Assaf Zeevi

In a recent work, Laforgue et al. introduce the model of last switch dependent (LSD) bandits, in an attempt to capture nonstationary phenomena induced by the interaction between the player and the environment.

MNL-Bandit in non-stationary environments

no code implementations4 Mar 2023 Ayoub Foussoul, Vineet Goyal, Varun Gupta

In this paper, we study the MNL-Bandit problem in a non-stationary environment and present an algorithm with a worst-case expected regret of $\tilde{O}\left( \min \left\{ \sqrt{NTL}\;,\; N^{\frac{1}{3}}(\Delta_{\infty}^{K})^{\frac{1}{3}} T^{\frac{2}{3}} + \sqrt{NT}\right\}\right)$.

Interpretable Machine Learning for Resource Allocation with Application to Ventilator Triage

no code implementations21 Oct 2021 Julien Grand-Clément, Carri Chan, Vineet Goyal, Elizabeth Chuang

We propose a novel data-driven model to compute interpretable triage guidelines based on policies for Markov Decision Process that can be represented as simple sequences of decision trees ("tree policies").

BIG-bench Machine Learning Interpretable Machine Learning

Dynamic pricing and assortment under a contextual MNL demand

no code implementations19 Oct 2021 Vineet Goyal, Noemie Perivier

We consider dynamic multi-product pricing and assortment problems under an unknown demand over T periods, where in each period, the seller decides on the price for each product or the assortment of products to offer to a customer who chooses according to an unknown Multinomial Logit Model (MNL).

Multi-Armed Bandits

MNL-Bandit with Knapsacks: a near-optimal algorithm

no code implementations2 Jun 2021 Abdellah Aznag, Vineet Goyal, Noemie Perivier

The goal of the seller is to maximize the total expected revenue from the $T$ customers given the fixed initial inventory of $N$ products.

Robust Policies For Proactive ICU Transfers

no code implementations14 Feb 2020 Julien Grand-Clement, Carri W. Chan, Vineet Goyal, Gabriel Escobar

In this work, we study the problem of finding \emph{robust} patient transfer policies which account for uncertainty in statistical estimates due to data limitations when optimizing to improve overall patient care.

A Generalized Markov Chain Model to Capture Dynamic Preferences and Choice Overload

no code implementations15 Nov 2019 Kumar Goutam, Vineet Goyal, Agathe Soret

Assortment optimization is an important problem that arises in many industries such as retailing and online advertising where the goal is to find a subset of products from a universe of substitutable products which maximize seller's expected revenue.

MNL-Bandit: A Dynamic Learning Approach to Assortment Selection

no code implementations13 Jun 2017 Shipra Agrawal, Vashist Avadhanula, Vineet Goyal, Assaf Zeevi

The retailer observes this choice and the objective is to dynamically learn the model parameters, while optimizing cumulative revenues over a selling horizon of length $T$.

Thompson Sampling for the MNL-Bandit

no code implementations3 Jun 2017 Shipra Agrawal, Vashist Avadhanula, Vineet Goyal, Assaf Zeevi

We consider a sequential subset selection problem under parameter uncertainty, where at each time step, the decision maker selects a subset of cardinality $K$ from $N$ possible items (arms), and observes a (bandit) feedback in the form of the index of one of the items in said subset, or none.

Thompson Sampling

Assortment Optimization Under the Mallows model

no code implementations NeurIPS 2016 Antoine Desir, Vineet Goyal, Srikanth Jagabathula, Danny Segev

We consider the assortment optimization problem when customer preferences follow a mixture of Mallows distributions.

Cannot find the paper you are looking for? You can Submit a new open access paper.