Search Results for author: Krishna Jagannathan

Found 15 papers, 5 papers with code

A Survey of Risk-Aware Multi-Armed Bandits

no code implementations12 May 2022 Vincent Y. F. Tan, Prashanth L. A., Krishna Jagannathan

In several applications such as clinical trials and financial portfolio optimization, the expected value (or the average reward) does not satisfactorily capture the merits of a drug or a portfolio.

Multi-Armed Bandits Portfolio Optimization

Online Estimation and Optimization of Utility-Based Shortfall Risk

1 code implementation16 Nov 2021 Vishwajit Hegde, Arvind S. Menon, L. A. Prashanth, Krishna Jagannathan

We derive non-asymptotic bounds on the estimation error in the number of samples.

Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits

no code implementations28 Aug 2020 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

In this paper, we show that specialized algorithms that exploit such parametric information are prone to inconsistent learning performance when the parameter is misspecified.

Multi-Armed Bandits

Bandit algorithms: Letting go of logarithmic regret for statistical robustness

1 code implementation22 Jun 2020 Kumar Ashutosh, Jayakrishnan Nair, Anmol Kagrecha, Krishna Jagannathan

We study regret minimization in a stochastic multi-armed bandit setting and establish a fundamental trade-off between the regret suffered under an algorithm, and its statistical robustness.

Constrained regret minimization for multi-criterion multi-armed bandits

1 code implementation17 Jun 2020 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We consider a stochastic multi-armed bandit setting and study the problem of constrained regret minimization over a given time horizon.

Attribute Multi-Armed Bandits +1

LSTM-Based Anomaly Detection: Detection Rules from Extreme Value Theory

no code implementations13 Sep 2019 Neema Davis, Gaurav Raina, Krishna Jagannathan

In this paper, we explore various statistical techniques for anomaly detection in conjunction with the popular Long Short-Term Memory (LSTM) deep learning model for transportation networks.

Anomaly Detection

Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards

1 code implementation NeurIPS 2019 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We also compare the error bounds for our distribution oblivious algorithms with those corresponding to standard non-oblivious algorithms.

Multi-Armed Bandits

Grids versus Graphs: Partitioning Space for Improved Taxi Demand-Supply Forecasts

no code implementations18 Feb 2019 Neema Davis, Gaurav Raina, Krishna Jagannathan

To explore the Voronoi tessellation scheme, we propose the use of GraphLSTM (Graph-based LSTM), by representing the Voronoi spatial partitions as nodes on an arbitrarily structured graph.

Ensemble Learning

Taxi Demand-Supply Forecasting: Impact of Spatial Partitioning on the Performance of Neural Networks

1 code implementation10 Dec 2018 Neema Davis, Gaurav Raina, Krishna Jagannathan

We find that the LSTM model based on input features extracted from a variable-sized polygon tessellation yields superior performance over the LSTM model based on fixed-sized grid tessellation.

Concentration bounds for empirical conditional value-at-risk: The unbounded case

no code implementations6 Aug 2018 Ravi Kumar Kolla, Prashanth L. A., Sanjay P. Bhat, Krishna Jagannathan

In several real-world applications involving decision making under uncertainty, the traditional expected value objective may not be suitable, as it may be necessary to control losses in the case of a rare but extreme event.

Decision Making Decision Making Under Uncertainty

Taxi demand forecasting: A HEDGE based tessellation strategy for improved accuracy

no code implementations17 May 2018 Neema Davis, Gaurav Raina, Krishna Jagannathan

We show that the hybrid tessellation strategy performs consistently better than either of the two strategies across the data sets considered, at multiple time scales, and with different performance metrics.

Time Series Analysis

Bandit algorithms to emulate human decision making using probabilistic distortions

no code implementations30 Nov 2016 Ravi Kumar Kolla, Prashanth L. A., Aditya Gopalan, Krishna Jagannathan, Michael Fu, Steve Marcus

For the $K$-armed bandit setting, we derive an upper bound on the expected regret for our proposed algorithm, and then we prove a matching lower bound to establish the order-optimality of our algorithm.

Decision Making Multi-Armed Bandits

Collaborative Learning of Stochastic Bandits over a Social Network

no code implementations29 Feb 2016 Ravi Kumar Kolla, Krishna Jagannathan, Aditya Gopalan

A key finding of this paper is that natural extensions of widely-studied single agent learning policies to the network setting need not perform well in terms of regret.

Cannot find the paper you are looking for? You can Submit a new open access paper.