no code implementations • 27 Feb 2024 • Siddhartha Banerjee, Alankrita Bhatt, Christina Lee Yu
We devise an online learning algorithm -- titled Switching via Monotone Adapted Regret Traces (SMART) -- that adapts to the data and achieves regret that is instance optimal, i. e., simultaneously competitive on every input sequence compared to the performance of the follow-the-leader (FTL) policy and the worst case guarantee of any other input policy.
1 code implementation • 30 Sep 2022 • Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu
How best to incorporate historical data to "warm start" bandit algorithms is an open question: naively initializing reward estimates using all historical samples can suffer from spurious data and imbalanced data coverage, leading to computational and storage issues $\unicode{x2014}$ particularly salient in continuous action spaces.
no code implementations • 29 Oct 2021 • Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu
In this paper we provide a unified theoretical analysis of tree-based hierarchical partitioning methods for online reinforcement learning, providing model-free and model-based algorithms.
2 code implementations • 10 Mar 2021 • Siddhartha Banerjee, Chamsi Hssaine, Noémie Périvier, Samitha Samaranayake
We study real-time routing policies in smart transit systems, where the platform has a combination of cars and high-capacity vehicles (e. g., buses or shuttles) and seeks to serve a set of incoming trip requests.
Optimization and Control
no code implementations • 5 Jan 2021 • Devleena Das, Siddhartha Banerjee, Sonia Chernova
In order for error explanations to be meaningful, we investigate what types of information within a set of hand-scripted explanations are most helpful to non-experts for failure and solution identification.
no code implementations • 18 Nov 2020 • Devleena Das, Siddhartha Banerjee, Sonia Chernova
With the growing capabilities of intelligent systems, the integration of artificial intelligence (AI) and robots in everyday life is increasing.
1 code implementation • NeurIPS 2020 • Sean R. Sinclair, Tianyu Wang, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu
We introduce the technique of adaptive discretization to design an efficient model-based episodic reinforcement learning algorithm in large (potentially continuous) state-action spaces.
Model-based Reinforcement Learning reinforcement-learning +1
1 code implementation • 17 Oct 2019 • Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu
We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces.
no code implementations • ACL 2019 • Siddhartha Banerjee, Cem Akkaya, Francisco Perez-Sorrosal, Kostas Tsioutsiouliklis
Compared to binary classifiers trained from scratch, our HTrans approach results in significant improvements of 1{\%} on micro-F1 and 3{\%} on macro-F1 on the RCV1 dataset.
no code implementations • 14 Jun 2019 • Alberto Vera, Siddhartha Banerjee, Itai Gurvich
We develop a framework for designing simple and efficient policies for a family of online allocation and pricing problems, that includes online packing, budget-constrained probing, dynamic pricing, and online contextual bandits with knapsacks.
1 code implementation • 15 Jan 2019 • Alberto Vera, Siddhartha Banerjee
We develop a new framework for designing online policies given access to an oracle providing statistical information about an offline benchmark.
no code implementations • 5 Oct 2016 • Koustav Rudra, Siddhartha Banerjee, Niloy Ganguly, Pawan Goyal, Muhammad Imran, Prasenjit Mitra
The use of microblogging platforms such as Twitter during crises has become widespread.
no code implementations • 22 Sep 2016 • Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama
The most informative and well-formed sub-graph obtained by integer linear programming (ILP) is selected to generate a one-sentence summary for each topic segment.
no code implementations • 22 Sep 2016 • Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama
The sentences in the most important document are aligned to sentences in other documents to generate clusters of similar sentences.
no code implementations • 22 Sep 2016 • Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama
Automatic summarization techniques on meeting conversations developed so far have been primarily extractive, resulting in poor summaries.
1 code implementation • 23 Feb 2016 • Siddharth Reddy, Igor Labutov, Siddhartha Banerjee, Thorsten Joachims
Second, we use this memory model to develop a stochastic model for spaced repetition systems.
no code implementations • NeurIPS 2015 • Siddhartha Banerjee, Peter Lofgren
We develop a new bidirectional algorithm for estimating Markov chain multi-step transition probabilities: given a Markov chain, we want to estimate the probability of hitting a given target state in $\ell$ steps after starting from a given source distribution.
1 code implementation • 21 Jul 2015 • Peter Lofgren, Siddhartha Banerjee, Ashish Goel
First, for the problem of estimating Personalized PageRank (PPR) from a source distribution to a target node, we present a new bidirectional estimator with simple yet strong guarantees on correctness and performance, and 3x to 8x speedup over existing estimators in experiments on a diverse set of networks.
no code implementations • 7 Nov 2014 • Siddhartha Banerjee, Sujay Sanghavi, Sanjay Shakkottai
We consider this problem under a simple natural model, wherein the number of items and the number of item-views are of the same order, and an `access-graph' constrains which user is allowed to see which item.
no code implementations • 13 Jul 2012 • Siddhartha Banerjee, Nidhi Hegde, Laurent Massoulié
In the information-rich regime, where each user rates at least a constant fraction of items, a spectral clustering approach is shown to achieve a sample-complexity lower bound derived from a simple information-theoretic argument based on Fano's inequality.