Search Results for author: Avinandan Bose

Found 8 papers, 1 papers with code

Offline Multi-task Transfer RL with Representational Penalization

no code implementations19 Feb 2024 Avinandan Bose, Simon Shaolei Du, Maryam Fazel

We study the problem of representation transfer in offline Reinforcement Learning (RL), where a learner has access to episodic data from a number of source tasks collected a priori, and aims to learn a shared representation to be used in finding a good policy for a target task.

Offline RL Reinforcement Learning (RL)

Initializing Services in Interactive ML Systems for Diverse Users

no code implementations19 Dec 2023 Avinandan Bose, Mihaela Curmei, Daniel L. Jiang, Jamie Morgenstern, Sarah Dean, Lillian J. Ratliff, Maryam Fazel

(ii) Suboptimal Local Solutions: The total loss (sum of loss functions across all users and all services) landscape is not convex even if the individual losses on a single service are convex, making it likely for the learning dynamics to get stuck in local minima.

Scalable Distributional Robustness in a Class of Non Convex Optimization with Guarantees

no code implementations31 May 2022 Avinandan Bose, Arunesh Sinha, Tien Mai

Distributionally robust optimization (DRO) has shown lot of promise in providing robustness in learning as well as sample based optimization problems.

Decision Making

Conditional Expectation based Value Decomposition for Scalable On-Demand Ride Pooling

no code implementations1 Dec 2021 Avinandan Bose, Pradeep Varakantham

Owing to the benefits for customers (lower prices), drivers (higher revenues), aggregation companies (higher revenues) and the environment (fewer vehicles), on-demand ride pooling (e. g., Uber pool, Grab Share) has become quite popular.

Decision Making

Changepoint Analysis of Topic Proportions in Temporal Text Data

no code implementations29 Nov 2021 Avinandan Bose, Soumendu Sundar Mukherjee

Changepoint analysis deals with unsupervised detection and/or estimation of time-points in time-series data, when the distribution generating the data changes.

Time Series Time Series Analysis

NeurInt : Learning to Interpolate through Neural ODEs

no code implementations7 Nov 2021 Avinandan Bose, Aniket Das, Yatin Dandi, Piyush Rai

In this work, we propose a novel generative model that learns a flexible non-parametric prior over interpolation trajectories, conditioned on a pair of source and target images.

Image Generation

NeurInt-Learning Interpolation by Neural ODEs

no code implementations NeurIPS Workshop DLDE 2021 Avinandan Bose, Aniket Das, Yatin Dandi, Piyush Rai

A range of applications require learning image generation models whose latent space effectively captures the high-level factors of variation in the data distribution, which can be judged by its ability to interpolate between images smoothly.

Image Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.