Search Results for author: Jiashuo Jiang

Found 8 papers, 0 papers with code

Achieving $\tilde{O}(1/ε)$ Sample Complexity for Constrained Markov Decision Process

no code implementations26 Feb 2024 Jiashuo Jiang, Yinyu Ye

To be specific, our algorithm operates in the primal space and we resolve the primal LP for the CMDP problem at each period in an online manner, with \textit{adaptive} remaining resource capacities.

Decision Making

Constrained Online Two-stage Stochastic Optimization: Algorithm with (and without) Predictions

no code implementations2 Jan 2024 Piao Hu, Jiashuo Jiang, Guodong Lyu, Hao Su

When the model parameters are drawn from unknown non-stationary distributions and we are given machine-learned predictions of the distributions, we develop a new algorithm from our framework with a regret $O(W_T+\sqrt{T})$, where $W_T$ measures the total inaccuracy of the machine-learned predictions.

Stochastic Optimization

High-dimensional Linear Bandits with Knapsacks

no code implementations2 Nov 2023 Wanteng Ma, Dong Xia, Jiashuo Jiang

We study the contextual bandits with knapsack (CBwK) problem under the high-dimensional setting where the dimension of the feature is large.

Multi-Armed Bandits

Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning

no code implementations2 Feb 2023 Jiashuo Jiang

At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage action from a feasible set that depends both on the first-stage decision and the model parameter.

Stochastic Optimization Vocal Bursts Valence Prediction

Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions

no code implementations14 Oct 2022 Jiashuo Jiang, Will Ma, Jiawei Zhang

We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and $T$ IID arrivals.

Management

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

no code implementations10 Jul 2022 Boxiao Chen, Jiashuo Jiang, Jiawei Zhang, Zhengyuan Zhou

We aim to minimize the $T$-period cost, a problem that is known to be computationally intractable even under known distributions of demand and supply.

Non-stationary Bandits with Knapsacks

no code implementations25 May 2022 Shang Liu, Jiashuo Jiang, Xiaocheng Li

Finally, we also extend the non-stationarity measure to the problem of online convex optimization with constraints and obtain new regret bounds accordingly.

Online Stochastic Optimization with Wasserstein Based Non-stationarity

no code implementations13 Dec 2020 Jiashuo Jiang, Xiaocheng Li, Jiawei Zhang

We propose a unified Wasserstein-distance based measure to quantify the inaccuracy of the prior estimate in setting (i) and the non-stationarity of the system in setting (ii).

Management Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.