no code implementations • 1 Mar 2022 • Max Mowbray, Dongda Zhang, Ehecatl Antonio del Rio Chanona
In this work, we present a RL methodology tailored to efficiently address production scheduling problems in the presence of uncertainty.
1 code implementation • 23 Apr 2021 • Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del Río Chanona, Dongda Zhang
Specifically, we propose a data-driven approach that utilizes Gaussian processes for the offline simulation model and use the associated posterior uncertainty prediction to account for joint chance constraints and plant-model mismatch.
no code implementations • 16 Nov 2020 • Elton Pan, Panagiotis Petsagkourakis, Max Mowbray, Dongda Zhang, Antonio del Rio-Chanona
We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with a high probability, which is crucial for safety critical tasks.
no code implementations • 30 Jul 2020 • Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Federico Galvanin, Dongda Zhang, Ehecatl Antonio del Rio-Chanona
We propose a chance constrained policy optimization (CCPO) algorithm which guarantees the satisfaction of joint chance constraints with a high probability - which is crucial for safety critical tasks.
no code implementations • 4 Jun 2020 • Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Dongda Zhang, Ehecatl Antonio del Río Chanona
We use chance constraints to guarantee the probabilistic satisfaction of process constraints, which is accomplished by introducing backoffs, such that the optimal policy and backoffs are computed simultaneously.
2 code implementations • 15 Apr 2019 • Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Dongda Zhang, Ehecatl Antonio del Rio Chanona
In this work, we applied the Policy Gradient method from batch-to-batch to update a control policy parametrized by a recurrent neural network.
Optimization and Control Systems and Control