Search Results for author: Dongda Zhang

Found 6 papers, 2 papers with code

Distributional Reinforcement Learning for Scheduling of Chemical Production Processes

no code implementations1 Mar 2022 Max Mowbray, Dongda Zhang, Ehecatl Antonio del Rio Chanona

In this work, we present a RL methodology tailored to efficiently address production scheduling problems in the presence of uncertainty.

Decision Making Distributional Reinforcement Learning +3

Safe Chance Constrained Reinforcement Learning for Batch Process Control

1 code implementation23 Apr 2021 Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del Río Chanona, Dongda Zhang

Specifically, we propose a data-driven approach that utilizes Gaussian processes for the offline simulation model and use the associated posterior uncertainty prediction to account for joint chance constraints and plant-model mismatch.

Gaussian Processes Model Predictive Control +2

Constrained Model-Free Reinforcement Learning for Process Optimization

no code implementations16 Nov 2020 Elton Pan, Panagiotis Petsagkourakis, Max Mowbray, Dongda Zhang, Antonio del Rio-Chanona

We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with a high probability, which is crucial for safety critical tasks.

Model Predictive Control Q-Learning +3

Chance Constrained Policy Optimization for Process Control and Optimization

no code implementations30 Jul 2020 Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Federico Galvanin, Dongda Zhang, Ehecatl Antonio del Rio-Chanona

We propose a chance constrained policy optimization (CCPO) algorithm which guarantees the satisfaction of joint chance constraints with a high probability - which is crucial for safety critical tasks.

Bayesian Optimization Chemical Process +2

Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty

no code implementations4 Jun 2020 Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Dongda Zhang, Ehecatl Antonio del Río Chanona

We use chance constraints to guarantee the probabilistic satisfaction of process constraints, which is accomplished by introducing backoffs, such that the optimal policy and backoffs are computed simultaneously.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning for Batch Bioprocess Optimization

2 code implementations15 Apr 2019 Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Dongda Zhang, Ehecatl Antonio del Rio Chanona

In this work, we applied the Policy Gradient method from batch-to-batch to update a control policy parametrized by a recurrent neural network.

Optimization and Control Systems and Control

Cannot find the paper you are looking for? You can Submit a new open access paper.