1 code implementation • 26 Sep 2022 • Hardik Parwana, Dimitra Panagou
Under state and control input constraints, the state prediction is subsequently used in tandem with a proposed variant of constrained gradient-descent for online update of policy parameters in a receding horizon fashion.