no code implementations • 20 Sep 2020 • Bhaskar Ramasubramanian, Baicen Xiao, Linda Bushnell, Radha Poovendran
We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation.
1 code implementation • 24 Jul 2020 • Shana Moothedath, Dinuka Sahabandu, Joey Allen, Linda Bushnell, Wenke Lee, Radha Poovendran
Our game model has imperfect information because the players cannot observe the actions of their opponent.
Computer Science and Game Theory • Cryptography and Security
no code implementations • 19 Jan 2020 • Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran
The output of the feedback neural network is converted to a shaping reward that is added to the reward provided by the environment.
no code implementations • 20 Jul 2019 • Baicen Xiao, Bhaskar Ramasubramanian, Andrew Clark, Hannaneh Hajishirzi, Linda Bushnell, Radha Poovendran
This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies.
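As a rough illustration of augmenting a reward with a potential function, the sketch below uses the standard potential-based shaping term F(s, s') = γΦ(s') − Φ(s). The snippet above does not give the paper's exact construction, so this form is an assumption. The names `shaped_reward` and `distance_potential`, and the 1-D chain with its goal at state 4, are hypothetical and chosen only for the example.

```python
from typing import Callable, Hashable

State = Hashable


def shaped_reward(
    env_reward: float,
    state: State,
    next_state: State,
    potential: Callable[[State], float],
    gamma: float = 0.99,
) -> float:
    """Return the environment reward plus the shaping term
    F(s, s') = gamma * potential(s') - potential(s)."""
    shaping = gamma * potential(next_state) - potential(state)
    return env_reward + shaping


# Hypothetical potential for a 1-D chain: states closer to the goal score higher.
GOAL = 4


def distance_potential(state: int) -> float:
    return -float(abs(GOAL - state))


# Moving from state 2 to state 3 (one step closer to the goal) earns a bonus
# even though the environment reward for this step is zero.
bonus = shaped_reward(0.0, 2, 3, distance_potential, gamma=1.0)
```

With `gamma=1.0` the shaping terms telescope along a trajectory, so their sum depends only on the potentials of the first and last states. This is why shaping of this form can guide learning without changing which policies are optimal.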