no code implementations • 29 Nov 2015 • Frederik Ruelens, Bert Claessens, Salman Quaiyum, Bart De Schutter, Robert Babuska, Ronnie Belmans
A wellknown batch reinforcement learning technique, fitted Q-iteration, is used to find a control policy, given this feature representation.