no code implementations • 20 Jan 2024 • Michael Gimelfarb, Ayal Taitler, Scott Sanner
To achieve such results, CGPO proposes a bi-level mixed-integer nonlinear optimization framework for optimizing policies within defined expressivity classes (i. e. piecewise (non)-linear) and reduces it to an optimal constraint generation methodology that adversarially generates worst-case state trajectories.
no code implementations • 29 May 2023 • Xiaocan Li, Ray Coden Mercurius, Ayal Taitler, Xiaoyu Wang, Mohammad Noaeen, Scott Sanner, Baher Abdulhai
Moreover, no existing studies have employed reinforcement learning for homogeneous flow rate optimization in microscopic simulation, where spatial characteristics, vehicle-level information, and metering realizations -- often overlooked in macroscopic simulations -- are taken into account.
2 code implementations • 11 Nov 2022 • Ayal Taitler, Michael Gimelfarb, Jihwan Jeong, Sriram Gopalakrishnan, Martin Mladenov, Xiaotian Liu, Scott Sanner
We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym environments from RDDL declerative description.
no code implementations • 4 Apr 2021 • Joel Oren, Chana Ross, Maksym Lefarov, Felix Richter, Ayal Taitler, Zohar Feldman, Christian Daniel, Dotan Di Castro
This method can equally be applied to both the offline, as well as online, variants of the combinatorial problem, in which the problem components (e. g., jobs in scheduling problems) are not known in advance, but rather arrive during the decision-making process.
no code implementations • 26 Feb 2017 • Ayal Taitler, Nahum Shimkin
We consider the task of learning control policies for a robotic mechanism striking a puck in an air hockey game.