Search Results for author: Maksym Lefarov

Found 2 papers, 0 papers with code

On-Policy Model Errors in Reinforcement Learning

no code implementations ICLR 2022 Lukas P. Fröhlich, Maksym Lefarov, Melanie N. Zeilinger, Felix Berkenkamp

In contrast, model-based methods can use the learned model to generate new data, but model errors and bias can render learning unstable or suboptimal.

reinforcement-learning Reinforcement Learning (RL)

SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

no code implementations4 Apr 2021 Joel Oren, Chana Ross, Maksym Lefarov, Felix Richter, Ayal Taitler, Zohar Feldman, Christian Daniel, Dotan Di Castro

This method can equally be applied to both the offline, as well as online, variants of the combinatorial problem, in which the problem components (e. g., jobs in scheduling problems) are not known in advance, but rather arrive during the decision-making process.

Combinatorial Optimization Decision Making +3

Cannot find the paper you are looking for? You can Submit a new open access paper.