no code implementations • 13 Apr 2023 • Daniel Segal, Aharon Bar-Gill, Nahum Shimkin
Engine failure is a recurring emergency in General Aviation and fixed-wing UAVs, often requiring the pilot or remote operator to carry out carefully planned glides to safely reach a candidate landing strip.
no code implementations • 23 May 2021 • Nir Greshler, Ofir Gordon, Oren Salzman, Nahum Shimkin
We introduce the Cooperative Multi-Agent Path Finding (Co-MAPF) problem, an extension to the classical MAPF problem, where cooperative behavior is incorporated.
no code implementations • ICLR 2020 • Guy Adam, Tom Zahavy, Oron Anschel, Nahum Shimkin
Rather than using hand-design state representation, we use a state representation that is being learned directly from the data by a DQN agent.
1 code implementation • 8 Dec 2019 • Yair Shemer, Daniel Rotman, Nahum Shimkin
We consider shot-based video summarization where the summary consists of a subset of the video shots which can be of various lengths.
no code implementations • 26 Feb 2017 • Ayal Taitler, Nahum Shimkin
We consider the task of learning control policies for a robotic mechanism striking a puck in an air hockey game.
no code implementations • ICML 2017 • Oron Anschel, Nir Baram, Nahum Shimkin
Instability and variability of Deep Reinforcement Learning (DRL) algorithms tend to adversely affect their performance.
no code implementations • 23 Dec 2015 • Yahel David, Nahum Shimkin
Under the PAC framework, we provide a lower bound on the sample complexity of any $(\epsilon,\delta)$-correct algorithm, and propose an algorithm that attains this bound up to logarithmic factors.
no code implementations • 23 Aug 2015 • Yahel David, Nahum Shimkin
We consider the Max $K$-Armed Bandit problem, where a learning agent is faced with several sources (arms) of items (rewards), and interested in finding the best item overall.
no code implementations • 1 Mar 2015 • Nahum Shimkin
The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing {\em steering directions} as projections from the current average payoff vector to the (convex) target set.
no code implementations • 30 Dec 2013 • Andrey Bernstein, Nahum Shimkin
The first (primary) condition is a geometric separation condition, while the second (dual) condition requires that the set be {\em non-excludable}, namely that for every mixed action of the opponent there exists a mixed action of the agent (a {\em response}) such that the resulting payoff vector belongs to $S$.
no code implementations • NeurIPS 2010 • Andrey Bernstein, Shie Mannor, Nahum Shimkin
To our best knowledge, this is the first algorithm that addresses the problem of the average tp-rate maximization under average fp-rate constraints in the online setting.