no code implementations • 1 Apr 2024 • Pignge Hu, Xiaoteng Zhang, Mengmeng Li, Yingjie Zhu, Li Shi
The avian visual system, a source of inspiration from nature, can process motion information in a wide range of complex aerial scenes, and its Retina-OT-Rt visual circuit is highly sensitive to the motion of small objects viewed from high altitudes.
no code implementations • 13 Nov 2023 • Wouter Jongeneel, Mengmeng Li, Daniel Kuhn
Motivated by policy gradient methods in the context of reinforcement learning, we derive the first large deviation rate function for the iterates generated by stochastic gradient descent for possibly non-convex objectives satisfying a Polyak-Łojasiewicz condition.
no code implementations • 30 May 2023 • Mengmeng Li, Daniel Kuhn, Tobias Sutter
We propose policy gradient algorithms for robust infinite-horizon Markov decision processes (MDPs) with non-rectangular uncertainty sets, thereby addressing an open challenge in the robust MDP literature.
1 code implementation • 12 Jun 2021 • Mengmeng Li, Tobias Sutter, Daniel Kuhn
We study a stochastic program where the probability distribution of the uncertain problem parameters is unknown and only indirectly observed via finitely many correlated samples generated by an unknown Markov chain with $d$ states.