no code implementations • 4 Feb 2023 • Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf, Siddartha Sen, Mohammad Alizadeh
We present Locally Constrained Policy Optimization (LCPO), an online reinforcement learning (RL) approach that combats catastrophic forgetting (CF) by anchoring policy outputs on old experiences while optimizing the return on current experiences.
no code implementations • 14 Jan 2022 • Pouya Hamadanian, Malte Schwarzkopf, Siddartha Sen, Mohammad Alizadeh
Such agents must explore and learn new environments, without hurting the system's performance, and remember them over time.
1 code implementation • 5 Jan 2022 • Abdullah Alomar, Pouya Hamadanian, Arash Nasr-Esfahany, Anish Agarwal, Mohammad Alizadeh, Devavrat Shah
Key to CausalSim is mapping unbiased trace-driven simulation to a tensor completion problem with extremely sparse observations.
1 code implementation • ICCV 2021 • Mehrdad Khani, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh
Real-time video inference on edge devices like mobile phones and drones is challenging due to the high computation cost of Deep Neural Networks.