no code implementations • 11 Jan 2024 • Sihan Zeng, Youngdae Kim, Yuxuan Ren, Kibaek Kim
At the heart of power system operations, alternating current optimal power flow (ACOPF) studies the generation of electric power in the most economical way under network-wide load requirement, and can be formulated as a highly structured non-convex quadratically constrained quadratic program (QCQP).
no code implementations • 18 Nov 2023 • Sihan Zeng, Sujay Bhatt, Eleonora Kreacic, Parisa Hassanzadeh, Alec Koppel, Sumitra Ganesh
We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks.
no code implementations • 10 Jan 2023 • Parisa Hassanzadeh, Eleonora Kreacic, Sihan Zeng, Yuchen Xiao, Sumitra Ganesh
We propose a new algorithm, SAFFE, that makes fair allocations with respect to the entire demands revealed over the horizon by accounting for expected future demands at each arrival time.
no code implementations • 27 May 2022 • Sihan Zeng, Thinh T. Doan, Justin Romberg
We study the problem of finding the Nash equilibrium in a two-player zero-sum Markov game.
no code implementations • 22 Oct 2021 • Sihan Zeng, Alyssa Kody, Youngdae Kim, Kibaek Kim, Daniel K. Molzahn
We train our RL policy using deep Q-learning, and show that this policy can result in significantly accelerated convergence (up to a 59% reduction in the number of iterations compared to existing, curvature-informed penalty parameter selection methods).
no code implementations • 21 Oct 2021 • Sihan Zeng, Thinh T. Doan, Justin Romberg
To solve this constrained optimization program, we study an online actor-critic variant of a classic primal-dual method where the gradients of both the primal and dual functions are estimated using samples from a single trajectory generated by the underlying time-varying Markov processes.
no code implementations • 29 Sep 2021 • Sihan Zeng, Thinh T. Doan, Justin Romberg
In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution.
no code implementations • 28 Oct 2020 • Sihan Zeng, Thinh T. Doan, Justin Romberg
We study a decentralized variant of stochastic approximation, a data-driven approach for finding the root of an operator under noisy measurements.
no code implementations • 8 Jun 2020 • Sihan Zeng, Aqeel Anwar, Thinh Doan, Arijit Raychowdhury, Justin Romberg
We develop a mathematical framework for solving multi-task reinforcement learning (MTRL) problems based on a type of policy gradient method.
1 code implementation • 19 Feb 2019 • Shaojie Xu, Sihan Zeng, Justin Romberg
Deep learning models have significantly improved the visual quality and accuracy on compressive sensing recovery.