no code implementations • 17 Jun 2023 • YingYing Li, Tianpeng Zhang, Subhro Das, Jeff Shamma, Na Li
This paper considers a single-trajectory system identification problem for linear systems under general nonlinear and/or time-varying policies with i.i.d.
1 code implementation • 10 Mar 2023 • Haitong Ma, Tianpeng Zhang, Yixuan Wu, Flavio P. Calmon, Na Li
We focus on Entropy Search (ES), a sample-efficient BO algorithm that selects queries to maximize the mutual information about the maximum of the black-box function.
1 code implementation • 20 Sep 2022 • Tianpeng Zhang, Kasper Johansson, Na Li
The graph defines the agent's freedom in selecting the next available nodes at each step.