no code implementations • 15 Jun 2023 • Abishek Sankararaman, Balakrishnan, Narayanaswamy
We derive guarantees on worst-case, finite-sample false-positive rate (FPR) over the family of all distributions with bounded second moment.
no code implementations • 15 Jan 2019 • Yifei Ma, Yu-Xiang Wang, Balakrishnan, Narayanaswamy
To solve both problems, we show how one can use policy improvement (PIL) objectives, regularized by policy imitation (IML).