no code implementations • pproximateinference AABI Symposium 2022 • Aishwarya Mandyam, Didong Li, Diana Cai, Andrew Jones, Barbara Engelhardt
Inverse reinforcement learning (IRL) methods attempt to recover the reward function of an agent by observing its behavior.
no code implementations • 6 Oct 2021 • Aishwarya Mandyam, Andrew Jones, Jiayu Yao, Krzysztof Laudanski, Barbara Engelhardt
CFQI uses a compositional $Q$-value function with separate modules for each task variant, allowing it to take advantage of shared knowledge while learning distinct policies for each variant.
no code implementations • 29 Sep 2021 • Aishwarya Mandyam, Andrew Jones, Krzysztof Laudanski, Barbara Engelhardt
Off-policy reinforcement learning (RL) has proven to be a powerful framework for guiding agents' actions in environments with stochastic rewards and unknown or noisy state dynamics.
2 code implementations • 14 Dec 2020 • Didong Li, Andrew Jones, Barbara Engelhardt
Recently, contrastive principal component analysis (CPCA) was proposed for this setting.
1 code implementation • 9 Dec 2016 • Gautam Sabnis, Debdeep Pati, Barbara Engelhardt, Natesh Pillai
Our approach is novel in this regard: it includes all of the $n$ samples in each subproblem and, instead, splits the dimension $p$ into smaller subsets for each subproblem.
Methodology
no code implementations • 19 Oct 2015 • Mehmet Emin Basbug, Barbara Engelhardt
We compare AdaCluster with EM for a Gaussian mixture model on synthetic data and nine UCI data sets.