1 code implementation • 23 May 2023 • Anmol Kabra, Ethan R. Elenberg
Large, general purpose language models have demonstrated impressive performance across many different conversational domains.
1 code implementation • 28 Dec 2021 • Gene Li, Junbo Li, Anmol Kabra, Nathan Srebro, Zhaoran Wang, Zhuoran Yang
We propose an optimistic model-based algorithm, dubbed SMRL, for finite-horizon episodic reinforcement learning (RL) when the transition model is specified by exponential family distributions with $d$ parameters and the reward is bounded and known.