1 code implementation • 3 Oct 2023 • W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth, Anca Dragan, Peter Stone, Scott Niekum
Most recent work assumes that human preferences are generated based only upon the reward accrued within those segments, or their partial return.
no code implementations • 22 Oct 2022 • Sigurdur Orn Adalgeirsson, Cynthia Breazeal
Partially Observable Markov Decision Processes (POMDPs) offer a promising world representation for autonomous agents, as they can model both transitional and perceptual uncertainties.