Search Results for author: Katherine Metcalf

Found 9 papers, 1 papers with code

Hindsight PRIORs for Reward Learning from Human Preferences

no code implementations12 Apr 2024 Mudit Verma, Katherine Metcalf

Incorporating state importance into reward learning improves the speed of policy learning, overall policy performance, and reward recovery on both locomotion and manipulation tasks.

Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards

1 code implementation28 Feb 2024 Katherine Metcalf, Miguel Sarabia, Natalie Mackraz, Barry-John Theobald

Preference-based reinforcement learning (PbRL) aligns a robot behavior with human preferences via a reward function learned from binary feedback over agent behaviors.

reinforcement-learning

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning

no code implementations12 Nov 2022 Katherine Metcalf, Miguel Sarabia, Barry-John Theobald

In this work, we demonstrate that encoding environment dynamics in the reward function (REED) dramatically reduces the number of preference labels required in state-of-the-art preference-based RL frameworks.

reinforcement-learning Reinforcement Learning (RL)

Symbol Guided Hindsight Priors for Reward Learning from Human Preferences

no code implementations17 Oct 2022 Mudit Verma, Katherine Metcalf

Specifying rewards for reinforcement learned (RL) agents is challenging.

On the role of Lip Articulation in Visual Speech Perception

no code implementations18 Mar 2022 Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald

Previous research has shown that traditional metrics used to optimize and assess models for generating lip motion from speech are not a good indicator of subjective opinion of animation quality.

FedEmbed: Personalized Private Federated Learning

no code implementations18 Feb 2022 Andrew Silva, Katherine Metcalf, Nicholas Apostoloff, Barry-John Theobald

Federated learning enables the deployment of machine learning to problems for which centralized data collection is impractical.

Federated Learning

Mirroring to Build Trust in Digital Assistants

no code implementations2 Apr 2019 Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff

We describe experiments towards building a conversational digital assistant that considers the preferred conversational style of the user.

Learning Sharing Behaviors with Arbitrary Numbers of Agents

no code implementations10 Dec 2018 Katherine Metcalf, Barry-John Theobald, Nicholas Apostoloff

We model the individual behavior for each agent in an interaction and then use a multi-agent fusion model to generate a summary over the expected actions of the group to render the model independent of the number of agents.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.