Search Results for author: Spencer Frazier

Found 10 papers, 0 papers with code

Machine Learning Approaches for Principle Prediction in Naturally Occurring Stories

no code implementations19 Nov 2022 Md Sultan Al Nahian, Spencer Frazier, Brent Harrison, Mark Riedl

To do this, we extend a dataset that has been previously used to train a binary normative classifier with annotations of moral principles.

Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning

no code implementations14 Oct 2022 Louis Castricato, Alexander Havrilla, Shahbuland Matiana, Michael Pieler, Anbang Ye, Ian Yang, Spencer Frazier, Mark Riedl

However, simply fine-tuning a generative language model with a contrastive reward model does not always reliably result in a story generation system capable of generating stories that meet user preferences.

Contrastive Learning Language Modelling +4

Cut the CARP: Fishing for zero-shot story evaluation

no code implementations6 Oct 2021 Shahbuland Matiana, JR Smith, Ryan Teehan, Louis Castricato, Stella Biderman, Leo Gao, Spencer Frazier

Recent advances in large-scale language models (Raffel et al., 2019; Brown et al., 2020) have brought significant qualitative and quantitative improvements in machine-driven text generation.

Contrastive Learning Language Modelling +2

Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior

no code implementations19 Apr 2021 Md Sultan Al Nahian, Spencer Frazier, Brent Harrison, Mark Riedl

As more machine learning agents interact with humans, it is increasingly a prospect that an agent trained to perform a task optimally, using only a measure of task performance as feedback, can violate societal norms for acceptable behavior or cause harm.

reinforcement-learning Reinforcement Learning (RL)

Playing Text-Based Games with Common Sense

no code implementations4 Dec 2020 Sahith Dambekodi, Spencer Frazier, Prithviraj Ammanabrolu, Mark O. Riedl

We test our technique in the 9to05 game, which is an extreme version of a text based game that requires numerous interactions with common, everyday objects in common, everyday scenarios.

Common Sense Reasoning Language Modelling +1

Learning Norms from Stories: A Prior for Value Aligned Agents

no code implementations7 Dec 2019 Spencer Frazier, Md Sultan Al Nahian, Mark Riedl, Brent Harrison

Value alignment is a property of an intelligent agent indicating that it can only pursue goals and activities that are beneficial to humans.

Imitation Learning

Improving Deep Reinforcement Learning in Minecraft with Action Advice

no code implementations2 Aug 2019 Spencer Frazier, Mark Riedl

We hypothesize that interactive machine learning IML, wherein human teachers play a direct role in training through demonstrations, critique, or action advice, may alleviate agent susceptibility to aliasing.

BIG-bench Machine Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.