Search Results for author: Nicholas Waytowich

Found 19 papers, 2 papers with code

Scalable Interactive Machine Learning for Future Command and Control

no code implementations9 Feb 2024 Anna Madison, Ellen Novoseller, Vinicius G. Goecks, Benjamin T. Files, Nicholas Waytowich, Alfred Yu, Vernon J. Lawhern, Steven Thurman, Christopher Kelshaw, Kaleb McDowell

Future warfare will require Command and Control (C2) personnel to make decisions at shrinking timescales in complex and potentially ill-defined situations.

Decision Making

Rating-based Reinforcement Learning

no code implementations30 Jul 2023 Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao

This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning.

reinforcement-learning

DIP-RL: Demonstration-Inferred Preference Learning in Minecraft

no code implementations22 Jul 2023 Ellen Novoseller, Vinicius G. Goecks, David Watkins, Josh Miller, Nicholas Waytowich

In machine learning for sequential decision-making, an algorithmic agent learns to interact with an environment while receiving feedback in the form of a reward signal.

Decision Making reinforcement-learning +1

Towards an Interpretable Hierarchical Agent Framework using Semantic Goals

no code implementations16 Oct 2022 Bharat Prakash, Nicholas Waytowich, Tim Oates, Tinoosh Mohsenin

Learning to solve long horizon temporally extended tasks with reinforcement learning has been a challenge for several years now.

reinforcement-learning Reinforcement Learning (RL)

Multiple View Performers for Shape Completion

no code implementations13 Sep 2022 David Watkins-Valls, Peter Allen, Krzysztof Choromanski, Jacob Varley, Nicholas Waytowich

We propose the Multiple View Performer (MVP) - a new architecture for 3D shape completion from a series of temporally sequential views.

Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II

no code implementations11 May 2022 Nicholas Waytowich, James Hare, Vinicius G. Goecks, Mark Mittrick, John Richardson, Anjon Basak, Derrik E. Asher

Traditionally, learning from human demonstrations via direct behavior cloning can lead to high-performance policies given that the algorithm has access to large amounts of high-quality data covering the most likely scenarios to be encountered when the agent is operating.

reinforcement-learning Reinforcement Learning (RL) +2

Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft

1 code implementation7 Dec 2021 Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat Prakash

In this work, we present the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge: Learning from Human Feedback in Minecraft, which challenged participants to use human data to solve four tasks defined only by a natural language description and no reward function.

Imitation Learning

Automatic Goal Generation using Dynamical Distance Learning

no code implementations7 Nov 2021 Bharat Prakash, Nicholas Waytowich, Tinoosh Mohsenin, Tim Oates

In this work, we propose a method for automatic goal generation using a dynamical distance function (DDF) in a self-supervised fashion.

Decision Making Reinforcement Learning (RL)

On games and simulators as a platform for development of artificial intelligence for command and control

no code implementations21 Oct 2021 Vinicius G. Goecks, Nicholas Waytowich, Derrik E. Asher, Song Jun Park, Mark Mittrick, John Richardson, Manuel Vindiola, Anne Logie, Mark Dennison, Theron Trout, Priya Narayanan, Alexander Kott

Games and simulators can be a valuable platform to execute complex multi-agent, multiplayer, imperfect information scenarios with significant parallels to military applications: multiple participants manage resources and make decisions that command assets to secure specific areas of a map or neutralize opposing forces.

Starcraft Starcraft II

A Narration-based Reward Shaping Approach using Grounded Natural Language Commands

no code implementations31 Oct 2019 Nicholas Waytowich, Sean L. Barton, Vernon Lawhern, Garrett Warnell

While this problem can be addressed through reward shaping, such approaches typically require a human expert with specialized knowledge.

Starcraft Starcraft II

Learning from Observations Using a Single Video Demonstration and Human Feedback

no code implementations29 Sep 2019 Sunil Gandhi, Tim Oates, Tinoosh Mohsenin, Nicholas Waytowich

In this paper, we present a method for learning from video demonstrations by using human feedback to construct a mapping between the standard representation of the agent and the visual representation of the demonstration.

Learning Your Way Without Map or Compass: Panoramic Target Driven Visual Navigation

no code implementations20 Sep 2019 David Watkins-Valls, Jingxi Xu, Nicholas Waytowich, Peter Allen

We present a robot navigation system that uses an imitation learning framework to successfully navigate in complex environments.

Imitation Learning Navigate +3

Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

no code implementations24 Apr 2019 Nicholas Waytowich, Sean L. Barton, Vernon Lawhern, Ethan Stump, Garrett Warnell

While deep reinforcement learning techniques have led to agents that are successfully able to learn to perform a number of tasks that had been previously unlearnable, these techniques are still susceptible to the longstanding problem of {\em reward sparsity}.

Reinforcement Learning (RL) Starcraft +1

Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces

2 code implementations28 Sep 2017 Garrett Warnell, Nicholas Waytowich, Vernon Lawhern, Peter Stone

While recent advances in deep reinforcement learning have allowed autonomous learning agents to succeed at a variety of complex tasks, existing algorithms generally require a lot of training data.

reinforcement-learning Reinforcement Learning (RL) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.