no code implementations • 29 Mar 2023 • Mitchell DeHaven, Stephen Scott
We also apply this pipeline to another fact verification dataset, Scifact, and achieve the highest label accuracy among all systems on that dataset as well.
1 code implementation • 2 Oct 2019 • Eleanor Quint, Dong Xu, Samuel Flint, Stephen Scott, Matthew Dwyer
In order to satisfy safety conditions, an agent may be constrained from acting freely.
no code implementations • 27 Sep 2018 • Dong Xu, Eleanor Quint, Zeynep Hakguder, Haluk Dogan, Stephen Scott, Matthew Dwyer
We study the problem of deep reinforcement learning where the agent's action sequences are constrained, e. g., prohibition of dithering or overactuating action sequences that might damage a robot, drone, or other physical device.
no code implementations • ICLR 2018 • Eleanor Quint, Garrett Wirka, Jacob Williams, Stephen Scott, N. V. Vinodchandran
As deep learning-based classifiers are increasingly adopted in real-world applications, the importance of understanding how a particular label is chosen grows.