no code implementations • 7 Dec 2023 • Jarad Forristal, Niloofar Mireshghallah, Greg Durrett, Taylor Berg-Kirkpatrick
Recent work has shown that energy-based language modeling is an effective framework for controllable text generation because it enables flexible integration of arbitrary discriminators.
no code implementations • 13 Oct 2022 • Prasann Singhal, Jarad Forristal, Xi Ye, Greg Durrett
We address the task of predicting out-of-domain (OOD) performance in a few-shot fashion: given a few target-domain examples and a set of models with similar training performance, can we understand how these models will perform on OOD test data?
no code implementations • 19 Apr 2022 • Jarad Forristal, Joshua Griffin, Wenwen Zhou, Seyedalireza Yektamaram
ARC (adaptive regularization using cubics) methods are a relatively new family of optimization strategies that use a cubic-regularization (CR) term in place of trust regions and line searches.
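For orientation, the cubic-regularization term can be written out in the form commonly used in the ARC literature. This is a sketch in standard notation, not taken from the abstract: the symbols $x_k$ (current iterate), $g_k$ (gradient at $x_k$), $B_k$ (Hessian or an approximation to it), and $\sigma_k > 0$ (adaptive regularization weight) are assumptions introduced here for illustration.

```latex
% Local model minimized (approximately) at iteration k to obtain the step s_k.
% The cubic term penalizes long steps, playing the role that a trust-region
% radius or a line-search step length plays in other methods.
\[
  m_k(s) \;=\; f(x_k) \;+\; g_k^{\top} s \;+\; \tfrac{1}{2}\, s^{\top} B_k\, s
  \;+\; \tfrac{\sigma_k}{3}\, \lVert s \rVert^{3}
\]
```

In this form, a larger $\sigma_k$ yields shorter, more conservative steps, while a smaller $\sigma_k$ brings the step closer to a Newton-type step.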