1 code implementation • 16 Nov 2023 • Shivanshu Gupta, Clemens Rosenbaum, Ethan R. Elenberg
Further, we experiment with two variations: (1) fine-tuning gist models for each dataset and (2) multi-task training a single model on a large collection of datasets.
no code implementations • 30 Sep 2022 • Nihal V. Nayak, Ethan R. Elenberg, Clemens Rosenbaum
We adapt existing approaches from the few-sample model evaluation literature to actively sub-sample, with a learned surrogate model, the most informative data points for annotation to estimate the evaluation metric.
no code implementations • 31 Dec 2019 • Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro
In this work we note that while this key assumption of the policy gradient theorems of option-critic holds in the tabular case, it is always violated in practice for the deep function approximation setting.
no code implementations • NAACL 2019 • Ignacio Cases, Clemens Rosenbaum, Matthew Riemer, Atticus Geiger, Tim Klinger, Alex Tamkin, Olivia Li, S Agarwal, hini, Joshua D. Greene, Dan Jurafsky, Christopher Potts, Lauri Karttunen
The model jointly optimizes the parameters of the functions and the meta-learner{'}s policy for routing inputs through those functions.
1 code implementation • 29 Apr 2019 • Clemens Rosenbaum, Ignacio Cases, Matthew Riemer, Tim Klinger
Compositionality is a key strategy for addressing combinatorial complexity and the curse of dimensionality.
no code implementations • ICLR 2018 • Clemens Rosenbaum, Tim Klinger, Matthew Riemer
A routing network is a kind of self-organizing neural network consisting of two components: a router and a set of one or more function blocks.
1 code implementation • ICLR 2018 • Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell
Options in reinforcement learning allow agents to hierarchically decompose a task into subtasks, having the potential to speed up learning and planning.
no code implementations • 5 Aug 2017 • Clemens Rosenbaum, Tian Gao, Tim Klinger
In this paper we present a new dataset and user simulator e-QRAQ (explainable Query, Reason, and Answer Question) which tests an Agent's ability to read an ambiguous text; ask questions until it can answer a challenge question; and explain the reasoning behind its questions and answer.
no code implementations • 15 Jun 2016 • Ishan P. Durugkar, Clemens Rosenbaum, Stefan Dernbach, Sridhar Mahadevan
Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain.