no code implementations • 9 Aug 2023 • Tim Hartill, Diana Benavides-Prado, Michael Witbrock, Patricia J. Riddle
When provided with sufficient explanatory context, smaller language models have been shown to exhibit strong reasoning ability on challenging short-answer question-answering tasks where the questions are unseen in training.
1 code implementation • 5 May 2023 • Kobe Knowles, Joshua Bensemann, Diana Benavides-Prado, Vithya Yogarajan, Michael Witbrock, Gillian Dobbie, Yang Chen
We introduce a novel architecture, the Neuromodulation Gated Transformer (NGT), which is a simple implementation of neuromodulation in transformers via a multiplicative effect.
no code implementations • 14 Aug 2022 • Diana Benavides-Prado, Patricia Riddle
Continual learning of a stream of tasks is an active research area in deep neural networks.