no code implementations • 9 Apr 2024 • Gonçalo Paulo, Thomas Marshall, Nora Belrose
Recent advances in recurrent neural network architectures, such as Mamba and RWKV, have enabled RNNs to match or exceed the performance of equal-size transformers in terms of language modeling perplexity and downstream evaluations, suggesting that future systems may be built on completely new architectures.
no code implementations • 24 Feb 2021 • Gonçalo Paulo, Mykola Tasinkevych
In model $\cal J$, in contrast, the oscillators in subpopulation $\cal A$ behave as in model $\cal I$, while the oscillators in subpopulation $\cal B$ tend to be out of phase with all the others.
Soft Condensed Matter