Search Results for author: Eric Nguyen

Found 9 papers, 6 papers with code

Mechanistic Design and Scaling of Hybrid Architectures

no code implementations • 26 Mar 2024 • Michael Poli, Armin W Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang, Stefano Massaroli

The development of deep learning architectures is a resource-demanding process, due to a vast design space, long prototyping times, and high compute costs associated with at-scale model training and evaluation.

Paper
Add Code

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

1 code implementation • 10 Nov 2023 • Daniel Y. Fu, Hermann Kumbong, Eric Nguyen, Christopher Ré

FlashFFTConv uses a matrix decomposition that computes the FFT using matrix multiply units and enables kernel fusion for long sequences, reducing I/O.

216

Paper
Code

HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution

2 code implementations • NeurIPS 2023 • Eric Nguyen, Michael Poli, Marjan Faizi, Armin Thomas, Callum Birch-Sykes, Michael Wornow, Aman Patel, Clayton Rabideau, Stefano Massaroli, Yoshua Bengio, Stefano Ermon, Stephen A. Baccus, Chris Ré

Leveraging Hyena's new long-range capabilities, we present HyenaDNA, a genomic foundation model pretrained on the human reference genome with context lengths of up to 1 million tokens at the single nucleotide-level - an up to 500x increase over previous dense attention-based models.

4k In-Context Learning +2

486

Paper
Code

Hyena Hierarchy: Towards Larger Convolutional Language Models

5 code implementations • 21 Feb 2023 • Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Ré

Recent advances in deep learning have relied heavily on the use of large Transformers due to their ability to learn at scale.

Ranked #37 on Language Modelling on WikiText-103

2k 8k +2

838

Paper
Code

Simple Hardware-Efficient Long Convolutions for Sequence Modeling

1 code implementation • 13 Feb 2023 • Daniel Y. Fu, Elliot L. Epstein, Eric Nguyen, Armin W. Thomas, Michael Zhang, Tri Dao, Atri Rudra, Christopher Ré

We find that a key requirement to achieving high performance is keeping the convolution kernels smooth.

Image Classification Language Modelling

838

Paper
Code

S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces

1 code implementation • 12 Oct 2022 • Eric Nguyen, Karan Goel, Albert Gu, Gordon W. Downs, Preey Shah, Tri Dao, Stephen A. Baccus, Christopher Ré

On ImageNet-1k, S4ND exceeds the performance of a Vision Transformer baseline by $1. 5\%$ when training with a $1$D sequence of patches, and matches ConvNeXt when modeling images in $2$D.

Inductive Bias Video Classification

2,111

Paper
Code

Continual Reinforcement Learning with TELLA

no code implementations • 8 Aug 2022 • Neil Fendley, Cash Costello, Eric Nguyen, Gino Perrotta, Corey Lowman

Training reinforcement learning agents that continually learn across multiple environments is a challenging problem.

Continual Learning reinforcement-learning +1

Paper
Add Code

Lifelong Learning Metrics

1 code implementation • 20 Jan 2022 • Alexander New, Megan Baker, Eric Nguyen, Gautam Vallabha

The DARPA Lifelong Learning Machines (L2M) program seeks to yield advances in artificial intelligence (AI) systems so that they are capable of learning (and improving) continuously, leveraging data on one task to improve performance on another, and doing so in a computationally sustainable way.

Autonomous Driving

Paper
Code

OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution

no code implementations • ICCV 2021 • Eric Nguyen, Tu Bui, Vishy Swaminathan, John Collomosse

Our key contribution is OSCAR-Net (Object-centric Scene Graph Attention for Image Attribution Network); a robust image hashing model inspired by recent successes of Transformers in the visual domain.

Contrastive Learning Graph Attention

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.