1 code implementation • 27 Mar 2024 • Elliot Bolton, Abhinav Venigalla, Michihiro Yasunaga, David Hall, Betty Xiong, Tony Lee, Roxana Daneshjou, Jonathan Frankle, Percy Liang, Michael Carbin, Christopher D. Manning
Models such as GPT-4 and Med-PaLM 2 have demonstrated impressive performance on a wide variety of biomedical NLP tasks.
1 code implementation • NeurIPS 2023 • Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle
Here, we introduce MosaicBERT, a BERT-style encoder architecture and training recipe that is empirically optimized for fast pretraining.
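MosaicBERT's speed comes from combining architectural choices reported for the model, one of which is ALiBi (attention with linear biases) in place of learned position embeddings. As a hedged illustration only, not the authors' implementation, a symmetric ALiBi bias for a bidirectional encoder could be sketched as:

```python
def alibi_slopes(n_heads):
    # Geometric per-head slopes as in the ALiBi paper,
    # assuming n_heads is a power of two.
    start = 2 ** (-8 / n_heads)
    return [start ** (i + 1) for i in range(n_heads)]

def alibi_bias(n_heads, seq_len):
    # Bias added to attention logits before softmax:
    # slope * -(distance between query and key positions).
    # Symmetric |i - j| is used here since a BERT-style
    # encoder attends bidirectionally.
    slopes = alibi_slopes(n_heads)
    return [[[-s * abs(i - j) for j in range(seq_len)]
             for i in range(seq_len)]
            for s in slopes]
```

Because the bias depends only on relative distance, it can be precomputed once per sequence length and broadcast across the batch.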
no code implementations • 29 Mar 2021 • Valentina Popescu, Abhinav Venigalla, Di Wu, Robert Schreiber
While neural networks have traditionally been trained using IEEE-754 binary32 arithmetic, the rapid growth of computational demands in deep learning has spurred interest in faster, lower-precision training.
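One common low-precision format is bfloat16, which keeps binary32's exponent range but truncates the mantissa to 7 bits. As a rough illustration (not the paper's method), bfloat16 rounding can be simulated in software by keeping only the top 16 bits of a value's binary32 encoding:

```python
import struct

def to_bfloat16(x):
    # Simulate bfloat16 by keeping the top 16 bits of the
    # binary32 encoding, with round-to-nearest-even on the
    # dropped mantissa bits.
    bits = struct.unpack('<I', struct.pack('<f', x))[0]
    rounding = 0x7FFF + ((bits >> 16) & 1)  # ties to even
    bits = (bits + rounding) & 0xFFFF0000
    return struct.unpack('<f', struct.pack('<I', bits))[0]
```

For example, `to_bfloat16(0.1)` returns a value slightly above 0.1, reflecting the roughly 3 decimal digits of precision the format retains.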
no code implementations • 2 Jul 2020 • Abhinav Venigalla, Atli Kosson, Vitaliy Chiley, Urs Köster
Neural network training is commonly accelerated by using multiple synchronized workers to compute gradient updates in parallel.
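The synchronous data-parallel scheme described above can be sketched in a few lines: each worker computes a gradient on its shard of the batch, the per-worker gradients are averaged (an all-reduce in practice), and every worker applies the same update. This is a minimal single-process illustration with a toy 1-D linear model, not the paper's setup; with equal shard sizes the averaged gradient matches the full-batch gradient exactly.

```python
def grad_mse(w, xs, ys):
    # Gradient of mean squared error for the model y = w * x.
    n = len(xs)
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n

def sync_step(w, shards, lr=0.1):
    # One synchronized step: per-shard gradients (computed in
    # parallel by workers in practice), averaged, then applied.
    grads = [grad_mse(w, xs, ys) for xs, ys in shards]
    g = sum(grads) / len(grads)  # stands in for an all-reduce
    return w - lr * g

# Toy data split across two equal-size "workers".
xs, ys = [1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]
shards = [(xs[:2], ys[:2]), (xs[2:], ys[2:])]
w = sync_step(0.0, shards)
```

Because every worker ends the step with the identical weight, the replicas never drift, which is what distinguishes this synchronous scheme from asynchronous alternatives.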
no code implementations • 25 Mar 2020 • Atli Kosson, Vitaliy Chiley, Abhinav Venigalla, Joel Hestness, Urs Köster
New hardware can substantially increase the speed and efficiency of deep neural network training.