1 code implementation • 2 Jun 2022 • Jacob Portes, Davis Blalock, Cory Stephenson, Jonathan Frankle
Benchmarking the tradeoff between neural network accuracy and training time is computationally expensive.
no code implementations • 22 Nov 2023 • Aditi Jha, Sam Havens, Jeremey Dohmann, Alex Trott, Jacob Portes
We find that subsets of 1k-6k instruction-finetuning samples are sufficient to achieve good performance on both (1) traditional NLP benchmarks and (2) model-based evaluation.
1 code implementation • NeurIPS 2023 • Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle
Here we introduce MosaicBERT, a BERT-style encoder architecture and training recipe that is empirically optimized for fast pretraining.