Search Results for author: Nikhil Sardana

Found 4 papers, 3 papers with code

Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

no code implementations • 31 Dec 2023 • Nikhil Sardana, Jonathan Frankle

We modify the Chinchilla scaling laws to calculate the optimal LLM parameter count and pre-training data size to train and deploy a model of a given quality and inference demand.

Language Modelling Large Language Model

Paper
Add Code

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

1 code implementation • NeurIPS 2023 • Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle

Here, we introduce MosaicBERT, a BERT-style encoder architecture and training recipe that is empirically optimized for fast pretraining.

Language Modelling Masked Language Modeling

416

Paper
Code

Autonomous Reinforcement Learning: Formalism and Benchmarking

2 code implementations • ICLR 2022 • Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn

In this paper, we aim to address this discrepancy by laying out a framework for Autonomous Reinforcement Learning (ARL): reinforcement learning where the agent not only learns through its own experience, but also contends with lack of human supervision to reset between trials.

Benchmarking reinforcement-learning +1

Paper
Code

Bayesian Meta-Learning Through Variational Gaussian Processes

1 code implementation • 21 Oct 2021 • Vivek Myers, Nikhil Sardana

This problem setting can be extended to the Bayesian context, wherein rather than predicting a single label for each query data point, a model predicts a distribution of labels capturing its uncertainty.

Gaussian Processes Meta-Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.