Search Results for author: Nikolas Tezak

Found 4 papers, 4 papers with code

Efficient Training of Language Models to Fill in the Middle

2 code implementations • 28 Jul 2022 • Mohammad Bavarian, Heewoo Jun, Nikolas Tezak, John Schulman, Christine McLeavey, Jerry Tworek, Mark Chen

To this end, we run a series of ablations on key hyperparameters, such as the data transformation frequency, the structure of the transformation, and the method of selecting the infill span.

Data Augmentation
