Search Results for author: Marco Bellagente

Found 9 papers, 4 papers with code

Arabic Stable LM: Adapting Stable LM 2 1.6B to Arabic

no code implementations5 Dec 2024 Zaid Alyafeai, Michael Pieler, Hannah Teufel, Jonathan Tow, Marco Bellagente, Duy Phung, Nikhil Pinnaparaju, Reshinth Adithyan, Paulo Rocha, Maksym Zhuravinskyi, Carlos Riquelme

Our Arabic Stable LM 1. 6B chat model achieves impressive results on several benchmarks beating multiple models with up to 8x the parameters.

Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training

no code implementations28 Oct 2024 Michael Pieler, Marco Bellagente, Hannah Teufel, Duy Phung, Nathan Cooper, Jonathan Tow, Paulo Rocha, Reshinth Adithyan, Zaid Alyafeai, Nikhil Pinnaparaju, Maksym Zhuravinskyi, Carlos Riquelme

In addition, we provide a detailed study of our pipeline, investigating the choice of the base dataset and LLM for the rephrasing, as well as the relationship between the model size and the performance after pre-training.

Benchmarking Language Modeling +2

Stable Code Technical Report

no code implementations1 Apr 2024 Nikhil Pinnaparaju, Reshinth Adithyan, Duy Phung, Jonathan Tow, James Baicoianu, Ashish Datta, Maksym Zhuravinskyi, Dakota Mahan, Marco Bellagente, Carlos Riquelme, Nathan Cooper

Stable Code Instruct also exhibits state-of-the-art performance on the MT-Bench coding tasks and on Multi-PL completion compared to other instruction tuned models.

Code Completion Language Modelling +2

Quality-Diversity through AI Feedback

no code implementations19 Oct 2023 Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

In many text-generation problems, users may prefer not only a single response, but a diverse range of high-quality outputs from which to choose.

Diversity Text Generation

Latent Space Refinement for Deep Generative Models

1 code implementation1 Jun 2021 Ramon Winterhalder, Marco Bellagente, Benjamin Nachman

Deep generative models are becoming widely used across science and industry for a variety of purposes.

Cannot find the paper you are looking for? You can Submit a new open access paper.