Search Results for author: Yury Tokpanov

Found 1 papers, 1 papers with code

BlackMamba: Mixture of Experts for State-Space Models

1 code implementation1 Feb 2024 Quentin Anthony, Yury Tokpanov, Paolo Glorioso, Beren Millidge

In this paper, we present BlackMamba, a novel architecture that combines the Mamba SSM with MoE to obtain the benefits of both.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.