1 code implementation • 28 Dec 2023 • Artyom Eliseev, Denis Mazur
In this work, we study the problem of running large Mixture-of-Experts (MoE) language models on consumer hardware with limited accelerator memory.
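Running an MoE model when only a few experts fit in accelerator memory amounts to caching expert weights and evicting stale ones on demand. As a toy illustration (not the authors' implementation — the class name, `load_fn`, and capacity are hypothetical), a least-recently-used expert cache could look like:

```python
from collections import OrderedDict

class ExpertCache:
    """Keep a few experts in fast (accelerator) memory; evict the LRU one."""

    def __init__(self, capacity, load_fn):
        self.capacity = capacity      # max experts resident in accelerator memory
        self.load_fn = load_fn        # fetches an expert's weights from host RAM/disk
        self.cache = OrderedDict()    # expert_id -> weights, in LRU order
        self.misses = 0               # counts slow loads from host memory

    def get(self, expert_id):
        if expert_id in self.cache:
            self.cache.move_to_end(expert_id)  # mark as most recently used
            return self.cache[expert_id]
        self.misses += 1
        if len(self.cache) >= self.capacity:
            self.cache.popitem(last=False)     # evict the least recently used expert
        self.cache[expert_id] = self.load_fn(expert_id)
        return self.cache[expert_id]

# Usage: room for 2 experts on the accelerator, the rest stay on the host.
cache = ExpertCache(capacity=2, load_fn=lambda i: f"weights-{i}")
for expert in [0, 1, 0, 2, 0, 1]:
    cache.get(expert)
print(cache.misses)  # → 4 (experts 2 and 1 each displace a cached expert once)
```

Reuse across consecutive tokens is what makes such caching pay off: frequently routed experts stay resident, so only misses incur the slow host-to-accelerator transfer.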
2 code implementations • NeurIPS 2021 • Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, Anton Sinitsin, Dmitry Popov, Dmitry Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko
Modern deep learning applications require increasingly more compute to train state-of-the-art models.
1 code implementation • NeurIPS 2019 • Denis Mazur, Vage Egiazarian, Stanislav Morozov, Artem Babenko
Our main contribution is PRODIGE: a method that learns a weighted graph representation of data end-to-end by gradient descent.
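"Learning a weighted graph end-to-end by gradient descent" means the edge weights themselves are the trainable parameters. A minimal toy sketch of that idea (not the PRODIGE objective — the quadratic loss, L1 sparsity term, and constants here are assumptions for illustration):

```python
import math
import random

random.seed(0)

# Toy data: points on a line; the targets are their pairwise distances.
points = [0.0, 1.0, 3.0, 6.0]
n = len(points)
pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
target = {(i, j): abs(points[i] - points[j]) for (i, j) in pairs}

# Learnable edge weights, randomly initialised.
w = {e: random.uniform(0.0, 1.0) for e in pairs}

lr, lam = 0.1, 0.01  # learning rate and sparsity strength (assumed values)
for step in range(500):
    for e in pairs:
        # Gradient of (w - d)^2 plus an L1 term that pushes unused edges to zero.
        grad = 2.0 * (w[e] - target[e]) + lam * math.copysign(1.0, w[e])
        w[e] -= lr * grad

# Learned weights approach the true distances (minus a small L1 shrinkage).
err = max(abs(w[e] - target[e]) for e in pairs)
print(err < 0.1)  # → True
```

The actual method optimises a differentiable objective over graph structure rather than matching edge weights to precomputed distances, but the mechanics are the same: treat the graph's weights as parameters and follow their gradients.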