no code implementations • ICML 2020 • Grigory Malinovsky, Dmitry Kovalev, Elnur Gasanov, Laurent Condat, Peter Richtárik
Most algorithms for solving optimization problems or finding saddle points of convex-concave functions are fixed-point algorithms.
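A minimal sketch of the fixed-point viewpoint on a hypothetical quadratic objective (the matrix A, vector b, and step size gamma below are illustrative, not from the paper): gradient descent is an iteration x_{k+1} = T(x_k) whose fixed points are exactly the minimizers.

```python
import numpy as np

# Gradient descent on f(x) = 0.5 * x^T A x - b^T x is the fixed-point iteration
# x_{k+1} = T(x_k) with T(x) = x - gamma * (A x - b); its fixed points satisfy
# A x = b, i.e. they are exactly the minimizers of f.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
b = np.array([1.0, 1.0])
gamma = 0.2

def T(x):
    return x - gamma * (A @ x - b)

x = np.zeros(2)
for _ in range(200):
    x = T(x)

print(x, np.linalg.solve(A, b))  # the iterate approaches the fixed point A^{-1} b
```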
no code implementations • 11 Mar 2024 • Yury Demidovich, Grigory Malinovsky, Peter Richtárik
These methods replace the outer loop with probabilistic gradient computation triggered by a coin flip in each iteration, ensuring simpler proofs, efficient hyperparameter selection, and sharp convergence guarantees.
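A rough Euclidean sketch of the coin-flip pattern the abstract describes, in the spirit of loopless variance reduction (the least-squares problem, probability p, and step size are illustrative assumptions, not the paper's setting):

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 100, 5
A = rng.normal(size=(n, d))
y = rng.normal(size=n)

def grad_i(x, i):               # per-sample gradient of 0.5 * (a_i^T x - y_i)^2
    return (A[i] @ x - y[i]) * A[i]

def full_grad(x):
    return A.T @ (A @ x - y) / n

# Loopless (coin-flip) variance reduction: instead of an outer loop that
# recomputes the full gradient every m steps, a Bernoulli coin flip with
# probability p triggers the recomputation.
x = np.zeros(d)
w = x.copy()                    # reference point
gw = full_grad(w)
step, p = 0.02, 0.1

for _ in range(2000):
    i = rng.integers(n)
    g = grad_i(x, i) - grad_i(w, i) + gw   # variance-reduced gradient estimator
    x = x - step * g
    if rng.random() < p:                   # coin flip replaces the outer loop
        w, gw = x.copy(), full_grad(x)
```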
1 code implementation • 27 Nov 2023 • Yury Demidovich, Grigory Malinovsky, Egor Shulgin, Peter Richtárik
We introduce a novel optimization problem formulation that departs from the conventional way of minimizing machine learning model loss as a black-box function.
no code implementations • 23 Nov 2023 • Grigory Malinovsky, Peter Richtárik, Samuel Horváth, Eduard Gorbunov
Distributed learning has emerged as a leading paradigm for training large machine learning models.
no code implementations • 5 Jun 2023 • Michał Grudzień, Grigory Malinovsky, Peter Richtárik
In this setting, the communication between the server and clients poses a major bottleneck.
1 code implementation • 20 Feb 2023 • Laurent Condat, Ivan Agarský, Grigory Malinovsky, Peter Richtárik
We propose TAMUNA, the first algorithm for distributed optimization that jointly leverages the two strategies of local training and compression and allows for partial participation.
no code implementations • 7 Feb 2023 • Grigory Malinovsky, Samuel Horváth, Konstantin Burlachenko, Peter Richtárik
Under this scheme, each client joins the learning process every $R$ communication rounds, which we refer to as a meta epoch.
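A toy sketch of the participation schedule described above, with hypothetical client objectives (the cohort size, learning rate, and quadratic losses are illustrative assumptions, not the paper's method):

```python
import numpy as np

rng = np.random.default_rng(1)
num_clients, cohort, d = 8, 2, 3
R = num_clients // cohort                     # each client joins once per R rounds (meta epoch)
targets = rng.normal(size=(num_clients, d))   # toy client optima

def local_update(x, target, steps=5, lr=0.2):
    # toy local training: gradient steps on 0.5 * ||x - target||^2
    for _ in range(steps):
        x = x - lr * (x - target)
    return x

x = np.zeros(d)
order = rng.permutation(num_clients)          # fixed participation schedule
for rnd in range(4 * R):
    start = (rnd % R) * cohort
    active = order[start:start + cohort]      # every client appears exactly once per meta epoch
    x = np.mean([local_update(x.copy(), targets[c]) for c in active], axis=0)
```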
no code implementations • 29 Dec 2022 • Michał Grudzień, Grigory Malinovsky, Peter Richtárik
The celebrated FedAvg algorithm of McMahan et al. (2017) is based on three components: client sampling (CS), data sampling (DS) and local training (LT).
no code implementations • 29 Dec 2022 • Alexander Gasnikov, Dmitry Kovalev, Grigory Malinovsky
In this paper we study the smooth strongly convex minimization problem $\min_{x}\min_y f(x, y)$.
no code implementations • 16 Sep 2022 • Soumia Boucherouite, Grigory Malinovsky, Peter Richtárik, El Houcine Bergou
In this paper, we propose a new zeroth-order optimization method, minibatch stochastic three points (MiSTP), for solving an unconstrained minimization problem in a setting where only approximate evaluations of the objective function are possible.
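A hedged sketch of the three-points idea with minibatch objective estimates — an illustrative reading of the mechanism, not the authors' exact algorithm (the problem, batch size, and step-size schedule are assumptions):

```python
import numpy as np

rng = np.random.default_rng(2)
n, d = 200, 10
A = rng.normal(size=(n, d))
y = rng.normal(size=n)

def minibatch_loss(x, idx):
    # only an inexact (minibatch) evaluation of the objective is available
    r = A[idx] @ x - y[idx]
    return 0.5 * np.mean(r ** 2)

# Compare the estimated loss at x, x + a*s and x - a*s along a random
# direction s and keep the best of the three points; no gradients are used.
x = np.zeros(d)
alpha = 0.5
for k in range(3000):
    idx = rng.integers(n, size=32)                     # minibatch of samples
    s = rng.normal(size=d)
    s /= np.linalg.norm(s)                             # random unit direction
    candidates = [x, x + alpha * s, x - alpha * s]
    losses = [minibatch_loss(c, idx) for c in candidates]
    x = candidates[int(np.argmin(losses))]
    alpha *= 0.999                                     # slowly shrink the step size
```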
1 code implementation • 9 Jul 2022 • Grigory Malinovsky, Kai Yi, Peter Richtárik
We study distributed optimization methods based on the {\em local training (LT)} paradigm: achieving communication efficiency by performing richer local gradient-based training on the clients before parameter averaging.
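A minimal sketch of the local training paradigm in the LocalGD / FedAvg spirit, on toy quadratic client objectives (all constants below are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)
num_clients, d, local_steps, lr = 5, 4, 10, 0.1
targets = rng.normal(size=(num_clients, d))   # toy heterogeneous client objectives

# Every client runs several local gradient steps on 0.5 * ||x - target_c||^2
# before the server averages the resulting models, so only one communication
# round is needed per block of local work.
x = np.zeros(d)
for rnd in range(50):
    local_models = []
    for c in range(num_clients):
        z = x.copy()
        for _ in range(local_steps):
            z = z - lr * (z - targets[c])
        local_models.append(z)
    x = np.mean(local_models, axis=0)         # one communication per round

print(x, targets.mean(axis=0))  # in this symmetric toy case the iterates reach the average optimum
```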
1 code implementation • 14 Jun 2022 • Abdurakhmon Sadiev, Grigory Malinovsky, Eduard Gorbunov, Igor Sokolov, Ahmed Khaled, Konstantin Burlachenko, Peter Richtárik
To reveal the true advantages of RR in distributed learning with compression, we propose a new method called DIANA-RR that reduces the compression variance and has provably better convergence rates than existing counterparts that rely on with-replacement sampling of stochastic gradients.
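A sketch of the DIANA-style shift mechanism that such compression-variance reduction builds on, with a Rand-k sparsifier and full local gradients for simplicity (illustrative only; the compressor, constants, and quadratic losses are assumptions, and the paper additionally uses random reshuffling of the local data):

```python
import numpy as np

rng = np.random.default_rng(4)
n_workers, d, n_local = 4, 6, 50
A = [rng.normal(size=(n_local, d)) for _ in range(n_workers)]
y = [rng.normal(size=n_local) for _ in range(n_workers)]

def rand_k(v, k=2):
    # unbiased Rand-k sparsifier: keep k random coordinates, rescale by d/k
    mask = np.zeros_like(v)
    idx = rng.choice(v.size, size=k, replace=False)
    mask[idx] = v.size / k
    return mask * v

def grad(i, x):
    return A[i].T @ (A[i] @ x - y[i]) / n_local

# Each worker compresses the *difference* between its gradient and a learned
# shift h_i; as the shifts track the local gradients, the compression variance
# is driven toward zero.
x = np.zeros(d)
h = [np.zeros(d) for _ in range(n_workers)]
step, alpha = 0.05, 0.3

for _ in range(500):
    msgs = [rand_k(grad(i, x) - h[i]) for i in range(n_workers)]
    g_hat = np.mean([h[i] + msgs[i] for i in range(n_workers)], axis=0)
    for i in range(n_workers):
        h[i] = h[i] + alpha * msgs[i]      # shift update
    x = x - step * g_hat
```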
no code implementations • 8 May 2022 • Grigory Malinovsky, Peter Richtárik
Random Reshuffling (RR), which is a variant of Stochastic Gradient Descent (SGD) employing sampling without replacement, is an immensely popular method for training supervised machine learning models via empirical risk minimization.
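A minimal sketch contrasting RR with with-replacement SGD on a toy least-squares problem (the data and learning rate are illustrative assumptions): one epoch visits every sample exactly once, in a freshly drawn random order.

```python
import numpy as np

rng = np.random.default_rng(5)
n, d = 64, 3
A = rng.normal(size=(n, d))
y = rng.normal(size=n)

def grad_i(x, i):
    return (A[i] @ x - y[i]) * A[i]

# Random Reshuffling: each epoch processes the n samples in a new random
# permutation, i.e. sampling without replacement, instead of drawing an
# independent index at every step.
x = np.zeros(d)
lr = 0.02
for epoch in range(100):
    for i in rng.permutation(n):      # sampling without replacement
        x = x - lr * grad_i(x, i)
```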
no code implementations • 18 Feb 2022 • Konstantin Mishchenko, Grigory Malinovsky, Sebastian Stich, Peter Richtárik
The canonical approach to solving such problems is via the proximal gradient descent (ProxGD) algorithm, which is based on the evaluation of the gradient of $f$ and the prox operator of $\psi$ in each iteration.
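A compact ProxGD sketch for the composite problem $\min_x f(x) + \psi(x)$, assuming a least-squares $f$ and $\psi(x) = \lambda \|x\|_1$ for illustration (these choices are not from the paper):

```python
import numpy as np

rng = np.random.default_rng(6)
n, d = 50, 20
A = rng.normal(size=(n, d))
y = rng.normal(size=n)
lam = 0.1

def prox_l1(v, t):
    # prox of psi(x) = lam * ||x||_1 with step t: soft-thresholding
    return np.sign(v) * np.maximum(np.abs(v) - t * lam, 0.0)

# ProxGD: a gradient step on the smooth part f followed by the prox of psi.
L = np.linalg.norm(A, 2) ** 2 / n      # smoothness constant of f
gamma = 1.0 / L
x = np.zeros(d)
for _ in range(500):
    g = A.T @ (A @ x - y) / n          # gradient of f
    x = prox_l1(x - gamma * g, gamma)  # prox step on psi
```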
no code implementations • 26 Jan 2022 • Grigory Malinovsky, Konstantin Mishchenko, Peter Richtárik
Together, our results on the advantage of large and small server-side stepsizes give a formal justification for the practice of adaptive server-side optimization in federated learning.
no code implementations • 19 Apr 2021 • Grigory Malinovsky, Alibek Sailanbayev, Peter Richtárik
One of the tricks that works so well in practice that it is used as the default in virtually all widely used machine learning software is {\em random reshuffling (RR)}.
no code implementations • 2 Oct 2020 • Laurent Condat, Grigory Malinovsky, Peter Richtárik
We analyze several generic proximal splitting algorithms well suited for large-scale convex nonsmooth optimization.
no code implementations • 3 Apr 2020 • Grigory Malinovsky, Dmitry Kovalev, Elnur Gasanov, Laurent Condat, Peter Richtárik
Most algorithms for solving optimization problems or finding saddle points of convex-concave functions are fixed-point algorithms.