Search Results for author: Zachary Charles

Found 27 papers, 12 papers with code

FAX: Scalable and Differentiable Federated Primitives in JAX

1 code implementation • 11 Mar 2024 • Keith Rush, Zachary Charles, Zachary Garrett

We show that FAX provides an easily programmable, performant, and scalable framework for federated computations in the data center.

32,745

Paper
Code

Adaptive Federated Optimization

5 code implementations • ICLR 2021 • Sashank Reddi, Zachary Charles, Manzil Zaheer, Zachary Garrett, Keith Rush, Jakub Konečný, Sanjiv Kumar, H. Brendan McMahan

Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data.

Federated Learning

4,139

Paper
Code

Advances and Open Problems in Federated Learning

8 code implementations • 10 Dec 2019 • Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D'Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, Justin Hsu, Martin Jaggi, Tara Javidi, Gauri Joshi, Mikhail Khodak, Jakub Konečný, Aleksandra Korolova, Farinaz Koushanfar, Sanmi Koyejo, Tancrède Lepoint, Yang Liu, Prateek Mittal, Mehryar Mohri, Richard Nock, Ayfer Özgür, Rasmus Pagh, Mariana Raykova, Hang Qi, Daniel Ramage, Ramesh Raskar, Dawn Song, Weikang Song, Sebastian U. Stich, Ziteng Sun, Ananda Theertha Suresh, Florian Tramèr, Praneeth Vepakomma, Jianyu Wang, Li Xiong, Zheng Xu, Qiang Yang, Felix X. Yu, Han Yu, Sen Zhao

FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches.

BIG-bench Machine Learning Federated Learning

4,053

Paper
Code

On the Outsized Importance of Learning Rates in Local Update Methods

1 code implementation • 2 Jul 2020 • Zachary Charles, Jakub Konečný

We study a family of algorithms, which we refer to as local update methods, that generalize many federated learning and meta-learning algorithms.

Federated Learning Meta-Learning

647

Paper
Code

A Field Guide to Federated Optimization

2 code implementations • 14 Jul 2021 • Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, Satyen Kale, Sai Praneeth Karimireddy, Jakub Konecny, Sanmi Koyejo, Tian Li, Luyang Liu, Mehryar Mohri, Hang Qi, Sashank J. Reddi, Peter Richtarik, Karan Singhal, Virginia Smith, Mahdi Soltanolkotabi, Weikang Song, Ananda Theertha Suresh, Sebastian U. Stich, Ameet Talwalkar, Hongyi Wang, Blake Woodworth, Shanshan Wu, Felix X. Yu, Honglin Yuan, Manzil Zaheer, Mi Zhang, Tong Zhang, Chunxiang Zheng, Chen Zhu, Wennan Zhu

Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection.

Federated Learning

647

Paper
Code

Optimizing the Communication-Accuracy Trade-off in Federated Learning with Rate-Distortion Theory

1 code implementation • 7 Jan 2022 • Nicole Mitchell, Johannes Ballé, Zachary Charles, Jakub Konečný

A significant bottleneck in federated learning (FL) is the network communication cost of sending model updates from client devices to the central server.

Federated Learning Quantization

647

Paper
Code

Motley: Benchmarking Heterogeneity and Personalization in Federated Learning

2 code implementations • 18 Jun 2022 • Shanshan Wu, Tian Li, Zachary Charles, Yu Xiao, Ziyu Liu, Zheng Xu, Virginia Smith

To better answer these questions, we propose Motley, a benchmark for personalized federated learning.

Benchmarking Fairness +3

647

Paper
Code

ATOMO: Communication-efficient Learning via Atomic Sparsification

1 code implementation • NeurIPS 2018 • Hongyi Wang, Scott Sievert, Zachary Charles, Shengchao Liu, Stephen Wright, Dimitris Papailiopoulos

We present ATOMO, a general framework for atomic sparsification of stochastic gradients.

Paper
Code

DRACO: Byzantine-resilient Distributed Training via Redundant Gradients

1 code implementation • ICML 2018 • Lingjiao Chen, Hongyi Wang, Zachary Charles, Dimitris Papailiopoulos

Distributed model training is vulnerable to byzantine system failures and adversarial compute nodes, i. e., nodes that use malicious updates to corrupt the global model stored at a parameter server (PS).

Paper
Code

Towards Federated Foundation Models: Scalable Dataset Pipelines for Group-Structured Learning

1 code implementation • NeurIPS 2023 • Zachary Charles, Nicole Mitchell, Krishna Pillutla, Michael Reneer, Zachary Garrett

Finally, it is framework-agnostic.

Federated Learning Language Modelling +1

Paper
Code

DETOX: A Redundancy-based Framework for Faster and More Robust Gradient Aggregation

1 code implementation • NeurIPS 2019 • Shashank Rajput, Hongyi Wang, Zachary Charles, Dimitris Papailiopoulos

In this work, we present DETOX, a Byzantine-resilient distributed training framework that combines algorithmic redundancy with robust aggregation.

Paper
Code

ErasureHead: Distributed Gradient Descent without Delays Using Approximate Gradient Coding

1 code implementation • 28 Jan 2019 • Hongyi Wang, Zachary Charles, Dimitris Papailiopoulos

We present ErasureHead, a new approach for distributed gradient descent (GD) that mitigates system delays by employing approximate gradient coding.

Paper
Code

Gradient Coding via the Stochastic Block Model

no code implementations • 25 May 2018 • Zachary Charles, Dimitris Papailiopoulos

Gradient descent and its many variants, including mini-batch stochastic gradient descent, form the algorithmic foundation of modern large-scale machine learning.

Stochastic Block Model

Paper
Add Code

Subspace Clustering with Missing and Corrupted Data

no code implementations • 8 Jul 2017 • Zachary Charles, Amin Jalali, Rebecca Willett

Given full or partial information about a collection of points that lie close to a union of several subspaces, subspace clustering refers to the process of clustering the points according to their subspace and identifying the subspaces.

Clustering

Paper
Add Code

Approximate Gradient Coding via Sparse Random Graphs

no code implementations • 17 Nov 2017 • Zachary Charles, Dimitris Papailiopoulos, Jordan Ellenberg

Distributed algorithms are often beset by the straggler effect, where the slowest compute nodes in the system dictate the overall running time.

Paper
Add Code

Stability and Generalization of Learning Algorithms that Converge to Global Optima

no code implementations • ICML 2018 • Zachary Charles, Dimitris Papailiopoulos

Finally, we show that although our results imply comparable stability for SGD and GD in the PL setting, there exist simple neural networks with multiple local minima where SGD is stable but GD is not.

Generalization Bounds

Paper
Add Code

A Geometric Perspective on the Transferability of Adversarial Directions

no code implementations • 8 Nov 2018 • Zachary Charles, Harrison Rosenberg, Dimitris Papailiopoulos

We show that these "transferable adversarial directions" are guaranteed to exist for linear separators of a given set, and will exist with high probability for linear classifiers trained on independent sets drawn from the same distribution.

Paper
Add Code

Does Data Augmentation Lead to Positive Margin?

no code implementations • 8 May 2019 • Shashank Rajput, Zhili Feng, Zachary Charles, Po-Ling Loh, Dimitris Papailiopoulos

Data augmentation (DA) is commonly used during model training, as it significantly improves test error and model robustness.

Data Augmentation

Paper
Add Code

Convergence and Margin of Adversarial Training on Separable Data

no code implementations • 22 May 2019 • Zachary Charles, Shashank Rajput, Stephen Wright, Dimitris Papailiopoulos

Our results are derived by showing that adversarial training with gradient updates minimizes a robust version of the empirical risk at a $\mathcal{O}(\ln(t)^2/t)$ rate, despite non-smoothness.

Paper
Add Code

Convergence and Accuracy Trade-Offs in Federated Learning and Meta-Learning

no code implementations • 8 Mar 2021 • Zachary Charles, Jakub Konečný

Using these insights, we are able to compare local update methods based on their convergence/accuracy trade-off, not just their convergence to critical points of the empirical loss.

Federated Learning Meta-Learning

Paper
Add Code

Local Adaptivity in Federated Learning: Convergence and Consistency

no code implementations • 4 Jun 2021 • Jianyu Wang, Zheng Xu, Zachary Garrett, Zachary Charles, Luyang Liu, Gauri Joshi

Popular optimization algorithms of FL use vanilla (stochastic) gradient descent for both local updates at clients and global updates at the aggregating server.

Federated Learning

Paper
Add Code

On Large-Cohort Training for Federated Learning

no code implementations • NeurIPS 2021 • Zachary Charles, Zachary Garrett, Zhouyuan Huo, Sergei Shmulyian, Virginia Smith

Our work highlights a number of challenges stemming from the use of larger cohorts.

Fairness Federated Learning

Paper
Add Code

Iterated Vector Fields and Conservatism, with Applications to Federated Learning

no code implementations • 8 Sep 2021 • Zachary Charles, Keith Rush

In the context of federated learning, we show that when clients have loss functions whose gradients satisfy this condition, federated averaging is equivalent to gradient descent on a surrogate loss function.

Federated Learning

Paper
Add Code

Federated Select: A Primitive for Communication- and Memory-Efficient Federated Learning

no code implementations • 19 Aug 2022 • Zachary Charles, Kallista Bonawitz, Stanislav Chiknavaryan, Brendan Mcmahan, Blaise Agüera y Arcas

In order to make this practical, we outline a primitive, federated select, which enables client-specific selection in realistic FL systems.

Federated Learning Privacy Preserving

Paper
Add Code

Federated Automatic Differentiation

no code implementations • 18 Jan 2023 • Keith Rush, Zachary Charles, Zachary Garrett

We propose a federated automatic differentiation (FAD) framework that 1) enables computing derivatives of functions involving client and server computation as well as communication between them and 2) operates in a manner compatible with existing federated technology.

FAD Federated Learning +1

Paper
Add Code

Gradient Descent with Linearly Correlated Noise: Theory and Applications to Differential Privacy

no code implementations • NeurIPS 2023 • Anastasia Koloskova, Ryan McKenna, Zachary Charles, Keith Rush, Brendan Mcmahan

We propose a simplified setting that distills key facets of these methods and isolates the impact of linearly correlated noise.

Federated Learning

Paper
Add Code

Leveraging Function Space Aggregation for Federated Learning at Scale

no code implementations • 17 Nov 2023 • Nikita Dhawan, Nicole Mitchell, Zachary Charles, Zachary Garrett, Gintare Karolina Dziugaite

Many federated learning algorithms, including the canonical Federated Averaging (FedAvg), take a direct (possibly weighted) average of the client parameter updates, motivated by results in distributed optimization.

Distributed Optimization Federated Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.