AutoSync

Introduced by Zhang et al. in AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning

AutoSync is a pipeline for automatically optimizing synchronization strategies, given model structures and resource specifications, in data-parallel distributed machine learning. By factorizing the synchronization strategy with respect to each trainable building block of a DL model, we can construct a valid and large strategy space spanned by multiple factors. AutoSync efficiently navigates the space and locates the optimal strategy. AutoSync leverages domain knowledge about synchronization systems to reduce the search space, and is equipped with a domain adaptive simulator, which combines principled communication modeling and data-driven ML models, to estimate the runtime of strategy proposals without launching real distributed execution.

Source: AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning

Read Paper

Papers

Paper	Code	Results	Date	Stars

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
🤖 No Components Found	You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories

Add Remove

Distributed Methods

Auto Parallel Methods

AutoSync

Papers

Usage Over Time

Components

Categories Edit Add Remove

Categories

Add Remove