Tofu

Introduced by Wang et al. in Supporting Very Large Models using Automatic Dataflow Graph Partitioning

Tofu is an intra-layer model parallel system that partitions very large DNN models across multiple GPU devices to reduce per-GPU memory footprint. Tofu is designed to partition a dataflow graph of fine-grained tensor operators used by platforms like MXNet and TensorFlow. To optimally partition different operators in a dataflow graph, Tofu uses a recursive search algorithm that minimizes the total communication cost.

Source: Supporting Very Large Models using Automatic Dataflow Graph Partitioning

Read Paper

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Computational Efficiency	1	16.67%
Image Generation	1	16.67%
Recommendation Systems	1	16.67%
Federated Learning	1	16.67%
3D Reconstruction	1	16.67%
graph partitioning	1	16.67%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
🤖 No Components Found	You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories

Add Remove

Distributed Methods

Model Parallel Methods

Intra-Layer Parallel