Neural Network Compression

74 papers with code • 1 benchmark • 1 dataset

Benchmarks

These leaderboards are used to track progress in Neural Network Compression

Dataset	Best Model
CIFAR-10	ShuffleNet – Quantised

Libraries

Use these libraries to find Neural Network Compression models and implementations

yoshitomo-matsubara/torchdistill

4 papers

1,278

Datasets

CIFAR-10

Latest papers with no code

Neural Network Compression using Binarization and Few Full-Precision Weights

no code yet • 15 Jun 2023

Quantization and pruning are two effective model compression methods for deep neural networks.
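
As a rough illustration of the two methods named in the excerpt above, the following minimal sketch applies magnitude pruning followed by uniform symmetric quantization to a flat list of weights. It is not the binarization scheme of the paper; the function names (`magnitude_prune`, `quantize_symmetric`, `dequantize`) and the sample weights are hypothetical and chosen only for this example.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude."""
    k = int(len(weights) * sparsity)  # number of weights to drop
    if k == 0:
        return list(weights)
    # Indices of the k smallest-magnitude weights
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


def quantize_symmetric(weights, num_bits=8):
    """Uniform symmetric quantization; returns (integer codes, scale)."""
    qmax = 2 ** (num_bits - 1) - 1  # e.g. 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to [-qmax, qmax]
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate real-valued weights."""
    return [c * scale for c in codes]


# Hypothetical example weights
weights = [0.5, -0.25, 0.125, -1.0, 0.75, 0.0625]
pruned = magnitude_prune(weights, sparsity=0.5)  # [0.5, 0.0, 0.0, -1.0, 0.75, 0.0]
codes, scale = quantize_symmetric(pruned, num_bits=8)
restored = dequantize(codes, scale)
```

Each restored weight differs from its pruned value by at most about half a quantization step (`scale / 2`), while the pruned positions stay exactly zero.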

End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates

no code yet • 9 Jun 2023

Our algorithm is versatile and can be used with many popular compression methods including pruning, low-rank factorization, and quantization.
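
The title of this entry refers to an $\frac{\ell_1}{\ell_2}$ ratio. The sketch below only computes that ratio for a plain vector, to show why it can act as a scale-invariant sparsity measure: for a vector of length n it ranges from 1 (a single nonzero entry) to sqrt(n) (all entries of equal magnitude). It does not reproduce the paper's latency surrogate, and the function name `l1_over_l2` is hypothetical.

```python
import math


def l1_over_l2(values):
    """Return ||v||_1 / ||v||_2 for a non-zero vector given as a list of floats."""
    l1 = sum(abs(v) for v in values)
    l2 = math.sqrt(sum(v * v for v in values))
    if l2 == 0.0:
        raise ValueError("ratio is undefined for the all-zero vector")
    return l1 / l2


dense = l1_over_l2([1.0, 1.0, 1.0, 1.0])   # 4 / 2 = 2.0, i.e. sqrt(4)
sparse = l1_over_l2([0.0, 3.0, 0.0, 0.0])  # 3 / 3 = 1.0
scaled = l1_over_l2([0.0, 30.0, 0.0, 0.0])  # rescaling leaves the ratio at 1.0
```

Because the ratio does not change when the vector is rescaled, minimizing it favors fewer nonzero entries rather than simply smaller ones.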

Understanding the Effect of the Long Tail on Neural Network Compression

no code yet • 9 Jun 2023

For example, it has been shown that mismatches between the full and compressed models can be biased towards under-represented classes.

Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference

no code yet • 4 Jun 2023

Modular Transformers train modularized layers that have the same function as two or more consecutive layers in the original model via module replacing and knowledge distillation.
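
Knowledge distillation, mentioned in the excerpt above, is commonly implemented as a divergence between temperature-softened teacher and student output distributions. The sketch below shows that generic loss in plain Python; it is an assumption-laden illustration, not the training objective of Modular Transformers, and the names `softmax`, `distillation_loss`, and the sample logits are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the maximum to avoid overflow in exp
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)  # teacher targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Hypothetical logits for a three-class problem
teacher = [2.0, 1.0, 0.1]
loss_same = distillation_loss(teacher, teacher)          # student matches teacher
loss_diff = distillation_loss([0.1, 1.0, 2.0], teacher)  # student disagrees
```

The loss is zero when the student reproduces the teacher's distribution and grows as the two diverge; the T^2 factor is a common convention that keeps gradient magnitudes comparable across temperatures.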

Evaluation Metrics for DNNs Compression

no code yet • 18 May 2023

There is a lot of ongoing research effort into developing different techniques for neural network compression.

How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?

no code yet • 9 May 2023

While scaling the approximation error is commonly used to account for the different sizes of layers, the average correlation across layers is smaller than across all choices (i.e., layers, decompositions, and level of compression) before fine-tuning.

Guaranteed Quantization Error Computation for Neural Network Model Compression

no code yet • 26 Apr 2023

Neural network model compression techniques can address the computation issue of deep neural networks on embedded devices in industrial systems.

AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

no code yet • 28 Nov 2022

Inspired by the redundancy of neural networks, we propose a lightweight parallel training framework based on neural network compression, AcceRL, to accelerate the policy learning while ensuring policy quality.

Partial Binarization of Neural Networks for Budget-Aware Efficient Learning

no code yet • 12 Nov 2022

To address this issue, partial binarization techniques have been developed, but a systematic approach to mixing binary and full-precision parameters in a single network is still lacking.

Neural Network Compression by Joint Sparsity Promotion and Redundancy Reduction

no code yet • 14 Oct 2022

Compression of convolutional neural network models has recently been dominated by pruning approaches.
