no code implementations • 1 Apr 2024 • Jaehyeon Moon, Dohyung Kim, Junyong Cheon, Bumsub Ham
In particular, the distribution of activations for each channel varies drastically across input instances, making post-training quantization (PTQ) methods designed for CNNs inappropriate for vision transformers (ViTs).
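
To make the claim concrete, here is a minimal NumPy sketch of why a quantizer calibrated once per channel can fail when channel ranges shift between inputs. It is not the authors' method: the activations are synthetic, the scale values are made up for illustration, and the helper names `per_instance_channel_ranges` and `uniform_quantize` are hypothetical.

```python
import numpy as np

def per_instance_channel_ranges(acts):
    """Range (max - min over tokens) for every (instance, channel) pair.

    acts has shape (instances, tokens, channels).
    """
    return acts.max(axis=1) - acts.min(axis=1)

def uniform_quantize(x, lo, hi, bits=8):
    """Uniform quantization of x to 2**bits levels on [lo, hi], then dequantize."""
    levels = 2 ** bits - 1
    step = (hi - lo) / levels
    q = np.clip(np.round((x - lo) / step), 0, levels)
    return q * step + lo

# Synthetic activations (an assumption, not real ViT data): 2 instances,
# 16 tokens, 3 channels. Each (instance, channel) pair has its own scale.
scales = np.array([[1.0, 0.1, 5.0],
                   [10.0, 2.0, 0.5]])
base = np.linspace(-1.0, 1.0, 16)
acts = base[None, :, None] * scales[:, None, :]

ranges = per_instance_channel_ranges(acts)

# Static quantizer: per-channel range calibrated on instance 0, reused on instance 1.
lo0, hi0 = acts[0].min(axis=0), acts[0].max(axis=0)
static_err = np.abs(acts[1] - uniform_quantize(acts[1], lo0, hi0)).max()

# Dynamic quantizer: per-channel range taken from instance 1 itself.
lo1, hi1 = acts[1].min(axis=0), acts[1].max(axis=0)
dynamic_err = np.abs(acts[1] - uniform_quantize(acts[1], lo1, hi1)).max()

print("per-instance channel ranges:\n", ranges)
print("max abs error, static ranges: ", static_err)
print("max abs error, dynamic ranges:", dynamic_err)
```

In this toy setup the static quantizer clips the first channel of the second instance heavily, because that channel's range is ten times the calibrated one, while ranges computed per instance keep the error within half a quantization step.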