Search Results for author: Chaoxiang Lan

Found 1 paper, 1 paper with code

Unified Normalization for Accelerating and Stabilizing Transformers

1 code implementation • 2 Aug 2022 • Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, ShiLiang Pu

To tackle these issues, we propose Unified Normalization (UN), which can speed up inference by being fused with other linear operations and achieves performance on par with LN.
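
The fusion idea mentioned in the abstract can be illustrated with a generic sketch. It assumes a normalization that uses fixed, precomputed per-feature statistics at inference time (as BatchNorm does), which makes it an affine map that can be folded into the linear layer that follows it. LN, by contrast, computes its statistics from each input at inference time, so it cannot be folded this way. This is not the paper's UN implementation; all function names and values below are hypothetical.

```python
import math

def normalize(x, mean, var, gamma, beta, eps=1e-5):
    """Per-feature affine normalization with fixed (precomputed) statistics."""
    return [g * (xi - m) / math.sqrt(v + eps) + b
            for xi, m, v, g, b in zip(x, mean, var, gamma, beta)]

def linear(x, weight, bias):
    """Compute W x + b, with weight given as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]

def fuse_norm_into_linear(weight, bias, mean, var, gamma, beta, eps=1e-5):
    """Fold the normalization into the linear layer that follows it.

    normalize(x) = scale * x + shift, where
        scale = gamma / sqrt(var + eps)
        shift = beta - scale * mean
    so W normalize(x) + b = (W * scale) x + (W shift + b).
    """
    scale = [g / math.sqrt(v + eps) for g, v in zip(gamma, var)]
    shift = [b - s * m for b, s, m in zip(beta, scale, mean)]
    # Scale each column of W by the matching per-feature scale.
    fused_weight = [[w * s for w, s in zip(row, scale)] for row in weight]
    # Push the constant shift through W and add it to the bias.
    fused_bias = [sum(w * t for w, t in zip(row, shift)) + b
                  for row, b in zip(weight, bias)]
    return fused_weight, fused_bias

# Hypothetical toy values: 3 input features, 2 output features.
x = [0.5, -1.0, 2.0]
mean = [0.1, 0.0, 1.0]
var = [1.0, 4.0, 0.25]
gamma = [1.0, 0.5, 2.0]
beta = [0.0, 0.1, -0.2]
weight = [[1.0, 2.0, 3.0], [-1.0, 0.5, 0.0]]
bias = [0.1, -0.3]

# Two-step path: normalize, then apply the linear layer.
reference = linear(normalize(x, mean, var, gamma, beta), weight, bias)

# One-step path: a single linear layer with fused parameters.
fused_weight, fused_bias = fuse_norm_into_linear(
    weight, bias, mean, var, gamma, beta)
fused = linear(x, fused_weight, fused_bias)
```

Here `reference` and `fused` agree up to floating-point rounding, so the separate normalization step adds no cost at inference once its parameters are folded into the linear layer.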
