Search Results for author: Junki Park

Found 1 paper, 0 papers with code

NN-LUT: Neural Approximation of Non-Linear Operations for Efficient Transformer Inference

no code implementations • 3 Dec 2021 • Joonsang Yu, Junki Park, Seongmin Park, Minsoo Kim, Sihwa Lee, Dong Hyun Lee, Jungwook Choi

Non-linear operations such as GELU, Layer normalization, and Softmax are essential yet costly building blocks of Transformer models.