no code implementations • 19 Jun 2024 • Junhan Kim, Ho-young Kim, Eulrang Cho, Chungman Lee, Joonyoung Kim, Yongkweon Jeon
Quantization is a promising solution for deploying large-scale language models (LLMs) on resource-constrained devices.
no code implementations • 14 Feb 2024 • Junhan Kim, Kyungphil Park, Chungman Lee, Ho-young Kim, Joonyoung Kim, Yongkweon Jeon
Through extensive experiments on various language models and complexity analysis, we demonstrate that aespa is accurate and efficient in quantizing Transformer models.
no code implementations • 15 Jul 2023 • Joonyoung Kim, Kangwook Lee, Haebin Shin, Hurnjoo Lee, Sechun Kang, Byunguk Choi, Dong Shin, Joohyung Lee
The more new features that are being added to smartphones, the harder it becomes for users to find them.
no code implementations • 7 Dec 2021 • Youngjune Lee, Oh Joon Kwon, Haeju Lee, Joonyoung Kim, Kangwook Lee, Kee-Eung Kim
For this reason, data-centric approaches are crucial for the automation of machine learning operation pipeline.
1 code implementation • 13 Jan 2021 • Segwang Kim, Hyoungwook Nam, Joonyoung Kim, Kyomin Jung
Logical reasoning tasks over symbols, such as learning arithmetic operations and computer program evaluations, have become challenges to deep learning.