no code implementations • 23 Feb 2024 • Yanjun Zhao, Sizhe Dang, Haishan Ye, Guang Dai, Yi Qian, Ivor W. Tsang
Fine-tuning large language models (LLMs) with classic first-order optimizers entails prohibitive GPU memory due to the backpropagation process.
no code implementations • 8 Feb 2024 • Yanjun Zhao, Tian Zhou, Chao Chen, Liang Sun, Yi Qian, Rong Jin
Time series analysis is vital for numerous applications, and transformers have become increasingly prominent in this domain.
Computational Efficiency Multivariate Time Series Forecasting +2
1 code implementation • 6 Dec 2023 • Chao Chen, Tian Zhou, Yanjun Zhao, Hui Liu, Liang Sun, Rong Jin
Moreover, we approximate the sparse regression process using a blend of a two-layer MLP and an extensive codebook.
Ranked #5 on Traffic Prediction on BJTaxi
1 code implementation • 14 Jun 2023 • Yanjun Zhao, Ziqing Ma, Tian Zhou, Liang Sun, Mengni Ye, Yi Qian
On the other hand, the long input sequence usually leads to large model size and high time complexity.
no code implementations • 21 Jun 2020 • Sanghoon Lee, Colton Farley, Simon Shim, Yanjun Zhao, Wookjin Choi, Wook-Sung Yoo
We demonstrate the effectiveness of the proposed approach for cancer detection in BRCA and show how the machine can choose the most appropriate clusters during the unsupervised learning procedure.