no code implementations • 8 Dec 2024 • Yekun Ke, Yingyu Liang, Zhenmei Shi, Zhao Song, Chiwun Yang
Applying transformer-based models to time series forecasting (TSF) tasks has long been a popular research topic.
no code implementations • 15 Oct 2024 • Yekun Ke, Xiaoyu Li, Yingyu Liang, Zhenmei Shi, Zhao Song
Recent empirical studies have identified fixed-point iteration phenomena in deep neural networks, where the hidden state tends to stabilize after several layers, showing minimal change in subsequent layers.