no code implementations • 1 Feb 2024 • Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad
The query, key, and value are often intertwined and generated within those blocks via a single, shared linear transformation.
no code implementations • 17 May 2023 • Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad
The manifold hypothesis posits that high-dimensional data often lies on a lower-dimensional manifold and that utilizing this manifold as the target space yields more efficient representations.
no code implementations • 23 Mar 2023 • Lars Ødegaard Bentsen, Narada Dilp Warakagoda, Roy Stenbro, Paal Engelstad
With an ever-increasing number of sensors in modern society, spatio-temporal time series forecasting has become a de facto tool to make informed decisions about the future.
1 code implementation • 13 Mar 2023 • Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad
Contrastive methods have performed better than generative models in previous state representation learning research.
1 code implementation • 29 Aug 2022 • Lars Ødegaard Bentsen, Narada Dilp Warakagoda, Roy Stenbro, Paal Engelstad
Various alterations have been proposed to better facilitate time series forecasting, of which this study focused on the Informer, LogSparse Transformer and Autoformer.
Multivariate Time Series Forecasting Spatio-Temporal Forecasting +2
1 code implementation • 30 Jun 2022 • Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad
Transformers are neural network models that utilize multiple layers of self-attention heads and have exhibited enormous potential in natural language processing tasks.
no code implementations • 2 Mar 2022 • Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad
In this article, we further explore the possibility of replacing priors with noise and sample the noise from a Gaussian distribution to introduce more diversity into this algorithm.
no code implementations • 10 Jan 2022 • Lars Ødegaard Bentsen, Narada Dilp Warakagoda, Roy Stenbro, Paal Engelstad
With the increased penetration of wind energy into the power grid, it has become increasingly important to be able to predict the expected power production for larger wind farms.
no code implementations • 28 Jun 2021 • Li Meng, Anis Yazidi, Morten Goodwin, Paal Engelstad
Using the board game Othello, we compare our algorithm with the baseline Q-learning algorithm, which is a combination of Double Q-learning and Dueling Q-learning.