no code implementations • 22 May 2023 • Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Alice Heiman, Judit Casademont, Magnus Sahlgren
This paper details the process of developing the first native large generative language model for the Nordic languages, GPT-SW3.
no code implementations • 30 Mar 2023 • Joey Öhman, Severine Verlinden, Ariel Ekgren, Amaru Cuba Gyllensten, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Magnus Sahlgren
Pre-training Large Language Models (LLMs) require massive amounts of text data, and the performance of the LLMs typically correlates with the scale and quality of the datasets.
no code implementations • LREC 2022 • Evangelia Gogoulou, Ariel Ekgren, Tim Isbister, Magnus Sahlgren
Additionally, the results of evaluating the transferred models in source language tasks reveal that their performance in the source domain deteriorates after transfer.
1 code implementation • NoDaLiDa 2021 • Tim Isbister, Fredrik Carlsson, Magnus Sahlgren
We demonstrate empirically that a large English language model coupled with modern machine translation outperforms native language models in most Scandinavian languages.
1 code implementation • 7 Sep 2020 • Tim Isbister, Magnus Sahlgren
This paper presents the first Swedish evaluation benchmark for textual semantic similarity.
no code implementations • 22 Oct 2019 • Nazar Akrami, Johan Fernquist, Tim Isbister, Lisa Kaati, Björn Pelzer
Our results show that the models based on the small high-reliability dataset performed better (in terms of $\textrm{R}^2$) than models based on large low-reliability dataset.
no code implementations • SEMEVAL 2019 • Tim Isbister, Fredrik Johansson
In a world of information operations, influence campaigns, and fake news, classification of news articles as following hyperpartisan argumentation or not is becoming increasingly important.
no code implementations • WS 2018 • Magnus Sahlgren, Tim Isbister, Fredrik Olsson
This paper discusses the question whether it is possible to learn a generic representation that is useful for detecting various types of abusive language.
no code implementations • 13 Mar 2018 • Tim Isbister, Magnus Sahlgren, Lisa Kaati, Milan Obaidi, Nazar Akrami
Hateful comments, swearwords and sometimes even death threats are becoming a reality for many people today in online environments.