no code implementations • LREC 2022 • Ariel Ekgren, Amaru Cuba Gyllensten, Evangelia Gogoulou, Alice Heiman, Severine Verlinden, Joey Öhman, Fredrik Carlsson, Magnus Sahlgren
We present GPT-SW3, a 3.5 billion parameter autoregressive language model, trained on a newly created 100 GB Swedish corpus.
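As a rough illustration of how one might sample from an autoregressive model of this kind, the sketch below uses the Hugging Face transformers API; the checkpoint name is an illustrative assumption, not necessarily the identifier under which GPT-SW3 was released.

```python
# Minimal sketch: sampled generation from an autoregressive Swedish LM.
# The model id below is a placeholder assumption, for illustration only.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AI-Sweden-Models/gpt-sw3-356m"  # placeholder checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "Stockholm är"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=40,
    do_sample=True,
    temperature=0.8,
    top_p=0.95,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```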
no code implementations • WS (NoDaLiDa) 2019 • Magnus Sahlgren, Fredrik Olsson
This paper investigates the presence of gender bias in pretrained Swedish embeddings.
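One simple way to probe for this kind of bias (not necessarily the paper's own methodology) is to compare how close occupation words sit to gendered pronouns in the embedding space; the sketch below uses English GloVe vectors via gensim purely as a stand-in for the Swedish embeddings studied in the paper.

```python
# Minimal sketch of a simple bias probe: measure whether occupation words
# are closer to "he" than to "she" in a pretrained embedding space.
# English GloVe is used as a stand-in for the Swedish embeddings.
import gensim.downloader as api

vectors = api.load("glove-wiki-gigaword-100")  # downloads on first use

for occupation in ["nurse", "engineer", "teacher", "carpenter"]:
    gap = vectors.similarity(occupation, "he") - vectors.similarity(occupation, "she")
    print(f"{occupation:10s} he-vs-she similarity gap: {gap:+.3f}")
```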
1 code implementation • LREC 2022 • Fredrik Carlsson, Philipp Eisen, Faton Rekathati, Magnus Sahlgren
The long-standing endeavor of relating the textual and the visual domain recently underwent a pivotal breakthrough, as OpenAI released CLIP.
Ranked #4 on Zero-shot Image Retrieval on XTD10
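Zero-shot retrieval with a CLIP-style model amounts to scoring texts and images by similarity in a shared embedding space. The sketch below illustrates this with OpenAI's public English CLIP checkpoint as a stand-in (the paper's Swedish CLIP checkpoint name is not given here).

```python
# Minimal sketch of CLIP-style zero-shot retrieval: score a set of captions
# against one image via the shared text-image embedding space.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("photo.jpg")  # any local image
captions = ["a dog on a beach", "a city skyline at night", "a bowl of soup"]

inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# logits_per_image holds the image-text similarity scores (one row per image).
scores = outputs.logits_per_image.softmax(dim=-1)
best = scores.argmax(dim=-1).item()
print(f"best caption: {captions[best]!r}")
```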
no code implementations • EMNLP (MRQA) 2021 • Fredrik Carlsson, Magnus Sahlgren, Fredrik Olsson, Amaru Cuba Gyllensten
This paper introduces a long-range multiple-choice Question Answering (QA) dataset, based on full-length fiction book texts.
1 code implementation • ACL 2022 • Fredrik Carlsson, Joey Öhman, Fangyu Liu, Severine Verlinden, Joakim Nivre, Magnus Sahlgren
We propose a resource-efficient method for converting a pre-trained CLM into this architecture, and demonstrate its potential on various experiments, including the novel task of contextualized word inclusion.
no code implementations • NoDaLiDa 2021 • Abdul Aziz Alkathiri, Lodovico Giaretta, Sarunas Girdzijauskas, Magnus Sahlgren
Advanced NLP models require huge amounts of data from various domains to produce high-quality representations.
no code implementations • NoDaLiDa 2021 • Magnus Sahlgren, Fredrik Carlsson, Fredrik Olsson, Love Börjeson
When is it beneficial for a research community to organize a broader collaborative effort on a topic, and when should we instead promote individual efforts?
no code implementations • 22 May 2023 • Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Alice Heiman, Judit Casademont, Magnus Sahlgren
This paper details the process of developing the first native large generative language model for the Nordic languages, GPT-SW3.
no code implementations • 30 Mar 2023 • Joey Öhman, Severine Verlinden, Ariel Ekgren, Amaru Cuba Gyllensten, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Magnus Sahlgren
Pre-training Large Language Models (LLMs) requires massive amounts of text data, and the performance of the LLMs typically correlates with the scale and quality of the datasets.
2 code implementations • 11 Oct 2021 • Fredrik Olsson, Magnus Sahlgren
In this paper, we identify the state of data as being an important reason for failure in applied Natural Language Processing (NLP) projects.
no code implementations • LREC 2022 • Evangelia Gogoulou, Ariel Ekgren, Tim Isbister, Magnus Sahlgren
Additionally, the results of evaluating the transferred models in source language tasks reveal that their performance in the source domain deteriorates after transfer.
1 code implementation • 20 May 2021 • Alessandro Lenci, Magnus Sahlgren, Patrick Jeuniaux, Amaru Cuba Gyllensten, Martina Miliani
In this paper, we perform a comprehensive evaluation of type distributional vectors, either produced by static DSMs or obtained by averaging the contextualized vectors generated by BERT.
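One way to obtain such BERT-based type vectors is to average the contextualized vectors of a word over the sentences in which it occurs. The sketch below shows this under simplifying assumptions (the `type_vector` helper and the subword-matching heuristic are illustrative, not the paper's exact procedure).

```python
# Minimal sketch: build a "type-level" vector for a word by averaging its
# contextualized BERT vectors over a handful of sentences that contain it.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def type_vector(word, sentences):
    vectors = []
    word_ids = tokenizer(word, add_special_tokens=False)["input_ids"]
    for sent in sentences:
        enc = tokenizer(sent, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**enc).last_hidden_state[0]  # (seq_len, dim)
        # Collect hidden states of subword tokens belonging to `word`.
        for i, tok in enumerate(enc["input_ids"][0].tolist()):
            if tok in word_ids:
                vectors.append(hidden[i])
    return torch.stack(vectors).mean(dim=0)

vec = type_vector("bank", ["She sat by the river bank.", "The bank raised its rates."])
print(vec.shape)
```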
1 code implementation • NoDaLiDa 2021 • Tim Isbister, Fredrik Carlsson, Magnus Sahlgren
We demonstrate empirically that a large English language model coupled with modern machine translation outperforms native language models in most Scandinavian languages.
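The basic pipeline is translate-then-classify: machine-translate the Scandinavian input into English, then apply an English model. The sketch below illustrates the idea; the model choices are illustrative assumptions, not those evaluated in the paper.

```python
# Minimal sketch of the translate-then-apply-an-English-model idea:
# translate Swedish input to English, then run an English classifier.
# Model names are illustrative choices, not those used in the paper.
from transformers import pipeline

translate = pipeline("translation", model="Helsinki-NLP/opus-mt-sv-en")
classify = pipeline("sentiment-analysis")  # default English sentiment model

swedish_text = "Filmen var helt fantastisk, jag älskade den."
english_text = translate(swedish_text)[0]["translation_text"]
print(english_text, classify(english_text))
```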
no code implementations • 19 Apr 2021 • Daniel Garcia Bernal, Lodovico Giaretta, Sarunas Girdzijauskas, Magnus Sahlgren
The results show that neither the quality of the results nor the convergence time in Federated Word2Vec deteriorates as compared to centralised Word2Vec.
no code implementations • EACL 2021 • Evangelia Gogoulou, Magnus Boman, Fehmi ben Abdesslem, Nils Hentati Isacsson, Viktor Kaldo, Magnus Sahlgren
We investigate the feasibility of applying standard text categorisation methods to patient text in order to predict treatment outcome in Internet-based cognitive behavioural therapy.
no code implementations • 8 Feb 2021 • Magnus Sahlgren, Fredrik Carlsson
By contrast, we will argue that there are many different types of language use, meaning and understanding, and that (current) language models are built with the explicit purpose of acquiring and representing one type of structural understanding of language.
1 code implementation • ICLR 2021 • Fredrik Carlsson, Amaru Cuba Gyllensten, Evangelia Gogoulou, Erik Ylipää Hellqvist, Magnus Sahlgren
Extracting semantically useful natural language sentence representations from pre-trained deep neural networks such as Transformers remains a challenge.
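A common baseline for this problem (not the method proposed in the paper) is to mean-pool the final-layer token vectors of a pre-trained Transformer into a fixed-size sentence representation, as in the sketch below.

```python
# Minimal sketch of a common baseline: mean-pool the final-layer token
# vectors of a pre-trained Transformer to get a sentence representation.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def embed(sentence):
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state  # (1, seq_len, dim)
    mask = enc["attention_mask"].unsqueeze(-1)   # (1, seq_len, 1)
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)

a, b = embed("A cat sits on the mat."), embed("A kitten rests on a rug.")
print(torch.cosine_similarity(a, b).item())
```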
no code implementations • SEMEVAL 2020 • Amaru Cuba Gyllensten, Evangelia Gogoulou, Ariel Ekgren, Magnus Sahlgren
We (Team Skurt) propose a simple method to detect lexical semantic change by clustering contextualized embeddings produced by XLM-R, using K-Means++.
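The sketch below illustrates the general recipe, under simplifying assumptions: embed each occurrence of a target word with XLM-R and cluster the occurrence vectors with k-means++ initialisation; comparing cluster distributions across time periods is then one way to flag semantic change. Sentence-level mean pooling is used here as a simplification.

```python
# Minimal sketch: cluster contextualized occurrences of a target word with
# k-means (k-means++ initialisation) as a signal of lexical semantic change.
import numpy as np
import torch
from sklearn.cluster import KMeans
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModel.from_pretrained("xlm-roberta-base")

def occurrence_vectors(sentences):
    """One mean-pooled vector per sentence containing the target word."""
    vecs = []
    for sent in sentences:
        enc = tokenizer(sent, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**enc).last_hidden_state[0]
        vecs.append(hidden.mean(dim=0).numpy())
    return np.stack(vecs)

sentences = ["The mouse ran under the table.", "Click the left mouse button.",
             "A field mouse nests in the barn.", "Move the mouse to the icon."]
X = occurrence_vectors(sentences)
labels = KMeans(n_clusters=2, init="k-means++", n_init=10).fit_predict(X)
print(labels)
```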
no code implementations • Findings of the Association for Computational Linguistics 2020 • Magnus Sahlgren
This paper problematizes the reliance on documents as the basic notion for defining term interactions in standard topic models.
1 code implementation • 7 Sep 2020 • Tim Isbister, Magnus Sahlgren
This paper presents the first Swedish evaluation benchmark for textual semantic similarity.
2 code implementations • 4 Sep 2020 • Fredrik Olsson, Magnus Sahlgren
This document concerns data readiness in the context of machine learning and Natural Language Processing.
no code implementations • LREC 2020 • Fredrik Olsson, Magnus Sahlgren, Fehmi ben Abdesslem, Ariel Ekgren, Kristine Eck
We cast the problem of event annotation as one of text categorization, and compare state-of-the-art text categorization techniques on event data produced within the Uppsala Conflict Data Program (UCDP).
no code implementations • WS 2018 • Amaru Cuba Gyllensten, Magnus Sahlgren
Sentiment and topic analysis are common methods used for social media monitoring.
no code implementations • WS 2018 • Magnus Sahlgren, Tim Isbister, Fredrik Olsson
This paper discusses whether it is possible to learn a generic representation that is useful for detecting various types of abusive language.
1 code implementation • WS 2019 • Ariel Ekgren, Amaru Cuba Gyllensten, Magnus Sahlgren
This paper investigates data-driven segmentation using Re-Pair or Byte Pair Encoding techniques.
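For context, a single Byte Pair Encoding step merges the most frequent adjacent symbol pair in the corpus into a new symbol; repeating this yields a data-driven subword vocabulary. The sketch below is a toy illustration of that merge loop, not the paper's implementation.

```python
# Minimal sketch of Byte Pair Encoding merges: repeatedly find the most
# frequent adjacent symbol pair and merge it into a single symbol.
from collections import Counter

def most_frequent_pair(corpus):
    """corpus: list of words, each a list of symbols."""
    pairs = Counter()
    for word in corpus:
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += 1
    return pairs.most_common(1)[0][0] if pairs else None

def merge(corpus, pair):
    merged = []
    for word in corpus:
        out, i = [], 0
        while i < len(word):
            if i + 1 < len(word) and (word[i], word[i + 1]) == pair:
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged.append(out)
    return merged

corpus = [list("lower"), list("lowest"), list("newer"), list("wider")]
for _ in range(3):  # perform three merges
    corpus = merge(corpus, most_frequent_pair(corpus))
print(corpus)
```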
no code implementations • 13 Mar 2018 • Tim Isbister, Magnus Sahlgren, Lisa Kaati, Milan Obaidi, Nazar Akrami
Hateful comments, swearwords and sometimes even death threats are becoming a reality for many people today in online environments.
no code implementations • LREC 2018 • Amaru Cuba Gyllensten, Magnus Sahlgren
This paper is a short empirical study of the performance of centrality and classification based iterative term set expansion methods for distributional semantic models.
no code implementations • WS 2016 • Maria Skeppstedt, Magnus Sahlgren, Carita Paradis, Andreas Kerren
This larger variation was also reflected in the lower recall achieved by the lexicon-based approach for sentiment than for the categories speculation, contrast, and condition.
no code implementations • EMNLP 2016 • Magnus Sahlgren, Alessandro Lenci
This paper investigates the effects of data size and frequency range on distributional semantic models.
no code implementations • LREC 2016 • Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan, Anders Holst
This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 20 different languages.
no code implementations • EMNLP 2015 • Amaru Cuba Gyllensten, Magnus Sahlgren
We also argue that the topology of the neighborhoods in semantic space can be used to determine the semantic horizon of a point, which we define as the set of neighbors that have a direct connection to the point.