no code implementations • 4 Mar 2024 • Jonathan Lautenschlager, Emma Sköldberg, Simon Hengchen, Dominik Schlechtweg
This study addresses the task of Unknown Sense Detection in English and Swedish.
1 code implementation • EMNLP 2021 • Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky, Barbara McGillivray
Word meaning is notoriously difficult to capture, both synchronically and diachronically.
no code implementations • NoDaLiDa 2021 • Simon Hengchen, Nina Tahmasebi
Language models are notoriously difficult to evaluate.
1 code implementation • 22 Jan 2021 • Valerio Perrone, Simon Hengchen, Marco Palma, Alessandro Vatri, Jim Q. Smith, Barbara McGillivray
In this chapter we build on GASC, a recent computational approach to semantic change based on a dynamic Bayesian mixture model.
no code implementations • 19 Jan 2021 • Simon Hengchen, Nina Tahmasebi, Dominik Schlechtweg, Haim Dubossarsky
The computational study of lexical semantic change (LSC) has taken off in the past few years and we are seeing increasing interest in the field, from both computational sciences and linguistics.
no code implementations • 20 Nov 2020 • Jani Marjanen, Elaine Zosa, Simon Hengchen, Lidia Pivovarova, Mikko Tolonen
This paper addresses methodological issues in diachronic data analysis for historical research.
1 code implementation • NoDaLiDa 2021 • Quan Duong, Mika Hämäläinen, Simon Hengchen
Historical corpora are known to contain errors introduced by OCR (optical character recognition) methods used in the digitization process, often said to be degrading the performance of NLP systems.
2 code implementations • SEMEVAL 2020 • Dominik Schlechtweg, Barbara McGillivray, Simon Hengchen, Haim Dubossarsky, Nina Tahmasebi
Lexical Semantic Change detection, i. e., the task of identifying words that change meaning over time, is a very active research area, with applications in NLP, lexicography, and linguistics.
no code implementations • LREC 2020 • Esteban Frossard, Mickael Coustaty, Antoine Doucet, Adam Jatowt, Simon Hengchen
Languages change over time and, thanks to the abundance of digital corpora, their evolutionary analysis using computational techniques has recently gained much research attention.
1 code implementation • RANLP 2019 • Mika Hämäläinen, Simon Hengchen
A great deal of historical corpora suffer from errors introduced by the OCR (optical character recognition) methods used in the digitization process.
1 code implementation • ACL 2019 • Haim Dubossarsky, Simon Hengchen, Nina Tahmasebi, Dominik Schlechtweg
State-of-the-art models of lexical semantic change detection suffer from noise stemming from vector space alignment.
no code implementations • WS 2019 • Valerio Perrone, Marco Palma, Simon Hengchen, Alessandro Vatri, Jim Q. Smith, Barbara McGillivray
Word meaning changes over time, depending on linguistic and extra-linguistic factors.