no code implementations • WS 2019 • Lasha Abzianidze, Rik van Noord, Hessel Haagsma, Johan Bos
To measure similarity between two DRSs, they are represented in a clausal form, i. e. as a set of tuples.
no code implementations • LREC 2020 • Hessel Haagsma, Johan Bos, Malvina Nissim
Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom processing and linguistic analysis by creating the largest-to-date corpus of idioms for English.
no code implementations • 20 Nov 2019 • Hessel Haagsma, Malvina Nissim, Johan Bos
To further progress on the extraction and disambiguation of potentially idiomatic expressions, larger corpora of PIEs are required.
no code implementations • COLING 2018 • Hessel Haagsma, Malvina Nissim, Johan Bos
Disambiguation of potentially idiomatic expressions involves determining the sense of a potentially idiomatic expression in a given context, e. g. determining that make hay in {`}Investment banks made hay while takeovers shone.
no code implementations • SEMEVAL 2018 • Daphne Groot, R{\'e}mon Kruizinga, Hennie Veldthuis, Simon de Wit, Hessel Haagsma
We present a system for emoji prediction on English and Spanish tweets, prepared for the SemEval-2018 task on Multilingual Emoji Prediction.
2 code implementations • LREC 2018 • Rik van Noord, Lasha Abzianidze, Hessel Haagsma, Johan Bos
A pilot study is performed to automatically find changes in meaning by comparing meaning representations of translations.
no code implementations • 12 Jul 2017 • Angelo Basile, Gareth Dwyer, Maria Medvedeva, Josine Rawee, Hessel Haagsma, Malvina Nissim
We describe our participation in the PAN 2017 shared task on Author Profiling, identifying authors' gender and language variety for English, Spanish, Arabic and Portuguese.
1 code implementation • EACL 2017 • Lasha Abzianidze, Johannes Bjerva, Kilian Evang, Hessel Haagsma, Rik van Noord, Pierre Ludmann, Duc-Duy Nguyen, Johan Bos
The Parallel Meaning Bank is a corpus of translations annotated with shared, formal meaning representations comprising over 11 million words divided over four languages (English, German, Italian, and Dutch).