Search Results for author: Simon Dobnik

Found 37 papers, 3 papers with code

How Vision Affects Language: Comparing Masked Self-Attention in Uni-Modal and Multi-Modal Transformer

no code implementations • ACL (mmsr, IWCS) 2021 • Nikolai Ilinykh, Simon Dobnik

In this paper, we examine masked self-attention in a multi-modal transformer trained for the task of image captioning.

Paper
Add Code

Fine-grained Entailment: Resources for Greek NLI and Precise Entailment

1 code implementation • DCLRL (LREC) 2022 • Eirini Amanaki, Jean-Philippe Bernardy, Stergios Chatzikyriakidis, Robin Cooper, Simon Dobnik, Aram Karimi, Adam Ek, Eirini Chrysovalantou Giannikouri, Vasiliki Katsouli, Ilias Kolokousis, Eirini Chrysovalantou Mamatzaki, Dimitrios Papadakis, Olga Petrova, Erofili Psaltaki, Charikleia Soupiona, Effrosyni Skoulataki, Christina Stefanidou

First, we extend the Greek version of the FraCaS test suite to include examples where the inference is directly linked to the syntactic/morphological properties of Greek.

Natural Language Inference RTE

Paper
Code

Annotating anaphoric phenomena in situated dialogue

no code implementations • ACL (mmsr, IWCS) 2021 • Sharid Loáiciga, Simon Dobnik, David Schlangen

With this paper, we intend to start a discussion on the annotation of referential phenomena in situated dialogue.

coreference-resolution

Paper
Add Code

When an Image Tells a Story: The Role of Visual and Semantic Information for Generating Paragraph Descriptions

no code implementations • INLG (ACL) 2020 • Nikolai Ilinykh, Simon Dobnik

Generating multi-sentence image descriptions is a challenging task, which requires a good model to produce coherent and accurate paragraphs, describing salient objects in the image.

Ranked #10 on Image Paragraph Captioning on Image Paragraph Captioning

Image Paragraph Captioning Sentence

Paper
Add Code

Pre-trained Models or Feature Engineering: The Case of Dialectal Arabic

no code implementations • OSACT (LREC) 2022 • Kathrein Abu Kwaik, Stergios Chatzikyriakidis, Simon Dobnik

The usage of social media platforms has resulted in the proliferation of work on Arabic Natural Language Processing (ANLP), including the development of resources.

Dialect Identification Feature Engineering +2

Paper
Add Code

Reference and coreference in situated dialogue

no code implementations • NAACL (ALVR) 2021 • Sharid Loáiciga, Simon Dobnik, David Schlangen

We argue that there is still significant room for corpora that increase the complexity of both visual and linguistic domains and which capture different varieties of perceptual and conversational contexts.

Paper
Add Code

Attention as Grounding: Exploring Textual and Cross-Modal Attention on Entities and Relations in Language-and-Vision Transformer

1 code implementation • Findings (ACL) 2022 • Nikolai Ilinykh, Simon Dobnik

We explore how a multi-modal transformer trained for generation of longer image descriptions learns syntactic and semantic representations about entities and relations grounded in objects at the level of masked self-attention (text generation) and cross-modal attention (information fusion).

Text Generation Visual Grounding

Paper
Code

In Search of Meaning and Its Representations for Computational Linguistics

no code implementations • CLASP 2022 • Simon Dobnik, Robin Cooper, Adam Ek, Bill Noble, Staffan Larsson, Nikolai Ilinykh, Vladislav Maraev, Vidya Somashekarappa

In this paper we examine different meaning representations that are commonly used in different natural language applications today and discuss their limits, both in terms of the aspects of the natural language meaning they are modelling and in terms of the aspects of the application for which they are used.

Paper
Add Code

Examining the Effects of Language-and-Vision Data Augmentation for Generation of Descriptions of Human Faces

no code implementations • PVLAM (LREC) 2022 • Nikolai Ilinykh, Rafal Černiavski, Eva Elžbieta Sventickaitė, Viktorija Buzaitė, Simon Dobnik

We conclude that face description generation systems are more susceptible to language rather than vision data augmentation.

Caption Generation Data Augmentation +1

Paper
Add Code

Fast visual grounding in interaction: bringing few-shot learning with neural networks to an interactive robot

no code implementations • PaM 2020 • José Miguel Cano Santín, Simon Dobnik, Mehdi Ghanimifard

The major shortcomings of using neural networks with situated agents are that in incremental interaction very few learning examples are available and that their visual sensory representations are quite different from image caption datasets.

Few-Shot Learning Transfer Learning +1

Paper
Add Code

Anaphoric Phenomena in Situated dialog: A First Round of Annotations

no code implementations • COLING (CRAC) 2022 • Sharid Loáiciga, Simon Dobnik, David Schlangen

We present a first release of 500 documents from the multimodal corpus Tell-me-more (Ilinykh et al., 2019) annotated with coreference information according to the ARRAU guidelines (Poesio et al., 2021).

Paper
Add Code

Grandma Karl is 27 years old -- research agenda for pseudonymization of research data

no code implementations • 30 Aug 2023 • Elena Volodina, Simon Dobnik, Therese Lindström Tiedemann, Xuan-Son Vu

Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e. g names or political opinions.

Paper
Add Code

Multi-task recommendation system for scientific papers with high-way networks

no code implementations • 21 Apr 2022 • Aram Karimi, Simon Dobnik

Finding and selecting the most relevant scientific papers from a large number of papers written in a research community is one of the key challenges for researchers these days.

Vocal Bursts Intensity Prediction

Paper
Add Code

We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics

no code implementations • 10 Sep 2021 • Simon Dobnik, Robin Cooper, Adam Ek, Bill Noble, Staffan Larsson, Nikolai Ilinykh, Vladislav Maraev, Vidya Somashekarappa

Paper
Add Code

Sky + Fire = Sunset. Exploring Parallels between Visually Grounded Metaphors and Image Classifiers

no code implementations • WS 2020 • Yuri Bizzoni, Simon Dobnik

This work explores the differences and similarities between neural image classifiers{'} mis-categorisations and visually grounded metaphors - that we could conceive as intentional mis-categorisations.

Paper
Add Code

An Arabic Tweets Sentiment Analysis Dataset (ATSAD) using Distant Supervision and Self Training

no code implementations • LREC 2020 • Kathrein Abu Kwaik, Stergios Chatzikyriakidis, Simon Dobnik, Motaz Saad, Richard Johansson

As the number of social media users increases, they express their thoughts, needs, socialise and publish their opinions reviews.

8k Arabic Sentiment Analysis

Paper
Add Code

Normalising Non-standardised Orthography in Algerian Code-switched User-generated Data

no code implementations • WS 2019 • Wafia Adouane, Jean-Philippe Bernardy, Simon Dobnik

We work with Algerian, an under-resourced non-standardised Arabic variety, for which we compile a new parallel corpus consisting of user-generated textual data matched with normalised and corrected human annotations following data-driven and our linguistically motivated standard.

Semantic Textual Similarity Spelling Correction

Paper
Add Code

What goes into a word: generating image descriptions with top-down spatial knowledge

no code implementations • WS 2019 • Mehdi Ghanimifard, Simon Dobnik

The aim of this paper is to evaluate what representations facilitate generating image descriptions with spatial relations and lead to better grounded language generation.

Language Modelling Text Generation +1

Paper
Add Code

Neural Models for Detecting Binary Semantic Textual Similarity for Algerian and MSA

no code implementations • WS 2019 • Wafia Adouane, Jean-Philippe Bernardy, Simon Dobnik

We explore the extent to which neural networks can learn to identify semantically equivalent sentences from a small variable dataset using an end-to-end training.

Semantic Textual Similarity

Paper
Add Code

Can Modern Standard Arabic Approaches be used for Arabic Dialects? Sentiment Analysis as a Case Study

no code implementations • WS 2019 • Chatrine Qwaider, Stergios Chatzikyriakidis, Simon Dobnik

Sentiment Analysis

Paper
Add Code

ImageTTR: Grounding Type Theory with Records in Image Classification for Visual Question Answering

no code implementations • WS 2019 • Arild Matsson, Simon Dobnik, Staffan Larsson

General Classification Image Classification +2

Paper
Add Code

What a neural language model tells us about spatial relations

1 code implementation • WS 2019 • Mehdi Ghanimifard, Simon Dobnik

Understanding and generating spatial descriptions requires knowledge about what objects are related, their functional interactions, and where the objects are geometrically located.

Language Modelling

Paper
Code

Modular Mechanistic Networks: On Bridging Mechanistic and Phenomenological Models with Deep Neural Networks in Natural Language Processing

no code implementations • 21 Jul 2018 • Simon Dobnik, John D. Kelleher

Natural language processing (NLP) can be done using either top-down (theory driven) and bottom-up (data driven) approaches, which we call mechanistic and phenomenological respectively.

Paper
Add Code

What is not where: the challenge of integrating spatial representations into deep learning architectures

no code implementations • 21 Jul 2018 • John D. Kelleher, Simon Dobnik

This paper examines to what degree current deep learning architectures for image caption generation capture spatial language.

Caption Generation Image Captioning +1

Paper
Add Code

Improving Neural Network Performance by Injecting Background Knowledge: Detecting Code-switching and Borrowing in Algerian texts

no code implementations • WS 2018 • Wafia Adouane, Jean-Philippe Bernardy, Simon Dobnik

We explore the effect of injecting background knowledge to different deep neural network (DNN) configurations in order to mitigate the problem of the scarcity of annotated data when applying these models on datasets of low-resourced languages.

Word Embeddings

Paper
Add Code

A Comparison of Character Neural Language Model and Bootstrapping for Language Identification in Multilingual Noisy Texts

no code implementations • WS 2018 • Wafia Adouane, Simon Dobnik, Jean-Philippe Bernardy, Nasredine Semmar

This paper seeks to examine the effect of including background knowledge in the form of character pre-trained neural language model (LM), and data bootstrapping to overcome the problem of unbalanced limited resources.

Language Identification Language Modelling +1

Paper
Add Code

Exploring the Functional and Geometric Bias of Spatial Relations Using Neural Language Models

no code implementations • WS 2018 • Simon Dobnik, Mehdi Ghanimifard, John Kelleher

The challenge for computational models of spatial descriptions for situated dialogue systems is the integration of information from different modalities.

Image Captioning

Paper
Add Code

Shami: A Corpus of Levantine Arabic Dialects

no code implementations • LREC 2018 • Kathrein Abu Kwaik, Motaz Saad, Stergios Chatzikyriakidis, Simon Dobnik

Language Identification

Paper
Add Code

KILLE: a Framework for Situated Agents for Learning Language Through Interaction

no code implementations • WS 2017 • Simon Dobnik, Erik de Graaf

Paper
Add Code

Identification of Languages in Algerian Arabic Multilingual Documents

no code implementations • WS 2017 • Wafia Adouane, Simon Dobnik

This paper presents a language identification system designed to detect the language of each word, in its context, in a multilingual documents as generated in social media by bilingual/multilingual communities, in our case speakers of Algerian Arabic.

Chunking General Classification +1