Search Results for author: Tosin P. Adewumi

Found 5 papers, 4 papers with code

Potential Idiomatic Expression (PIE)-English: Corpus for Classes of Idioms

2 code implementations LREC 2022 Tosin P. Adewumi, Roshanak Vadoodi, Aparajita Tripathy, Konstantina Nikolaidou, Foteini Liwicki, Marcus Liwicki

The challenges with NLP systems with regards to tasks such as Machine Translation (MT), word sense disambiguation (WSD) and information retrieval make it imperative to have a labelled idioms dataset with classes such as it is in this work.

Information Retrieval Machine Translation +5

The Challenge of Diacritics in Yoruba Embeddings

1 code implementation15 Nov 2020 Tosin P. Adewumi, Foteini Liwicki, Marcus Liwicki

The major contributions of this work include the empirical establishment of a better performance for Yoruba embeddings from undiacritized (normalized) dataset and provision of new analogy sets for evaluation.

Corpora Compared: The Case of the Swedish Gigaword & Wikipedia Corpora

no code implementations6 Nov 2020 Tosin P. Adewumi, Foteini Liwicki, Marcus Liwicki

In this work, we show that the difference in performance of embeddings from differently sourced data for a given language can be due to other factors besides data size.

Exploring Swedish & English fastText Embeddings for NER with the Transformer

1 code implementation23 Jul 2020 Tosin P. Adewumi, Foteini Liwicki, Marcus Liwicki

To achieve a good network performance in natural language processing (NLP) downstream tasks, several factors play important roles: dataset size, the right hyper-parameters, and well-trained embeddings.

named-entity-recognition Named Entity Recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.