Search Results for author: Seid Muhie Yimam

Found 34 papers, 11 papers with code

Elvis vs. M. Jackson: Who has More Albums? Classification and Identification of Elements in Comparative Questions

no code implementations LREC 2022 Meriem Beloucif, Seid Muhie Yimam, Steffen Stahlhacke, Chris Biemann

Comparative Question Answering (cQA) is the task of providing concrete and accurate responses to queries such as: “Is Lyft cheaper than a regular taxi?” or “What makes a mortgage different from a regular loan?”.

Binary Classification, Question Answering

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

1 code implementation 13 Apr 2023 Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Sa'id Ahmad, Nedjma Ousidhoum, Abinew Ayele, Saif M. Mohammad, Meriem Beloucif, Sebastian Ruder

We present the first Africentric SemEval Shared task, Sentiment Analysis for African Languages (AfriSenti-SemEval). The dataset is available at https://github.com/afrisenti-semeval/afrisent-semeval-2023.

Classification, Sentiment Analysis +2

Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities

1 code implementation 25 Mar 2023 Atnafu Lambebo Tonja, Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele, Moges Ahmed Mehamed, Olga Kolesnikova, Seid Muhie Yimam

This survey delves into the current state of natural language processing (NLP) for four Ethiopian languages: Amharic, Afaan Oromo, Tigrinya, and Wolaytta.

ARDIAS: AI-Enhanced Research Management, Discovery, and Advisory System

no code implementations 25 Jan 2023 Debayan Banerjee, Seid Muhie Yimam, Sushil Awale, Chris Biemann

In this work, we present ARDIAS, a web-based application that aims to provide researchers with a full suite of discovery and collaboration tools.


The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation

2 code implementations 27 Oct 2022 Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov, Alexander Gelbukh

Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another.

Machine Translation, Translation

SCoT: Sense Clustering over Time: a tool for the analysis of lexical change

no code implementations EACL 2021 Christian Haase, Saba Anwar, Seid Muhie Yimam, Alexander Friedrich, Chris Biemann

There are two main approaches to the exploration of dynamic networks: the discrete one compares a series of clustered graphs from separate points in time.


How Hateful are Movies? A Study and Prediction on Movie Subtitles

1 code implementation KONVENS (WS) 2021 Niklas von Boguszewski, Sana Moin, Anirban Bhowmick, Seid Muhie Yimam, Chris Biemann

Hence, we show that transfer learning from the social media domain is efficacious in classifying hate and offensive speech in movies through subtitles.

Domain Adaptation, Transfer Learning

ActiveAnno: General-Purpose Document-Level Annotation Tool with Active Learning Integration

no code implementations NAACL 2021 Max Wiechmann, Seid Muhie Yimam, Chris Biemann

ActiveAnno is built with extensible design and easy deployment in mind, all to enable users to perform annotation tasks with high efficiency and high-quality annotation results.

Active Learning

Word Complexity is in the Eye of the Beholder

no code implementations NAACL 2021 Sian Gooding, Ekaterina Kochmar, Seid Muhie Yimam, Chris Biemann

Lexical complexity is a highly subjective notion, yet this factor is often neglected in lexical simplification and readability systems which use a "one-size-fits-all" approach.

Lexical Simplification

MasakhaNER: Named Entity Recognition for African Languages

2 code implementations 22 Mar 2021 David Ifeoluwa Adelani, Jade Abbott, Graham Neubig, Daniel D'souza, Julia Kreutzer, Constantine Lignos, Chester Palen-Michel, Happy Buzaaba, Shruti Rijhwani, Sebastian Ruder, Stephen Mayhew, Israel Abebe Azime, Shamsuddeen Muhammad, Chris Chinenye Emezue, Joyce Nakatumba-Nabende, Perez Ogayo, Anuoluwapo Aremu, Catherine Gitau, Derguene Mbaye, Jesujoba Alabi, Seid Muhie Yimam, Tajuddeen Gwadabe, Ignatius Ezeani, Rubungo Andre Niyongabo, Jonathan Mukiibi, Verrah Otiende, Iroro Orife, Davis David, Samba Ngom, Tosin Adewumi, Paul Rayson, Mofetoluwa Adeyemi, Gerald Muriuki, Emmanuel Anebi, Chiamaka Chukwuneke, Nkiruka Odu, Eric Peter Wairagala, Samuel Oyerinde, Clemencia Siro, Tobius Saul Bateesa, Temilola Oloyede, Yvonne Wambui, Victor Akinode, Deborah Nabagereka, Maurice Katusiime, Ayodele Awokoya, Mouhamadane MBOUP, Dibora Gebreyohannes, Henok Tilaye, Kelechi Nwaike, Degaga Wolde, Abdoulaye Faye, Blessing Sibanda, Orevaoghene Ahia, Bonaventure F. P. Dossou, Kelechi Ogueji, Thierno Ibrahima DIOP, Abdoulaye Diallo, Adewale Akinfaderin, Tendai Marengereke, Salomey Osei

We take a step towards addressing the under-representation of the African continent in NLP research by creating the first large publicly available high-quality dataset for named entity recognition (NER) in ten African languages, bringing together a variety of stakeholders.

Named Entity Recognition +2

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

6 code implementations 18 Dec 2020 Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

We also observe that models that utilize human rationales for training perform better at reducing unintended bias towards target communities.

Hate Speech Detection, Text Classification

Exploring Amharic Sentiment Analysis from Social Media Texts: Building Annotation Tools and Classification Models

no code implementations COLING 2020 Seid Muhie Yimam, Hizkiel Mitiku Alemayehu, Abinew Ayele, Chris Biemann

To advance the sentiment analysis research in Amharic and other related low-resource languages, we release the dataset, the annotation tool, source code, and models publicly under a permissive.

Decision Making, Sentiment Analysis +1

Analysis of the Ethiopic Twitter Dataset for Abusive Speech in Amharic

no code implementations 9 Dec 2019 Seid Muhie Yimam, Abinew Ali Ayele, Chris Biemann

Since several languages can be written using the Fidel script, we have used the existing Amharic, Tigrinya and Ge'ez corpora to retain only the Amharic tweets.

A Multilingual Information Extraction Pipeline for Investigative Journalism

no code implementations EMNLP 2018 Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann

We introduce an advanced information extraction pipeline to automatically process very large collections of unstructured textual data for the purpose of investigative journalism.

Entity Extraction using GAN

New/s/leak 2.0 - Multilingual Information Extraction and Visualization for Investigative Journalism

no code implementations 13 Jul 2018 Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann

Investigative journalism in recent years is confronted with two major challenges: 1) vast amounts of unstructured data originating from large text collections such as leaks or answers to Freedom of Information requests, and 2) multi-lingual data due to intensified global cooperation and communication in politics, business and civil society.

Efficient Exploration

Par4Sim -- Adaptive Paraphrasing for Text Simplification

no code implementations COLING 2018 Seid Muhie Yimam, Chris Biemann

Learning from a real-world data stream and continuously updating the model without explicit supervision is a new challenge for NLP applications with machine learning components.

Learning-To-Rank, Text Simplification
