no code implementations • LREC 2022 • Michael Rosner, Sina Ahmadi, Elena-Simona Apostol, Julia Bosque-Gil, Christian Chiarcos, Milan Dojchinovski, Katerina Gkirtzou, Jorge Gracia, Dagmar Gromann, Chaya Liebeskind, Giedrė Valūnaitė Oleškevičienė, Gilles Sérasset, Ciprian-Octavian Truică
In this paper, we provide an overview of current technologies for cross-lingual link discovery, and we discuss challenges, experiences and prospects of their application to under-resourced languages.
no code implementations • EACL (GWC) 2021 • Sina Ahmadi, John P. McCrae
Words are defined based on their meanings in various ways in different resources.
1 code implementation • EMNLP (NLPOSS) 2020 • Sina Ahmadi
Despite the recent advances in applying language-independent approaches to various natural language processing tasks thanks to artificial intelligence, some language-specific tools are still essential to process a language in a viable manner.
1 code implementation • VarDial (COLING) 2020 • Sina Ahmadi
The Zaza–Gorani language family is a linguistic subgroup of the Northwestern Iranian languages for which there is no significant corpus available.
1 code implementation • VarDial (COLING) 2020 • Sina Ahmadi
We demonstrate how the morphological complexity of the language along with the lack of a unified orthography can be efficiently addressed in tokenization.
no code implementations • LREC 2022 • Nadhem Zmandar, Tobias Daudert, Sina Ahmadi, Mahmoud El-Haj, Paul Rayson
Natural Language Processing is increasingly being applied in the finance and business industry to analyse the text of many different types of financial documents.
no code implementations • 26 May 2023 • Md Mahfuz ibn Alam, Sina Ahmadi, Antonios Anastasopoulos
Neural machine translation (NMT) systems exhibit limited robustness in handling source-side linguistic variations.
no code implementations • 25 May 2023 • Sina Ahmadi, Antonios Anastasopoulos
The wide accessibility of social media has provided linguistically under-represented communities with an extraordinary opportunity to create content in their native languages.
1 code implementation • 10 Apr 2023 • Razhan Hameed, Sina Ahmadi, Fatemeh Daneshfar
Sentiment analysis is the process of identifying and extracting subjective information from text.
1 code implementation • 3 Apr 2023 • Sina Ahmadi, Milind Agarwal, Antonios Anastasopoulos
The Perso-Arabic scripts are a family of scripts that are widely adopted and used by various linguistic communities around the globe.
1 code implementation • 3 Apr 2023 • Sina Ahmadi, Zahra Azin, Sara Belelli, Antonios Anastasopoulos
One of the major challenges that under-represented and endangered language communities face in language technology is the lack or paucity of language data.
no code implementations • 6 Sep 2022 • Sina Ahmadi
This is a challenging task, especially due to differences in sense granularity, coverage and description in two resources.
1 code implementation • 14 Sep 2021 • Sina Ahmadi
Spell checking and morphological analysis are two fundamental tasks in text and natural language processing and are addressed in the early stages of the development of language technology.
no code implementations • 8 Sep 2021 • Sina Ahmadi
Sorani Kurdish, also known as Central Kurdish, has a complex morphology, particularly due to the patterns in which morphemes appear.
1 code implementation • LoResMT (AACL) 2020 • Sina Ahmadi, Mariam Masoud
Machine translation is the task of translating texts from one language to another using computers.
1 code implementation • 4 Oct 2020 • Sina Ahmadi, Hossein Hassani, Daban Q. Jaff
We present a corpus containing 12,327 translation pairs in the two major dialects of Kurdish, Sorani and Kurmanji.
no code implementations • 21 May 2020 • Sina Ahmadi, Hossein Hassani
Morphological analysis is the study of the formation and structure of words.
1 code implementation • LREC 2020 • Sina Ahmadi, Hossein Hassani, Kamaladdin Abedi
We believe that this corpus contributes to Kurdish language processing in several ways, such as compensation for the lack of a long history of written text by incorporating oral literature, presenting an unexplored realm in Kurdish language processing, and assisting the initiation of Kurdish computational folkloristics.
no code implementations • LREC 2020 • Ana Salgado, Sina Ahmadi, Alberto Simões, John Philip McCrae, Rute Costa
Word sense alignment involves searching for matching senses within dictionary entries of different lexical resources and linking them, which poses significant challenges.
1 code implementation • LREC 2020 • Patricia Martín-Chozas, Sina Ahmadi, Elena Montiel-Ponsoda
In this paper we present an approach to validate terminological data retrieved from open encyclopaedic knowledge bases.
1 code implementation • LREC 2020 • Sina Ahmadi, John Philip McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, Thomas Troelsgård, Sussi Olsen, Simon Krek, Veronika Lipp, Tamás Váradi, László Simon, András Gyorffy, Carole Tiberius, Tanneke Schoonheim, Yifat Ben Moshe, Maya Rudich, Raya Abu Ahmad, Dorielle Lonke, Kira Kovalenko, Margit Langemets, Jelena Kallas, Oksana Dereza, Theodorus Fransen, David Cillessen, David Lindemann, Mikel Alonso, Ana Salgado, José Luis Sancho, Rafael-J. Ureña-Ruiz, Jordi Porta Zamorano, Kiril Simov, Petya Osenova, Zara Kancheva, Ivaylo Radev, Ranka Stanković, Andrej Perdih, Dejan Gabrovsek
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography.
1 code implementation • WS 2019 • Roshna Omer Abdulrahman, Hossein Hassani, Sina Ahmadi
Kurdish is a less-resourced language consisting of different dialects written in various scripts.
1 code implementation • 26 Nov 2018 • Sina Ahmadi
In this article, we present a rule-based approach for transliterating between the two most widely used orthographies of Sorani Kurdish.
no code implementations • 9 Oct 2018 • Sina Ahmadi
Morphological declension, which aims to inflect nouns to indicate number, case and gender, is an important task in natural language processing (NLP).
no code implementations • 27 Sep 2018 • Shahin Salavati, Sina Ahmadi
This paper presents a lemmatization system and a word-level error correction system for Sorani Kurdish.