no code implementations • CLTW (LREC) 2022 • Mahmoud El-Haj, Ignatius Ezeani, Jonathan Morris, Dawn Knight
As part of the effort to increase the availability of Welsh digital technology, this paper introduces the first human vs metrics Welsh summarisation evaluation results and dataset, which we provide freely for research purposes to help advance the work on Welsh summarisation.
no code implementations • OSACT (LREC) 2022 • Mahmoud El-Haj, Elvis de Souza, Nouran Khallaf, Paul Rayson, Nizar Habash
This paper presents AraSAS, the first open-source Arabic semantic analysis tagging system.
no code implementations • FNP (LREC) 2022 • Dominique Mariko, Hanna Abi-Akl, Kim Trottier, Mahmoud El-Haj
We present the FinCausal 2020 Shared Task on Causality Detection in Financial Documents and the associated FinCausal dataset, and discuss the participating systems and results.
no code implementations • FNP (LREC) 2022 • Juyeon Kang, Abderrahim Ait Azzi, Sandra Bellato, Blanca Carbajo Coronado, Mahmoud El-Haj, Ismail El Maarouf, Mei Gan, Ana Gisbert, Antonio Moreno Sandoval
This paper describes the FinTOC-2022 Shared Task on structure extraction from financial documents, its participants' results and their findings.
no code implementations • FNP (LREC) 2022 • Mahmoud El-Haj, Andrew Ogden
This paper describes the HTAC system submitted to the Financial Narrative Summarization Shared Task (FNS-2022).
no code implementations • FNP (LREC) 2022 • Mahmoud El-Haj, Nadhem Zmandar, Paul Rayson, Ahmed Abura’Ed, Marina Litvak, Nikiforos Pittaras, George Giannakopoulos, Aris Kosmopoulos, Blanca Carbajo-Coronado, Antonio Moreno-Sandoval
This paper presents the results and findings of the Financial Narrative Summarisation Shared Task on summarising UK, Greek and Spanish annual reports.
1 code implementation • LREC 2022 • Chiamaka Chukwuneke, Ignatius Ezeani, Paul Rayson, Mahmoud El-Haj
Our results show that, although the IgboNER task benefited hugely from fine-tuning a large transformer model, fine-tuning a transformer model built from scratch with comparatively little Igbo text data seems to yield quite decent results for the IgboNER task.
no code implementations • LREC 2022 • Nadhem Zmandar, Tobias Daudert, Sina Ahmadi, Mahmoud El-Haj, Paul Rayson
Natural Language Processing is increasingly being applied in the finance and business industry to analyse the text of many different types of financial documents.
no code implementations • FNP (COLING) 2020 • Dominique Mariko, Hanna Abi-Akl, Estelle Labidurie, Stephane Durfort, Hugues de Mazancourt, Mahmoud El-Haj
We present the FinCausal 2020 Shared Task on Causality Detection in Financial Documents and the associated FinCausal dataset, and discuss the participating systems and results.
no code implementations • FNP (COLING) 2020 • Najah-Imane Bentabet, Rémi Juge, Ismail El Maarouf, Virginie Mouilleron, Dialekti Valsamou-Stanislawski, Mahmoud El-Haj
This paper presents the FinTOC-2020 Shared Task on structure extraction from financial documents, its participants' results and their findings.
no code implementations • FNP (COLING) 2020 • Mahmoud El-Haj, Ahmed Abura’Ed, Marina Litvak, Nikiforos Pittaras, George Giannakopoulos
This paper presents the results and findings of the Financial Narrative Summarisation shared task (FNS 2020) on summarising UK annual reports.
no code implementations • 23 Jul 2022 • Daniel F. O. Onah, Elaine L. L. Pang, Mahmoud El-Haj
The topic modelling result shows prevalence within topics 1 and 2.
1 code implementation • LREC 2022 • Ignatius Ezeani, Mahmoud El-Haj, Jonathan Morris, Dawn Knight
Welsh is an official language in Wales and is spoken by an estimated 884,300 people (29.2% of the population of Wales).
no code implementations • 4 Dec 2020 • Dominique Mariko, Hanna Abi Akl, Estelle Labidurie, Stéphane Durfort, Hugues de Mazancourt, Mahmoud El-Haj
We present the FinCausal 2020 Shared Task on Causality Detection in Financial Documents and the associated FinCausal dataset, and discuss the participating systems and results.
no code implementations • LREC 2020 • Mahmoud El-Haj
This was achieved using a word-based Convolutional Neural Network (CNN) utilising a Continuous Bag of Words (CBOW) word embeddings model.
no code implementations • LREC 2020 • Mahmoud El-Haj, Nathan Rutherford, Matthew Coole, Ignatius Ezeani, Sheryl Prentice, Nancy Ide, Jo Knight, Scott Piao, John Mariani, Paul Rayson, Keith Suderman
The corpus database is distributed to permit fast indexing, and provides a simple web front-end with corpus linguistics methods for sub-corpus comparison and retrieval.
no code implementations • RANLP 2019 • Mahmoud El-Haj
The Financial Narrative Summarisation task at MultiLing 2019 aims to demonstrate the value and challenges of applying automatic text summarisation to financial text written in English, usually referred to as financial narrative disclosures.
no code implementations • 28 Mar 2019 • Mahmoud El-Haj, Paul Rayson, Martin Walker, Steven Young, Vasiliki Simaki
We critically assess mainstream accounting and finance research applying methods from computational linguistics (CL) to study financial discourse.
no code implementations • WS 2017 • Mahmoud El-Haj, Paul Rayson, Scott Piao, Stephen Wattam
Creating high-quality, wide-coverage multilingual semantic lexicons to support knowledge-based approaches is a challenging, time-consuming manual task.
1 code implementation • LREC 2016 • Scott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Křen, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey Lee Teh, Olga Mudraya
Lexical coverage is an important factor concerning the quality of the lexicons and the performance of the corpus annotation tools, and in this experiment we focus on evaluating the lexical coverage achieved by the multilingual lexicons and semantic annotation tools based on them.
no code implementations • LREC 2016 • Mahmoud El-Haj, Paul Rayson
The Arabic sentences were written without diacritics, so in order to count the number of syllables we added the diacritics using an open-source tool called Mishkal.
no code implementations • LREC 2016 • Mahmoud El-Haj, Paul Rayson, Steve Young, Andrew Moore, Martin Walker, Thomas Schleicher, Vasiliki Athanasakou
Previous studies have only applied manual content analysis on a small scale to reveal such a bias in the narrative section of annual financial reports.
no code implementations • LREC 2014 • Mahmoud El-Haj, Paul Rayson, Steve Young, Martin Walker
In this paper we present the evaluation of our automatic methods for detecting and extracting document structure in annual financial reports.
no code implementations • LREC 2012 • Ahmet Aker, Mahmoud El-Haj, M-Dyaa Albakour, Udo Kruschwitz
For each run we assessed the results provided by 25 workers on a set of 10 tasks.