2 code implementations • 25 Mar 2024 • Hugo Sousa, Ricardo Campos, Alípio Jorge
This corpus comprises a total of $138, 069$ documents (over six languages) with $1, 050, 921$ temporal expressions, the largest open-source annotated dataset for temporal expression identification to date.
no code implementations • 3 Jan 2024 • Rúben Almeida, Hugo Sousa, Luís F. Cunha, Nuno Guimarães, Ricardo Campos, Alípio Jorge
The capabilities of the most recent language models have increased the interest in integrating them into real-world applications.
1 code implementation • 24 Nov 2023 • Hugo Sousa, Nuno Guimarães, Alípio Jorge, Ricardo Campos
By studying the strengths and limitations of these models in the context of information extraction, we offer insights that can guide future improvements and avenues to explore in this field.
no code implementations • 18 Apr 2023 • Hugo Sousa, Arian Pasquali, Alípio Jorge, Catarina Sousa Santos, Mário Amorim Lopes
In this paper, we present the approach we developed to extract procedures, drugs, and diseases from oncology health records written in European Portuguese.
1 code implementation • 11 Jan 2023 • Hugo Sousa, Alípio Jorge, Ricardo Campos
All in all, these problems have limited the fair comparison between approaches and consequently, the development of temporal extraction systems.