1 code implementation • 20 Jan 2023 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró
Modern image captioning system relies heavily on extracting knowledge from images to capture the concept of a static story.
no code implementations • 26 Sep 2022 • Ahmed Sabir
This paper focuses on enhancing the captions generated by image-caption generation systems.
1 code implementation • COLING 2022 • Ahmed Sabir, Francesc Moreno-Noguer, Pranava Madhyastha, Lluís Padró
In this work, we focus on improving the captions generated by image-caption generation systems.
no code implementations • 21 Apr 2020 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró
In this paper, we propose a visual context dataset for Text Spotting in the wild, where the publicly available dataset COCO-text [Veit et al. 2016] has been extended with information about the scene (such as objects and places appearing in the image) to enable researchers to include semantic relations between texts and scene in their Text Spotting systems, and to offer a common framework for such approaches.
3 code implementations • IJCNLP 2019 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró
We present a scenario where semantic similarity is not enough, and we devise a neural approach to learn semantic relatedness.
3 code implementations • 29 Oct 2018 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró
We propose a post-processing approach to improve scene text recognition accuracy by using occurrence probabilities of words (unigram language model), and the semantic correlation between scene and text.
1 code implementation • 23 Oct 2018 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró
In this paper, we propose a post-processing approach to improve the accuracy of text spotting by using the semantic relation between the text and the scene.