no code implementations • 21 Jun 2018 • Arka Ujjal Dey, Suman K. Ghosh, Ernest Valveny
We propose a framework for automated classification of advertisement images that uses not only visual features but also textual cues extracted from embedded text.
no code implementations • 25 May 2019 • Arka Ujjal Dey, Suman Kumar Ghosh, Ernest Valveny, Gaurav Harit
Images with visual and scene text content are ubiquitous in everyday life.
no code implementations • 22 Aug 2021 • Arka Ujjal Dey, Ernest Valveny, Gaurav Harit
The open-ended question answering task of Text-VQA often requires reading and reasoning about rarely seen or completely unseen scene-text content of an image.
Open-Ended Question Answering • Optical Character Recognition (OCR)