no code implementations • ECCV 2020 • Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas
People from different parts of the globe describe objects and concepts in distinct manners.
no code implementations • 24 Jun 2020 • Rafael Redondo, Jaume Gibert
Existing face datasets often lack sufficient representation of occluding objects, which can hinder recognition, but also supply meaningful information to understand the visual context.
1 code implementation • 9 Oct 2019 • Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas
In this work we target the problem of hate speech detection in multimodal publications formed by a text and an image.
1 code implementation • 4 Jun 2019 • Raul Gomez, Ali Furkan Biten, Lluis Gomez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas
This paper explores the possibilities of image style transfer applied to text maintaining the original transcriptions.
no code implementations • 10 May 2019 • Giuseppe Amato, Malte Behrmann, Frédéric Bimbot, Baptiste Caramiaux, Fabrizio Falchi, Ander Garcia, Joost Geurts, Jaume Gibert, Guillaume Gravier, Hadmut Holken, Hartmut Koenitz, Sylvain Lefebvre, Antoine Liutkus, Fabien Lotte, Andrew Perkis, Rafael Redondo, Enrico Turrin, Thierry Vieville, Emmanuel Vincent
Thanks to the Big Data revolution and increasing computing capacities, Artificial Intelligence (AI) has made an impressive revival over the past few years and is now omnipresent in both research and industry.
1 code implementation • 7 Jan 2019 • Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas
In this work we propose to exploit this free available data to learn a multimodal image and text embedding, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval.
1 code implementation • 20 Aug 2018 • Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas
In this paper we propose to learn a multimodal image and text embedding from Web and Social Media data, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval.
1 code implementation • 20 Aug 2018 • Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas
We perform a language separate treatment of the data and show that it can be extrapolated to a tourists and locals separate analysis, and that tourism is reflected in Social Media at a neighborhood level.