1 code implementation • CVPR 2021 • Amaia Salvador, Erhan Gundogdu, Loris Bazzani, Michael Donoser
Cross-modal recipe retrieval has recently gained substantial attention due to the importance of food in people's lives, as well as the availability of vast amounts of digital cooking recipes and food images to train machine learning models.
Ranked #6 on Cross-Modal Retrieval on Recipe1M
no code implementations • 25 Aug 2020 • Miriam Bellver, Amaia Salvador, Jordi Torres, Xavier Giro-i-Nieto
Our method first predicts pseudo-masks for the unlabeled pool of samples, together with a score that estimates the quality of each mask.
no code implementations • 23 Sep 2019 • Irene Amerini, Elena Balashova, Sayna Ebrahimi, Kathryn Leonard, Arsha Nagrani, Amaia Salvador
In this paper we present the Women in Computer Vision Workshop - WiCV 2019, organized in conjunction with CVPR 2019.
no code implementations • 14 May 2019 • Miriam Bellver, Amaia Salvador, Jordi Torres, Xavier Giro-i-Nieto
Methods that move towards less supervised scenarios are key for image segmentation, as dense labels demand significant human intervention.
1 code implementation • 11 Apr 2019 • Luis Pineda, Amaia Salvador, Michal Drozdzal, Adriana Romero
In this paper, we identify an important reproducibility challenge in the image-to-set prediction literature that impedes proper comparisons among published methods, namely, researchers use different evaluation protocols to assess their contributions.
3 code implementations • 25 Mar 2019 • Amanda Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giro-i-Nieto
Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker.
1 code implementation • CVPR 2019 • Carles Ventura, Miriam Bellver, Andreu Girbau, Amaia Salvador, Ferran Marques, Xavier Giro-i-Nieto
Multiple object video object segmentation is a challenging task, especially in the zero-shot case, where no object mask is given at the initial frame and the model has to find the objects to be segmented along the sequence.
Ranked #1 on One-shot visual object segmentation on YouTube-VOS
Unsupervised Video Object Segmentation
4 code implementations • CVPR 2019 • Amaia Salvador, Michal Drozdzal, Xavier Giro-i-Nieto, Adriana Romero
Our system predicts ingredients as sets by means of a novel architecture, modeling their dependencies without imposing any order, and then generates cooking instructions by attending to both the image and its inferred ingredients simultaneously.
Ranked #1 on Recipe Generation on Recipe1M
no code implementations • 14 Oct 2018 • Javier Marin, Aritro Biswas, Ferda Ofli, Nicholas Hynes, Amaia Salvador, Yusuf Aytar, Ingmar Weber, Antonio Torralba
In this paper, we introduce Recipe1M+, a new large-scale, structured corpus of over one million cooking recipes and 13 million food images.
Ranked #2 on Cross-Modal Retrieval on Recipe1M+
1 code implementation • 7 Jan 2018 • Didac Surís, Amanda Duarte, Amaia Salvador, Jordi Torres, Xavier Giró-i-Nieto
The increasing amount of online videos brings several opportunities for training self-supervised neural networks.
1 code implementation • 2 Dec 2017 • Amaia Salvador, Miriam Bellver, Victor Campos, Manel Baradad, Ferran Marques, Jordi Torres, Xavier Giro-i-Nieto
We present a recurrent model for semantic instance segmentation that sequentially generates binary masks and their associated class probabilities for every object in an image.
no code implementations • CVPR 2017 • Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, Ferda Ofli, Ingmar Weber, Antonio Torralba
In this paper, we introduce Recipe1M, a new large-scale, structured corpus of over 1m cooking recipes and 800k food images.
3 code implementations • 29 Aug 2016 • Alberto Montes, Amaia Salvador, Santiago Pascual, Xavier Giro-i-Nieto
This thesis explores different approaches using Convolutional and Recurrent Neural Networks to classify and temporally localize activities in videos, and proposes an implementation to achieve this.
3 code implementations • 29 Apr 2016 • Amaia Salvador, Xavier Giro-i-Nieto, Ferran Marques, Shin'ichi Satoh
This work explores the suitability for instance retrieval of image- and region-wise representations pooled from an object detection CNN such as Faster R-CNN.
2 code implementations • 15 Apr 2016 • Eva Mohedano, Amaia Salvador, Kevin McGuinness, Ferran Marques, Noel E. O'Connor, Xavier Giro-i-Nieto
This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of a CNN using the bag-of-words (BoW) aggregation scheme.
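As a rough illustration of bag-of-words aggregation in general (not the authors' implementation), the sketch below assigns each local feature to its nearest visual word and compares the resulting L2-normalized histograms with a dot product. The function names, the toy 2-D features, and the three-word vocabulary are all hypothetical; in a real pipeline the local features would be CNN activations and the vocabulary would typically be learned by clustering.

```python
from collections import Counter
from math import sqrt


def nearest_word(feature, vocabulary):
    """Return the index of the visual word (centroid) closest to `feature`."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(vocabulary)), key=lambda k: sq_dist(feature, vocabulary[k]))


def bow_encode(local_features, vocabulary):
    """Encode a set of local features as an L2-normalized visual-word histogram."""
    counts = Counter(nearest_word(f, vocabulary) for f in local_features)
    hist = [float(counts.get(k, 0)) for k in range(len(vocabulary))]
    norm = sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist


def cosine_similarity(a, b):
    """Dot product of two L2-normalized histograms."""
    return sum(x * y for x, y in zip(a, b))


# Toy example: a 3-word vocabulary over 2-D features (hypothetical values).
vocabulary = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
query = bow_encode([(0.1, 0.0), (0.9, 0.1), (1.1, -0.1)], vocabulary)
target = bow_encode([(0.0, 0.1), (1.0, 0.0), (0.9, 0.0)], vocabulary)
score = cosine_similarity(query, target)
```

Because the histograms are sparse when the vocabulary is large, this kind of encoding is commonly paired with an inverted index for fast lookup; the sketch omits that step.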
1 code implementation • 20 Aug 2015 • Victor Campos, Amaia Salvador, Brendan Jou, Xavier Giró-i-Nieto
Visual media are powerful means of expressing emotions and sentiments.
no code implementations • 1 May 2015 • Ferran Cabezas, Axel Carlier, Amaia Salvador, Xavier Giró-i-Nieto, Vincent Charvillat
This paper explores processing techniques to deal with noisy data in crowdsourced object segmentation tasks.
no code implementations • 24 Apr 2015 • Amaia Salvador, Matthias Zeppelzauer, Daniel Manchon-Vizuete, Andrea Calafell, Xavier Giro-i-Nieto
Our solution is based on the combination of visual features extracted from convolutional neural networks with temporal information using a hierarchical classifier scheme.
no code implementations • 9 Apr 2015 • Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giró-i-Nieto, Graham Healy, Kevin McGuinness, Noel O'Connor, Alan F. Smeaton
We show that it is indeed possible to detect such objects in complex images and that users with previous knowledge of the dataset or experience with RSVP outperform others.