1 code implementation • 22 Dec 2023 • Ali Abdari, Alex Falcon, Giuseppe Serra
Recently, the Metaverse is becoming increasingly attractive, with millions of users accessing the many available virtual worlds.
1 code implementation • 6 Sep 2023 • Ali Abdari, Alex Falcon, Giuseppe Serra
Nowadays, many people frequently have to search for new accommodation options.
no code implementations • 27 Jun 2023 • Alex Falcon, Giuseppe Serra
In this report, we present the technical details of our submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2023.
1 code implementation • 3 Aug 2022 • Alex Falcon, Giuseppe Serra, Oswald Lanz
Data augmentation techniques were introduced to increase the performance on unseen test examples by creating new training samples with the application of semantics-preserving techniques, such as color space or geometric transformations on images.
no code implementations • 22 Jun 2022 • Alex Falcon, Giuseppe Serra, Sergio Escalera, Oswald Lanz
This report presents the technical details of our submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022.
Ranked #3 on Multi-Instance Retrieval on EPIC-KITCHENS-100
1 code implementation • 27 Apr 2022 • Alex Falcon, Swathikiran Sudhakaran, Giuseppe Serra, Sergio Escalera, Oswald Lanz
We show that even if we carefully tuned the fixed margin, our technique (which does not have the margin as a hyper-parameter) would still achieve better performance.
Ranked #7 on Multi-Instance Retrieval on EPIC-KITCHENS-100
2 code implementations • 16 Mar 2022 • Alex Falcon, Giuseppe Serra, Oswald Lanz
Due to the amount of videos and related captions uploaded every hour, deep learning-based solutions for cross-modal video retrieval are attracting more and more attention.
Ranked #5 on Multi-Instance Retrieval on EPIC-KITCHENS-100
no code implementations • 6 Oct 2021 • Swathikiran Sudhakaran, Adrian Bulat, Juan-Manuel Perez-Rua, Alex Falcon, Sergio Escalera, Oswald Lanz, Brais Martinez, Georgios Tzimiropoulos
This report presents the technical details of our submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021.
no code implementations • 22 Aug 2020 • Alex Falcon, Oswald Lanz, Giuseppe Serra
Video Question Answering (VideoQA) is a task that requires a model to analyze and understand both the visual content given by the input video and the textual part given by the question, and the interaction between them in order to produce a meaningful answer.
no code implementations • 9 Oct 2019 • Marco Menardi, Alex Falcon, Saida S. Mohamed, Lorenzo Seidenari, Giuseppe Serra, Alberto del Bimbo, Carlo Tasso
To address this issue, in this paper we propose an approach capable of generating images starting from a given text using conditional GANs trained on uncaptioned images dataset.