3 code implementations • 16 Jan 2020 • Shah Nawaz, Alessandro Calefati, Moreno Caraffini, Nicola Landro, Ignazio Gallo
In recent years, natural language descriptions are used to obtain information on discriminative parts of the object.
Ranked #1 on Multi-Modal Document Classification on CUB-200-2011
no code implementations • 18 Sep 2019 • Shah Nawaz, Muhammad Kamran Janjua, Ignazio Gallo, Arif Mahmood, Alessandro Calefati
We quantitatively and qualitatively evaluate the proposed approach on VoxCeleb, a benchmarks audio-visual dataset on a multitude of tasks including cross-modal verification, cross-modal matching, and cross-modal retrieval.
no code implementations • 9 Sep 2019 • Riccardo La Grassa, Ignazio Gallo, Alessandro Calefati, Dimitri Ognibene
The objective is to select the best structures created during the training phase using an ensemble of spanning trees.
1 code implementation • 9 Sep 2019 • Ignazio Gallo, Shah Nawaz, Alessandro Calefati, Riccardo La Grassa, Nicola Landro
Visualization refers to our ability to create an image in our head based on the text we read or the words we hear.
no code implementations • 3 Sep 2019 • Shah Nawaz, Muhammad Kamran Janjua, Ignazio Gallo, Arif Mahmood, Alessandro Calefati, Faisal Shafait
Our proposed measure evaluates the semantic similarity between the image and text representations in the latent embedding space.
no code implementations • 14 Jun 2019 • Riccardo La Grassa, Ignazio Gallo, Alessandro Calefati, Dimitri Ognibene
One-class classifiers are trained with target class only samples.
1 code implementation • 2 Apr 2019 • Omer Arshad, Ignazio Gallo, Shah Nawaz, Alessandro Calefati
With massive explosion of social media such as Twitter and Instagram, people daily share billions of multimedia posts, containing images and text.
no code implementations • 16 Oct 2018 • Muhammad Kamran Janjua, Shah Nawaz, Alessandro Calefati, Ignazio Gallo
Majority of the current dimensionality reduction or retrieval techniques rely on embedding the learned feature representations onto a computable metric space.
1 code implementation • 3 Oct 2018 • Ignazio Gallo, Alessandro Calefati, Shah Nawaz, Muhammad Kamran Janjua
To learn feature representations of resulting images, standard Convolutional Neural Networks (CNNs) are employed for the classification task.
no code implementations • 31 Aug 2018 • Shah Nawaz, Alessandro Calefati, Muhammad Kamran Janjua, Ignazio Gallo
The question we answer with this work is: can we convert a text document into an image to exploit best image classification models to classify documents?
1 code implementation • 23 Jul 2018 • Alessandro Calefati, Muhammad Kamran Janjua, Shah Nawaz, Ignazio Gallo
Conventionally, CNNs have been trained with softmax as supervision signal to penalize the classification loss.
Ranked #8 on Face Verification on YouTube Faces DB
no code implementations • 19 Jul 2018 • Shah Nawaz, Muhammad Kamran Janjua, Alessandro Calefati, Ignazio Gallo
We show that text encodings can capture semantic relationships between multiple modalities.