1 code implementation • 10 Mar 2023 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood
With the rapid growth of social media platforms, users are sharing billions of multimedia posts containing audio, images, and text.
no code implementations • 25 Feb 2023 • Saqlain Hussain Shah, Muhammad Saad Saeed, Shah Nawaz, Muhammad Haroon Yousaf
To achieve this task, we proposed a two-branch network to learn joint representations of faces and voices in a multimodal system.
1 code implementation • 22 Aug 2022 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue
In addition, we leverage cross-modal verification and matching tasks to analyze the impact of multiple languages on face-voice association.
2 code implementations • 20 Dec 2021 • Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue
Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks.
no code implementations • 28 Apr 2020 • Muhammad Saad Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, Ignazio Gallo, Muhammad Haroon Yousaf, Alessio Del Bue
Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition.