no code implementations • 23 Feb 2024 • Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth Balasubramanian
To effectively utilize the newly proposed augmentation technique, we employ a Siamese architecture-based training mechanism with a Deep Canonical Correlation Analysis (DCCA)-based loss to achieve collective learning of high-level feature representations from two different views of the input images.
Ranked #1 on Facial Landmark Detection on WFLW
no code implementations • 18 Dec 2023 • Abhinav Thorat, Ravi Kolla, Niranjan Pedanekar, Naoyuki Onoe
To measure the representation loss, we extend existing metrics such as Wasserstein and Maximum Mean Discrepancy (MMD) from the binary treatment setting to the multiple treatments scenario.
no code implementations • 4 Jun 2023 • Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik
Semi-supervised object detection (SSOD) has made significant progress with the development of pseudo-label-based end-to-end methods.
no code implementations • 21 Feb 2023 • Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe
Primary goal of an emotional voice conversion (EVC) system is to convert the emotion of a given speech signal from one style to another style without modifying the linguistic content of the signal.
no code implementations • 5 Jun 2022 • Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe
We introduce a new feature extractor to extract latent features from the audio and visual modality.
Ranked #15 on Emotion Recognition in Conversation on MELD
no code implementations • 7 Apr 2021 • Additya Popli, Saraansh Tandon, Joshua J. Engelsma, Naoyuki Onoe, Atsushi Okubo, Anoop Namboodiri
Therefore, rather than performing the two tasks separately, we propose a joint model for spoof detection and matching to simultaneously perform both tasks without compromising the accuracy of either task.