1 code implementation • 25 Aug 2024 • Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen, Ulrich Neumann
We extend the analysis to a total of 4 datasets and discuss the best practice in synthetic data curation for training indoor monocular depth.
Indoor Monocular Depth Estimation
Monocular Depth Estimation
+1
1 code implementation • 24 Sep 2023 • Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen, Ulrich Neumann
To facilitate our investigation for robustness and address limitations of previous works, we collect InSpaceType, a high-quality and high-resolution RGBD dataset for general indoor environments.
Indoor Monocular Depth Estimation
Monocular Depth Estimation
no code implementations • 25 Jun 2022 • Chin-Cheng Hsu
We formulated non-speech vocalization (NSV) modeling as a text-to-speech task and verified its viability.
1 code implementation • CVPR 2022 • Cho-Ying Wu, Chin-Cheng Hsu, Ulrich Neumann
This work digs into a root question in human perception: can face geometry be gleaned from one's voices?
Ranked #1 on
3D Face Modelling
on Voxceleb-3D
1 code implementation • 21 Apr 2021 • Cho-Ying Wu, Ke Xu, Chin-Cheng Hsu, Ulrich Neumann
This work focuses on the analysis that whether 3D face models can be learned from only the speech inputs of speakers.
1 code implementation • 18 Aug 2020 • Arindam Jati, Chin-Cheng Hsu, Monisankha Pal, Raghuveer Peri, Wael Abd-Almageed, Shrikanth Narayanan
Robust speaker recognition, including in the presence of malicious attacks, is becoming increasingly important and essential, especially due to the proliferation of several smart speakers and personal agents that interact with an individual's voice commands to perform diverse, and even sensitive tasks.
1 code implementation • 4 Apr 2017 • Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang
Building a voice conversion (VC) system from non-parallel speech corpora is challenging but highly valuable in real application scenarios.
5 code implementations • 13 Oct 2016 • Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang
We propose a flexible framework for spectral conversion (SC) that facilitates training with unaligned corpora.
no code implementations • 13 Oct 2016 • Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang
In this paper, we propose a dictionary update method for Nonnegative Matrix Factorization (NMF) with high dimensional data in a spectral conversion (SC) task.