no code implementations • 26 Aug 2024 • Jinhyeok Yang, Junhyeok Lee, Hyeong-Seok Choi, Seunghun Ji, Hyeongju Kim, Juheon Lee
Text-to-Speech (TTS) models have advanced significantly, aiming to accurately replicate human speech's diversity, including unique speaker identities and linguistic nuances.
no code implementations • 22 Aug 2024 • RuiXiao Zhang, Juheon Lee, Xiaohao Cai, Adam Prugel-Bennett
Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving.
no code implementations • 5 Jul 2024 • Juheon Lee, Xiaohao Cai, Carola-Bibian Schönlieb, Simon Masnou
In this paper, we propose a new surface geometry characterisation within this realm, namely a neural varifold representation of point clouds.
1 code implementation • 4 Jul 2024 • RuiXiao Zhang, Yihong Wu, Juheon Lee, Adam Prugel-Bennett, Xiaohao Cai
This raises a fundamental question related to the evaluation of the 3D object detection models' cross-domain performance: Do we really need models to maintain excellent performance in their original 3D bounding boxes after being applied across domains?
1 code implementation • 17 Apr 2024 • Rachel, Chen, Juheon Lee, Chuang Gan, Zijiang Yang, Mohammad Amin Nabian, Jun Zeng
Metal Sintering is a necessary step for Metal Injection Molded parts and binder jet such as HP's metal 3D printer.
no code implementations • 11 Nov 2022 • Yoori Oh, Juheon Lee, Yoseob Han, Kyogu Lee
However, the emotional latent space generated from the existing models is difficult to control the continuous emotional intensity because of the entanglement of features like emotions, speakers, etc.
2 code implementations • NeurIPS 2021 • Hyeong-Seok Choi, Juheon Lee, Wansoo Kim, Jie Hwan Lee, Hoon Heo, Kyogu Lee
We present a neural analysis and synthesis (NANSY) framework that can manipulate voice, pitch, and speed of an arbitrary speech signal.
no code implementations • 29 Oct 2019 • Juheon Lee, Hyeong-Seok Choi, Junghyun Koo, Kyogu Lee
In this study, we define the identity of the singer with two independent concepts - timbre and singing style - and propose a multi-singer singing synthesis system that can model them separately.
Sound Audio and Speech Processing
no code implementations • 6 Aug 2019 • Juheon Lee, Hyeong-Seok Choi, Chang-Bin Jeon, Junghyun Koo, Kyogu Lee
In this paper, we propose an end-to-end Korean singing voice synthesis system from lyrics and a symbolic melody using the following three novel approaches: 1) phonetic enhancement masking, 2) local conditioning of text and pitch to the super-resolution network, and 3) conditional adversarial training.
Sound Audio and Speech Processing
1 code implementation • 20 Mar 2019 • Jonathan Williams, Carola-Bibiane Schönlieb, Tom Swinfield, Juheon Lee, Xiaohao Cai, Lan Qie, David A. Coomes
From these three-dimensional crowns, we are able to measure individual tree biomass.
no code implementations • 1 Dec 2017 • Sungkyun Chang, Juheon Lee, Sang Keun Choe, Kyogu Lee
To do this, we first build the CNN using as an input a cross-similarity matrix generated from a pair of songs.
no code implementations • 24 Jan 2017 • Juheon Lee, David Coomes, Carola-Bibiane Schonlieb, Xiaohao Cai, Jan Lellmann, Michele Dalponte, Yadvinder Malhi, Nathalie Butt, Mike Morecroft
Here we develop a 3D tree delineation method which uses graph cut to delineate trees from the full 3D LiDAR point cloud, and also makes use of any optical imagery available (hyperspectral imagery in our case).
no code implementations • 28 Jul 2014 • Juheon Lee, Xiaohao Cai, Carola-Bibiane Schonlieb, David Coomes
There is much current interest in using multi-sensor airborne remote sensing to monitor the structure and biodiversity of forests.