Search Results for author: George Sung

Found 8 papers, 1 papers with code

Binaural Angular Separation Network

no code implementations16 Jan 2024 Yang Yang, George Sung, Shao-Fu Shih, Hakan Erdogan, Chehung Lee, Matthias Grundmann

We propose a neural network model that can separate target speech sources from interfering sources at different angular regions using two microphones.

StreamVC: Real-Time Low-Latency Voice Conversion

no code implementations5 Jan 2024 Yang Yang, Yury Kartynnik, Yunpeng Li, Jiuqiang Tang, Xing Li, George Sung, Matthias Grundmann

We present StreamVC, a streaming voice conversion solution that preserves the content and prosody of any source speech while matching the voice timbre from any target speech.

Speech Synthesis Voice Conversion

On-device Real-time Custom Hand Gesture Recognition

no code implementations19 Sep 2023 Esha Uboweja, David Tian, Qifei Wang, Yi-Chun Kuo, Joe Zou, Lu Wang, George Sung, Matthias Grundmann

Our framework provides a pre-trained single-hand embedding model that can be fine-tuned for custom gesture recognition.

Hand Gesture Recognition Hand-Gesture Recognition

Guided Speech Enhancement Network

no code implementations13 Mar 2023 Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, Matthias Grundmann

Multi-microphone speech enhancement problem is often decomposed into two decoupled steps: a beamformer that provides spatial filtering and a single-channel speech enhancement model that cleans up the beamformer output.

Denoising Speech Enhancement

On-device Real-time Hand Gesture Recognition

no code implementations29 Oct 2021 George Sung, Kanstantsin Sokal, Esha Uboweja, Valentin Bazarevsky, Jonathan Baccash, Eduard Gabriel Bazavan, Chuo-Ling Chang, Matthias Grundmann

We present an on-device real-time hand gesture recognition (HGR) system, which detects a set of predefined static gestures from a single RGB camera.

Hand Gesture Recognition Hand-Gesture Recognition

MediaPipe Hands: On-device Real-time Hand Tracking

4 code implementations18 Jun 2020 Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann

We present a real-time on-device hand tracking pipeline that predicts hand skeleton from single RGB camera for AR/VR applications.

Cannot find the paper you are looking for? You can Submit a new open access paper.