Search Results for author: Yeon-Jun Kim

Found 6 papers, 0 papers with code

1SPU: 1-step Speech Processing Unit

no code implementations8 Nov 2023 Karan Singla, Shahab Jalalvand, Yeon-Jun Kim, Antonio Moreno Daniel, Srinivas Bangalore, Andrej Ljolje, Ben Stern

Recent studies have made some progress in refining end-to-end (E2E) speech recognition encoders by applying Connectionist Temporal Classification (CTC) loss to enhance named entity recognition within transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

E2E Spoken Entity Extraction for Virtual Agents

no code implementations16 Feb 2023 Karan Singla, Yeon-Jun Kim, Srinivas Bangalore

In human-computer conversations, extracting entities such as names, street addresses and email addresses from speech is a challenging task.

Building Text-To-Speech Voices in the Cloud

no code implementations LREC 2012 Alistair Conkie, Thomas Okken, Yeon-Jun Kim, Giuseppe Di Fabbrizio

The AT{\&}T VoiceBuilder provides a new tool to researchers and practitioners who want to have their voices synthesized by a high-quality commercial-grade text-to-speech system without the need to install, configure, or manage speech processing software and equipment. It is implemented as a web service on the AT{\&}T Speech Mashup Portal. The system records and validates users' utterances, processes them to build a synthetic voice and provides a web service API to make the voice available to real-time applications through a scalable cloud-based processing platform.

Speech Recognition Speech Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.