Search Results for author: Yi-Chen Chen

Found 19 papers, 4 papers with code

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

no code implementations • 29 Jul 2022 • Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-Yi Lee

GAN training is adopted in the first stage to find the mapping relationship between unpaired speech and phone sequence.

Acoustic Unit Discovery Generative Adversarial Network

Paper
Add Code

Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech

1 code implementation • 7 Nov 2021 • Sung-Feng Huang, Chyi-Jiunn Lin, Da-Rong Liu, Yi-Chen Chen, Hung-Yi Lee

On the one hand, speaker adaptation methods fine-tune a trained multi-speaker text-to-speech (TTS) model with few enrolled samples.

Meta-Learning Speech Synthesis

182

Paper
Code

Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning

no code implementations • 18 Oct 2021 • Yi-Chen Chen, Shu-wen Yang, Cheng-Kuang Lee, Simon See, Hung-Yi Lee

It has been shown that an SSL pretraining model can achieve excellent performance in various downstream tasks of speech processing.

Multi-Task Learning Representation Learning +1

Paper
Add Code

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

1 code implementation • 7 May 2021 • Yi-Chen Chen, Po-Han Chi, Shu-wen Yang, Kai-Wei Chang, Jheng-Hao Lin, Sung-Feng Huang, Da-Rong Liu, Chi-Liang Liu, Cheng-Kuang Lee, Hung-Yi Lee

The multi-task learning of a wide variety of speech processing tasks with a universal model has not been studied.

Multi-Task Learning

Paper
Code

Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training

1 code implementation • 29 Oct 2020 • Sung-Feng Huang, Shun-Po Chuang, Da-Rong Liu, Yi-Chen Chen, Gene-Ping Yang, Hung-Yi Lee

Speech separation has been well developed, with the very successful permutation invariant training (PIT) approach, although the frequent label assignment switching happening during PIT training remains to be a problem when better convergence speed and achievable performance are desired.

Ranked #6 on Speech Separation on Libri2Mix (using extra training data)

Speaker Separation Speech Enhancement +1

Paper
Code

DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation

no code implementations • 13 May 2020 • Yi-Chen Chen, Jui-Yang Hsu, Cheng-Kuang Lee, Hung-Yi Lee

In order to examine the generalizability of DARTS-ASR, we apply our approach not only on many languages to perform monolingual ASR, but also on a multilingual ASR setting.

speech-recognition Speech Recognition

Paper
Add Code

AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition

no code implementations • 27 Nov 2019 • Yi-Chen Chen, Zhaojun Yang, Ching-Feng Yeh, Mahaveer Jain, Michael L. Seltzer

As one of the major sources in speech variability, accents have posed a grand challenge to the robustness of speech recognition systems.

Representation Learning speech-recognition +1

Paper
Add Code

From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings

no code implementations • 10 Apr 2019 • Yi-Chen Chen, Sung-Feng Huang, Hung-Yi Lee, Lin-shan Lee

However, we note human babies start to learn the language by the sounds (or phonetic structures) of a small number of exemplar words, and "generalize" such knowledge to other words without hearing a large amount of data.

speech-recognition Speech Recognition +1

Paper
Add Code

Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection

no code implementations • 7 Nov 2018 • Sung-Feng Huang, Yi-Chen Chen, Hung-Yi Lee, Lin-shan Lee

Embedding audio signal segments into vectors with fixed dimensionality is attractive because all following processing will be easier and more efficient, for example modeling, classifying or indexing.

Clustering

Paper
Add Code

Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data

no code implementations • 30 Oct 2018 • Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang, Hung-Yi Lee, Lin-shan Lee

This can be learned by aligning a small number of spoken words and the corresponding text words in the embedding spaces.

speech-recognition Speech Recognition +1

Paper
Add Code

Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval

no code implementations • 21 Jul 2018 • Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-Yi Lee, Lin-shan Lee

Stage 1 performs phonetic embedding with speaker characteristics disentangled.

Retrieval

Paper
Add Code

Scalable Bilinear Pi Learning Using State and Action Features

no code implementations • ICML 2018 • Yi-Chen Chen, Lihong Li, Mengdi Wang

In this work, we study a primal-dual formulation of the ALP, and develop a scalable, model-free algorithm called bilinear $\pi$ learning for reinforcement learning when a sampling oracle is provided.

Paper
Add Code

Scalable Bilinear $π$ Learning Using State and Action Features

no code implementations • 27 Apr 2018 • Yi-Chen Chen, Lihong Li, Mengdi Wang

In this work, we study a primal-dual formulation of the ALP, and develop a scalable, model-free algorithm called bilinear $\pi$ learning for reinforcement learning when a sampling oracle is provided.

Paper
Add Code

Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

no code implementations • 29 Mar 2018 • Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang, Hung-Yi Lee

In this work, we propose a framework to achieve unsupervised ASR on a read English speech dataset, where audio and text are unaligned.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Order-Free RNN with Visual Attention for Multi-Label Classification

1 code implementation • 18 Jul 2017 • Shang-Fu Chen, Yi-Chen Chen, Chih-Kuan Yeh, Yu-Chiang Frank Wang

In this paper, we propose the joint learning attention and recurrent neural network (RNN) models for multi-label classification.

Classification General Classification +2

Paper
Code

Lower Bound On the Computational Complexity of Discounted Markov Decision Problems

no code implementations • 20 May 2017 • Yi-Chen Chen, Mengdi Wang

We study the computational complexity of the infinite-horizon discounted-reward Markov Decision Problem (MDP) with a finite state space $|\mathcal{S}|$ and a finite action space $|\mathcal{A}|$.

Paper
Add Code

Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning

no code implementations • 8 Dec 2016 • Yi-Chen Chen, Mengdi Wang

We study the online estimation of the optimal policy of a Markov decision process (MDP).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Random Multi-Constraint Projection: Stochastic Gradient Methods for Convex Optimization with Many Constraints

no code implementations • 12 Nov 2015 • Mengdi Wang, Yi-Chen Chen, Jialin Liu, Yuantao Gu

Consider convex optimization problems subject to a large number of constraints.

Paper
Add Code

Dictionary Learning from Ambiguously Labeled Data

no code implementations • CVPR 2013 • Yi-Chen Chen, Vishal M. Patel, Jaishanker K. Pillai, Rama Chellappa, P. J. Phillips

We propose a novel dictionary-based learning method for ambiguously labeled multiclass classification, where each training sample has multiple labels and only one of them is the correct label.

Dictionary Learning General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.