Search Results for author: Peng Chang

Found 7 papers, 1 papers with code

Bridging the Gap: Deciphering Tabular Data Using Large Language Model

no code implementations • 23 Aug 2023 • Hengyuan Zhang, Peng Chang, Zongcheng Ji

This research marks the first application of large language models to table-based question answering tasks, enhancing the model's comprehension of both table structures and content.

Language Modelling Large Language Model +1

Paper
Add Code

A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition

no code implementations • 15 Apr 2023 • Ruchao Fan, Wei Chu, Peng Chang, Abeer Alwan

During inference, an error-based alignment sampling method is investigated in depth to reduce the alignment mismatch in the training and testing processes.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

1 code implementation • 6 May 2022 • Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

Automatic pronunciation assessment is an important technology to help self-directed language learners.

Ranked #2 on Phone-level pronunciation scoring on speechocean762 (using extra training data)

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

118

Paper
Code

Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms

no code implementations • CVPR 2021 • Yuxing Tang, Zhenjie Cao, Yanbo Zhang, Zhicheng Yang, Zongcheng Ji, Yiwei Wang, Mei Han, Jie Ma, Jing Xiao, Peng Chang

Starting with a fully supervised model trained on the data with pixel-level masks, the proposed framework iteratively refines the model itself using the entire weakly labeled data (image-level soft label) in a self-training fashion.

Paper
Add Code

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

no code implementations • 18 Jun 2021 • Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

For the analyses, we plot attention weight distributions in the decoders to visualize the relationships between token-level acoustic embeddings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition

no code implementations • 28 Oct 2020 • Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao

The information are used to extract acoustic representation for each token in parallel, referred to as token-level acoustic embedding which substitutes the word embedding in autoregressive transformer (AT) to achieve parallel generation in decoder.

speech-recognition Speech Recognition

Paper
Add Code

Sim2Real2Sim: Bridging the Gap Between Simulation and Real-World in Flexible Object Manipulation

no code implementations • 6 Feb 2020 • Peng Chang, Taskin Padir

This paper addresses a new strategy called Simulation-to-Real-to-Simulation (Sim2Real2Sim) to bridge the gap between simulation and real-world, and automate a flexible object manipulation task.

Robotics

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.