Search Results for author: Peng Chang

Found 7 papers, 1 papers with code

Bridging the Gap: Deciphering Tabular Data Using Large Language Model

no code implementations23 Aug 2023 Hengyuan Zhang, Peng Chang, Zongcheng Ji

This research marks the first application of large language models to table-based question answering tasks, enhancing the model's comprehension of both table structures and content.

Language Modelling Large Language Model +1

A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition

no code implementations15 Apr 2023 Ruchao Fan, Wei Chu, Peng Chang, Abeer Alwan

During inference, an error-based alignment sampling method is investigated in depth to reduce the alignment mismatch in the training and testing processes.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms

no code implementations CVPR 2021 Yuxing Tang, Zhenjie Cao, Yanbo Zhang, Zhicheng Yang, Zongcheng Ji, Yiwei Wang, Mei Han, Jie Ma, Jing Xiao, Peng Chang

Starting with a fully supervised model trained on the data with pixel-level masks, the proposed framework iteratively refines the model itself using the entire weakly labeled data (image-level soft label) in a self-training fashion.

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

no code implementations18 Jun 2021 Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

For the analyses, we plot attention weight distributions in the decoders to visualize the relationships between token-level acoustic embeddings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition

no code implementations28 Oct 2020 Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao

The information are used to extract acoustic representation for each token in parallel, referred to as token-level acoustic embedding which substitutes the word embedding in autoregressive transformer (AT) to achieve parallel generation in decoder.

speech-recognition Speech Recognition

Sim2Real2Sim: Bridging the Gap Between Simulation and Real-World in Flexible Object Manipulation

no code implementations6 Feb 2020 Peng Chang, Taskin Padir

This paper addresses a new strategy called Simulation-to-Real-to-Simulation (Sim2Real2Sim) to bridge the gap between simulation and real-world, and automate a flexible object manipulation task.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.