Search Results for author: Jyh-Shing Roger Jang

Found 18 papers, 5 papers with code

Novel Preprocessing Technique for Data Embedding in Engineering Code Generation Using Large Language Model

no code implementations27 Nov 2023 Yu-Chen Lin, Akhilesh Kumar, Norman Chang, Wenliang Zhang, Muhammad Zakir, Rucha Apte, Haiyang He, Chao Wang, Jyh-Shing Roger Jang

We present four main contributions to enhance the performance of Large Language Models (LLMs) in generating domain-specific code: (i) utilizing LLM-based data splitting and data renovation techniques to improve the semantic representation of embeddings' space; (ii) introducing the Chain of Density for Renovation Credibility (CoDRC), driven by LLMs, and the Adaptive Text Renovation (ATR) algorithm for assessing data renovation reliability; (iii) developing the Implicit Knowledge Expansion and Contemplation (IKEC) Prompt technique; and (iv) effectively refactoring existing scripts to generate new and high-quality scripts with LLMs.

Code Generation Language Modelling +2

Adapting pretrained speech model for Mandarin lyrics transcription and alignment

1 code implementation21 Nov 2023 Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

With the use of data augmentation and source separation model, results show that the proposed method achieves a character error rate of less than 18% on a Mandarin polyphonic dataset for lyrics transcription, and a mean absolute error of 0. 071 seconds for lyrics alignment.

Automatic Lyrics Transcription Data Augmentation

WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories

1 code implementation28 Jul 2023 Te-Yu Chi, Yu-Meng Tang, Chia-Wen Lu, Qiu-Xia Zhang, Jyh-Shing Roger Jang

To achieve this objective, we propose a novel self-training strategy that uses labels rather than text for training, significantly reducing the model's training time.

text-classification Text Classification +1

Personalized Audio Quality Preference Prediction

no code implementations16 Feb 2023 Chung-Che Wang, Yu-Chun Lin, Yu-Teng Hsu, Jyh-Shing Roger Jang

A siamese network is used to compare the inputs and predict the preference.

Multimodal Transformer Distillation for Audio-Visual Synchronization

2 code implementations27 Oct 2022 Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-Yi Lee, Jyh-Shing Roger Jang

This paper proposed an MTDVocaLiST model, which is trained by our proposed multimodal Transformer distillation (MTD) loss.

Audio-Visual Synchronization

Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification

no code implementations31 Mar 2022 Yen-Lun Liao, Xuanjun Chen, Chung-Che Wang, Jyh-Shing Roger Jang

The countermeasure (CM) model is developed to protect ASV systems from spoof attacks and prevent resulting personal information leakage in Automatic Speaker Verification (ASV) system.

Knowledge Distillation Speaker Verification

Learning to match transient sound events using attentional similarity for few-shot sound recognition

1 code implementation4 Dec 2018 Szu-Yu Chou, Kai-Hsiang Cheng, Jyh-Shing Roger Jang, Yi-Hsuan Yang

In this paper, we introduce a novel attentional similarity module for the problem of few-shot sound recognition.

Sound Audio and Speech Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.