Search Results for author: Kazuki Yamauchi

Found 1 papers, 0 papers with code

StyleCap: Automatic Speaking-Style Captioning from Speech Based on Speech and Language Self-supervised Learning Models

no code implementations28 Nov 2023 Kazuki Yamauchi, Yusuke Ijima, Yuki Saito

The experimental results demonstrate that our StyleCap leveraging richer LLMs for the text decoder, speech self-supervised learning (SSL) features, and sentence rephrasing augmentation improves the accuracy and diversity of generated speaking-style captions.

Language Modelling Large Language Model +2

Cannot find the paper you are looking for? You can Submit a new open access paper.