no code implementations • 8 Jan 2023 • Heli Qi, Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
This paper introduces SpeeChain, an open-source PyTorch-based toolkit designed to develop the machine speech chain for large-scale use.
no code implementations • 14 May 2022 • Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura
The existing paradigm of semi-supervised S2S ASR utilizes SpecAugment as data augmentation and requires a static teacher model to produce pseudo transcripts for untranscribed speech.
Automatic Speech Recognition (ASR) +2
no code implementations • 10 Nov 2020 • Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
This paper presents a newly developed, simultaneous neural speech-to-speech translation system and its evaluation.
Automatic Speech Recognition (ASR) +6
no code implementations • LREC 2020 • Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
We then develop ASR and TTS for ethnic languages by utilizing Indonesian ASR and TTS in a cross-lingual machine speech chain framework with only text or only speech data, removing the need for paired speech-text data of those ethnic languages.
no code implementations • 4 Nov 2020 • Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
One main reason is that the model needs to decide the incremental steps and learn the transcription that aligns with the current short speech segment.
Automatic Speech Recognition (ASR) +1
no code implementations • 4 Nov 2020 • Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
By contrast, humans can listen to what they speak in real time, and if there is a delay in hearing, they won't be able to continue speaking.
Automatic Speech Recognition (ASR) +3