TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Speech Recognition	Common Voice Frisian	wav2vec2-large-xls-r-1b-frisian	Test WER	15.99%	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-the-previous-state-of-the-art/speech-recognition-on-common-voice-frisian)](https://paperswithcode.com/sota/speech-recognition-on-common-voice-frisian?p=improving-the-previous-state-of-the-art)`

Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R

- 2023 · Dragoș Alexandru Bălan, Golshid Shekoufandeh ·

Automatic Speech Recognition (ASR), a system that converts human speech to text, has a major role in digitizing human communication. Despite their significance, most of these systems are designed for higher-resourced languages, like English, Mandarin, or Spanish, leaving lower-resourced languages, such as Frisian, underrepresented. To address this issue, our paper introduces a fine-tuned ASR model based on the Wav2Vec 2.0 XLS-R architecture, trained on the Common Voice corpus version 12.0, to transcribe Frisian speech. With a learning rate of 8e-5, our proposed ASR system has achieved a 15.99% word error rate (WER), surpassing the previous state-of-the-art of 16.25% and serving as a benchmark for future research in this field.

PDF