NeurST: Neural Speech Translation Toolkit

NeurST is an open-source toolkit for neural speech translation. It focuses mainly on end-to-end speech translation and is designed to be easy to use, modify, and extend for advanced speech translation research and products. NeurST aims to facilitate speech translation research for NLP researchers and to build reliable benchmarks for the field. It provides step-by-step recipes for feature extraction, data preprocessing, distributed training, and evaluation. In this paper, we introduce the framework design of NeurST and report experimental results on several benchmark datasets, which can serve as reliable baselines for future research. The toolkit is publicly available at https://github.com/bytedance/neurst/ and we will continuously update comparisons of NeurST with other toolkits and studies at https://st-benchmark.github.io/.

ACL 2021 · PDF · Abstract

Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Speech-to-Text Translation | libri-trans | Transformer + ASR Pretrain + SpecAug | Case-insensitive tokenized BLEU | 18.7 | # 1 |
| | | | Case-insensitive sacreBLEU | 17.2 | # 1 |
| | | | Case-sensitive sacreBLEU | 16.3 | # 1 |
| | | | Case-sensitive tokenized BLEU | 17.8 | # 1 |
| Speech-to-Text Translation | libri-trans | Transformer + ASR Pretrain | Case-insensitive tokenized BLEU | 17.9 | # 2 |
| | | | Case-insensitive sacreBLEU | 16.5 | # 2 |
| | | | Case-sensitive sacreBLEU | 15.5 | # 2 |
| | | | Case-sensitive tokenized BLEU | 16.9 | # 2 |
| Speech-to-Text Translation | MuST-C EN->DE | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.8 | # 7 |
| Speech-to-Text Translation | MuST-C EN->ES | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 26.8 | # 5 |
| Speech-to-Text Translation | MuST-C EN->ES | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 27.4 | # 4 |
| Speech-to-Text Translation | MuST-C EN->FR | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 33.3 | # 2 |
| Speech-to-Text Translation | MuST-C EN->FR | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 32.3 | # 3 |
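The table distinguishes case-sensitive vs. case-insensitive scoring and tokenized BLEU vs. sacreBLEU (which applies its own standardized tokenization to detokenized output). To make the case-sensitivity distinction concrete, below is a minimal, illustrative sketch of corpus-level BLEU over pre-tokenized, single-reference text with an optional lowercasing switch; this is not NeurST's or sacreBLEU's implementation (no smoothing, whitespace tokenization assumed), and the function names are ours.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def corpus_bleu(hyps, refs, max_n=4, lowercase=False):
    """Unsmoothed corpus BLEU (0-100) for whitespace-tokenized text,
    one reference per hypothesis. lowercase=True gives the
    case-insensitive variant reported in the table."""
    match = [0] * max_n   # clipped n-gram matches per order
    total = [0] * max_n   # hypothesis n-gram counts per order
    hyp_len = ref_len = 0
    for hyp, ref in zip(hyps, refs):
        if lowercase:
            hyp, ref = hyp.lower(), ref.lower()
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            hc, rc = ngrams(h, n), ngrams(r, n)
            # clip each hypothesis n-gram count by its reference count
            match[n - 1] += sum(min(c, rc[g]) for g, c in hc.items())
            total[n - 1] += max(len(h) - n + 1, 0)
    if min(match) == 0:          # any order with zero matches -> BLEU 0
        return 0.0
    log_prec = sum(math.log(m / t) for m, t in zip(match, total)) / max_n
    # brevity penalty for hypotheses shorter than the references
    bp = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / max(hyp_len, 1))
    return 100 * bp * math.exp(log_prec)
```

For example, `corpus_bleu(["The cat sat down"], ["the cat sat down"], lowercase=True)` scores higher than the case-sensitive call on the same pair, mirroring why the case-insensitive rows in the table are consistently above their case-sensitive counterparts.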
