Browse State-of-the-Art
Datasets
Methods
More
Newsletter
RC2022
About
Trends
Portals
Libraries
Sign In
Subscribe to the PwC Newsletter
×
Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets.
Read previous issues
Join the community
×
You need to
log in
to edit.
You can
create a new account
if you don't have one.
Browse SoTA
> Speech
Speech
366 benchmarks • 86 tasks • 258 datasets • 3460 papers with code
Text Generation
Text Generation
179 benchmarks
1866 papers with code
Dialogue Generation
14 benchmarks
250 papers with code
Data-to-Text Generation
44 benchmarks
111 papers with code
Multi-Document Summarization
5 benchmarks
107 papers with code
Story Generation
8 benchmarks
102 papers with code
See all 27 tasks
Speech Recognition
Speech Recognition
199 benchmarks
1292 papers with code
Automatic Speech Recognition (ASR)
13 benchmarks
583 papers with code
Visual Speech Recognition
10 benchmarks
50 papers with code
Robust Speech Recognition
23 papers with code
Target Speaker Extraction
15 papers with code
See all 12 tasks
Speech Emotion Recognition
Vocal Bursts Intensity Prediction
1 benchmark
777 papers with code
Vocal Bursts Valence Prediction
1 benchmark
265 papers with code
Vocal Bursts Type Prediction
1 benchmark
155 papers with code
Cultural Vocal Bursts Intensity Prediction
1 benchmark
93 papers with code
Text-to-Image Generation
16 benchmarks
434 papers with code
Deblurring
32 benchmarks
384 papers with code
Conformal Prediction
224 papers with code
Text Simplification
11 benchmarks
125 papers with code
Self-Supervised Image Classification
3 benchmarks
95 papers with code
See all 18 tasks
Emotion Recognition
Emotion Recognition
59 benchmarks
562 papers with code
Speech Emotion Recognition
19 benchmarks
128 papers with code
Emotion Recognition in Conversation
12 benchmarks
79 papers with code
Multimodal Emotion Recognition
3 benchmarks
72 papers with code
Facial Emotion Recognition
2 benchmarks
26 papers with code
See all 13 tasks
Dialogue
Dialogue Generation
14 benchmarks
250 papers with code
Dialogue State Tracking
7 benchmarks
139 papers with code
Task-Oriented Dialogue Systems
6 benchmarks
128 papers with code
Visual Dialog
8 benchmarks
56 papers with code
Dialogue Understanding
19 benchmarks
35 papers with code
See all 22 tasks
Chatbot
Dialogue Generation
14 benchmarks
250 papers with code
Chatbot
15 benchmarks
239 papers with code
Speech Synthesis
Speech Synthesis
18 benchmarks
339 papers with code
Expressive Speech Synthesis
13 papers with code
Emotional Speech Synthesis
7 papers with code
Speech Synthesis - Gujarati
2 benchmarks
2 papers with code
text-to-speech translation
2 papers with code
See all 16 tasks
Speech Enhancement
Speech Enhancement
24 benchmarks
260 papers with code
Bandwidth Extension
6 benchmarks
19 papers with code
Speech Dereverberation
5 benchmarks
19 papers with code
Packet Loss Concealment
4 papers with code
Speech Intelligibility Evaluation
Speaker Verification
Speaker Verification
13 benchmarks
193 papers with code
Audio Deepfake Detection
1 benchmark
19 papers with code
Text-Independent Speaker Verification
19 papers with code
Text-Dependent Speaker Verification
2 papers with code
Audio Generation
Audio Generation
8 benchmarks
106 papers with code
Voice Cloning
28 papers with code
Audio Super-Resolution
4 benchmarks
15 papers with code
Room Impulse Response (RIR)
14 papers with code
Video-to-Sound Generation
1 benchmark
6 papers with code
Voice Conversion
Voice Conversion
3 benchmarks
165 papers with code
Spoken Language Understanding
Spoken Language Understanding
17 benchmarks
130 papers with code
Spoken language identification
12 benchmarks
12 papers with code
Speech Tokenization
4 papers with code
Speech Separation
Speech Separation
20 benchmarks
116 papers with code
Speech Extraction
1 benchmark
9 papers with code
Keyword Spotting
Keyword Spotting
13 benchmarks
108 papers with code
Small-Footprint Keyword Spotting
7 papers with code
Visual Keyword Spotting
3 benchmarks
4 papers with code
Text-To-Speech Synthesis
Text-To-Speech Synthesis
7 benchmarks
101 papers with code
Prosody Prediction
1 benchmark
4 papers with code
Zero-Shot Multi-Speaker TTS
3 papers with code
Speaker Recognition
Speaker Recognition
1 benchmark
97 papers with code
Cultural Vocal Bursts Intensity Prediction
Cultural Vocal Bursts Intensity Prediction
1 benchmark
93 papers with code
Speaker Diarization
Speaker Diarization
15 benchmarks
90 papers with code
Speaker Identification
Speaker Identification
4 benchmarks
71 papers with code
Speech Representation Learning
Speech Representation Learning
46 papers with code
Speech-to-Speech Translation
Speech-to-Speech Translation
3 benchmarks
38 papers with code
Audio-Visual Speech Recognition
Audio-Visual Speech Recognition
4 benchmarks
34 papers with code
Speech Denoising
Speech Denoising
2 benchmarks
33 papers with code
Singing Voice Synthesis
Singing Voice Synthesis
27 papers with code
Spoken Dialogue Systems
Spoken Dialogue Systems
25 papers with code
Speaker Separation
Speaker Separation
13 papers with code
Multi-Speaker Source Separation
6 papers with code
Acoustic echo cancellation
Acoustic echo cancellation
14 papers with code
Acoustic Modelling
Acoustic Modelling
12 papers with code
Pronunciation Assessment
Phone-level pronunciation scoring
1 benchmark
5 papers with code
Utterance-level pronounciation scoring
1 benchmark
2 papers with code
Word-level pronunciation scoring
1 benchmark
2 papers with code
Pronunciation Assessment
Arabic Text Diacritization
Arabic Text Diacritization
2 benchmarks
7 papers with code
Unsupervised Speech Recognition
Unsupervised Speech Recognition
7 papers with code
Text-Independent Speaker Recognition
Text-Independent Speaker Recognition
6 papers with code
Visual Speech Recognition
Lip to Speech Synthesis
8 benchmarks
6 papers with code
Spoken Command Recognition
Spoken Command Recognition
1 benchmark
5 papers with code
Voice Similarity
Voice Similarity
5 papers with code
Manner Of Articulation Detection
Manner Of Articulation Detection
2 papers with code
Speaker Profiling
Speaker Profiling
2 papers with code
Voice Query Recognition
Voice Query Recognition
1 benchmark
2 papers with code
Acoustic Question Answering
Acoustic Question Answering
1 papers with code
Automatic Speech Recognition (ASR)
Automatic Phoneme Recognition
6 benchmarks
1 papers with code
Speech Interruption Detection
Speech Interruption Detection
1 papers with code
Speech-to-Gesture Translation
Speech-to-Gesture Translation
1 papers with code
Speaking Style Synthesis
Speaking Style Synthesis