Browse State-of-the-Art
Datasets
Methods
More
Newsletter
RC2022
About
Trends
Portals
Libraries
Sign In
Subscribe to the PwC Newsletter
×
Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets.
Read previous issues
Join the community
×
You need to
log in
to edit.
You can
create a new account
if you don't have one.
Or, discuss a change on
Slack
.
Browse SoTA
> Audio
Audio
90 benchmarks • 67 tasks • 150 datasets • 1220 papers with code
Classification
Classification
323 benchmarks
3044 papers with code
Text Classification
217 benchmarks
1035 papers with code
Graph Classification
64 benchmarks
355 papers with code
Medical Image Classification
4 benchmarks
104 papers with code
Plant Phenotyping
21 papers with code
See all 17 tasks
Speech Recognition
Speech Recognition
358 benchmarks
1025 papers with code
Automatic Speech Recognition (ASR)
7 benchmarks
454 papers with code
Visual Speech Recognition
10 benchmarks
35 papers with code
Robust Speech Recognition
21 papers with code
Distant Speech Recognition
2 benchmarks
10 papers with code
See all 11 tasks
2D Semantic Segmentation
Image Segmentation
2 benchmarks
1324 papers with code
Text Style Transfer
3 benchmarks
76 papers with code
Scene Parsing
57 benchmarks
69 papers with code
2D Semantic Segmentation
80 benchmarks
34 papers with code
Reflection Removal
5 benchmarks
24 papers with code
See all 13 tasks
Few-Shot Learning
Few-Shot Learning
48 benchmarks
951 papers with code
One-Shot Learning
1 benchmark
87 papers with code
Few-Shot Semantic Segmentation
16 benchmarks
72 papers with code
Cross-Domain Few-Shot
1 benchmark
44 papers with code
Unsupervised Few-Shot Learning
11 papers with code
See all 13 tasks
Emotion Recognition
Emotion Recognition
49 benchmarks
405 papers with code
Speech Emotion Recognition
17 benchmarks
89 papers with code
Emotion Recognition in Conversation
12 benchmarks
59 papers with code
Multimodal Emotion Recognition
3 benchmarks
43 papers with code
Emotion-Cause Pair Extraction
2 benchmarks
17 papers with code
See all 13 tasks
Speech Synthesis
Speech Synthesis
17 benchmarks
270 papers with code
Expressive Speech Synthesis
10 papers with code
Emotional Speech Synthesis
3 papers with code
text-to-speech translation
2 papers with code
Speech Synthesis - Assamese
1 benchmark
1 papers with code
See all 17 tasks
Accented Speech Recognition
Speech Synthesis
17 benchmarks
270 papers with code
Conformal Prediction
112 papers with code
Music Source Separation
3 benchmarks
51 papers with code
Audio Source Separation
6 benchmarks
44 papers with code
Decision Making Under Uncertainty
39 papers with code
Robust Speech Recognition
21 papers with code
See all 9 tasks
Speech Enhancement
Speech Enhancement
17 benchmarks
208 papers with code
Bandwidth Extension
1 benchmark
15 papers with code
Speech Dereverberation
4 benchmarks
15 papers with code
Packet Loss Concealment
4 papers with code
Language Identification
Language Identification
6 benchmarks
112 papers with code
Dialect Identification
31 papers with code
Native Language Identification
1 benchmark
5 papers with code
Audio Classification
Audio Classification
23 benchmarks
121 papers with code
Environmental Sound Classification
3 benchmarks
23 papers with code
Audio Multiple Target Classification
1 papers with code
Semi-supervised Audio Classification
1 papers with code
Voice Conversion
Voice Conversion
2 benchmarks
138 papers with code
Music Generation
Music Generation
116 papers with code
Music Texture Transfer
1 papers with code
DeepFake Detection
DeepFake Detection
6 benchmarks
103 papers with code
Synthetic Speech Detection
8 papers with code
Human Detection of Deepfakes
1 papers with code
Multimodal Forgery Detection
1 benchmark
1 papers with code
Audio Generation
Audio Generation
7 benchmarks
54 papers with code
Voice Cloning
15 papers with code
Audio Super-Resolution
4 benchmarks
13 papers with code
Room Impulse Response (RIR)
9 papers with code
Text-To-Speech Synthesis
Text-To-Speech Synthesis
6 benchmarks
84 papers with code
Prosody Prediction
1 benchmark
2 papers with code
Zero-Shot Multi-Speaker TTS
2 papers with code
Sound Event Detection
Sound Event Detection
4 benchmarks
67 papers with code
Audio Source Separation
Audio Source Separation
6 benchmarks
44 papers with code
Target Sound Extraction
2 benchmarks
2 papers with code
Directional Hearing
2 benchmarks
1 papers with code
Single-Label Target Sound Extraction
Sound Classification
Sound Classification
43 papers with code
Audio Tagging
Audio Tagging
1 benchmark
40 papers with code
Audio captioning
Audio captioning
4 benchmarks
33 papers with code
Zero-shot Audio Captioning
2 benchmarks
1 papers with code
Acoustic Scene Classification
Acoustic Scene Classification
3 benchmarks
34 papers with code
Environmental Sound Classification
Environmental Sound Classification
3 benchmarks
23 papers with code
Self-Supervised Sound Classification
1 papers with code
Sound Event Localization and Detection
Sound Event Localization and Detection
4 benchmarks
22 papers with code
Audio Signal Processing
Audio Signal Processing
18 papers with code
Audio Effects Modeling
1 papers with code
Instrument Recognition
Instrument Recognition
2 benchmarks
19 papers with code
Voice Anti-spoofing
Voice Anti-spoofing
3 benchmarks
12 papers with code
Direction of Arrival Estimation
Direction of Arrival Estimation
1 benchmark
11 papers with code
Instance Search
Instance Search
9 papers with code
Audio Fingerprint
1 papers with code
Audio inpainting
Audio inpainting
10 papers with code
Text-to-Music Generation
Text-to-Music Generation
2 benchmarks
10 papers with code
Audio Denoising
Audio Denoising
3 benchmarks
9 papers with code
Online Beat Tracking
Inference Optimization
9 papers with code
Chord Recognition
Chord Recognition
7 papers with code
Audio-Visual Synchronization
Audio-Visual Synchronization
5 papers with code
Audio Effects Modeling
Pitch control
2 papers with code
Timbre Interpolation
1 papers with code
Audio declipping
Audio declipping
3 papers with code
Music Compression
Music Compression
3 papers with code
Visually Guided Sound Source Separation
Visually Guided Sound Source Separation
3 papers with code
Bird Classification
Bird Audio Detection
2 papers with code
Bird Classification
Bird Species Classification With Audio-Visual Data
Hearing Aid and device processing
Cadenza 1 - Task 1 - Headphone
1 benchmark
1 papers with code
Cadenza 1 - Task 2 - In Car
1 benchmark
1 papers with code
Hearing Aid and device processing
Audio Signal Recognition
Audio Signal Recognition
1 papers with code
Gunshot Detection
1 papers with code
Vowel Classification
Vowel Classification
2 papers with code
fake voice detection
fake voice detection
1 benchmark
2 papers with code
Acoustic Novelty Detection
Acoustic Novelty Detection
1 benchmark
1 papers with code
Audio Dequantization
Audio Dequantization
1 papers with code
Directional Hearing
Real-time Directional Hearing
1 benchmark
1 papers with code
Shooter Localization
Shooter Localization
1 papers with code
Soundscape evaluation
Soundscape evaluation
1 papers with code
Speaker Orientation
Speaker Orientation
1 papers with code
Target Sound Extraction
Streaming Target Sound Extraction
1 benchmark
1 papers with code
Active Speaker Localization
Active Speaker Localization