Few-Shot Audio Classification

3 papers with code • 10 benchmarks • 9 datasets

Few-shot audio classification is the task of classifying audio signals from only a few labelled examples per class. It presents a challenge distinct from other few-shot domains because models must also handle the temporal dependencies in the signal.

Like other few-shot problems, few-shot audio classification can be tackled in a variety of ways, from supervised meta-learning on the same primary dataset to pre-training on an external dataset and using a linear readout. For this reason, results on each dataset leaderboard should be tagged with the approach used, e.g. "Within Dataset Meta-Learning".
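As a rough illustration of the "pre-train externally, then linear readout" route, the sketch below samples one N-way K-shot episode from a pool of frozen embeddings and fits a linear classifier on the support set. It is not taken from any of the listed papers: the function names (`sample_episode`, `fit_linear_readout`, `predict`) are hypothetical, the embeddings are synthetic stand-ins for the output of a pre-trained encoder, and closed-form ridge regression onto one-hot targets is just one simple choice of linear readout.

```python
import numpy as np

def sample_episode(features, labels, n_way, k_shot, n_query, rng):
    """Draw one N-way K-shot episode from a pool of frozen embeddings."""
    classes = rng.choice(np.unique(labels), size=n_way, replace=False)
    support_x, support_y, query_x, query_y = [], [], [], []
    for episode_label, cls in enumerate(classes):
        # Shuffle this class's examples, then split into support and query
        idx = rng.permutation(np.flatnonzero(labels == cls))
        support_idx = idx[:k_shot]
        query_idx = idx[k_shot:k_shot + n_query]
        support_x.append(features[support_idx])
        support_y.extend([episode_label] * len(support_idx))
        query_x.append(features[query_idx])
        query_y.extend([episode_label] * len(query_idx))
    return (np.concatenate(support_x), np.array(support_y),
            np.concatenate(query_x), np.array(query_y))

def fit_linear_readout(x, y, n_way, ridge=1e-3):
    """Closed-form ridge regression onto one-hot targets (assumed readout)."""
    x_b = np.hstack([x, np.ones((len(x), 1))])  # append a bias column
    targets = np.eye(n_way)[y]
    gram = x_b.T @ x_b + ridge * np.eye(x_b.shape[1])
    return np.linalg.solve(gram, x_b.T @ targets)

def predict(weights, x):
    """Pick the class with the largest linear score."""
    x_b = np.hstack([x, np.ones((len(x), 1))])
    return np.argmax(x_b @ weights, axis=1)

# Synthetic stand-in for embeddings produced by a frozen pre-trained encoder
rng = np.random.default_rng(0)
n_classes, per_class, dim = 10, 20, 16
centres = rng.normal(scale=5.0, size=(n_classes, dim))
labels = np.repeat(np.arange(n_classes), per_class)
features = centres[labels] + rng.normal(size=(len(labels), dim))

sx, sy, qx, qy = sample_episode(features, labels, n_way=5, k_shot=5,
                                n_query=10, rng=rng)
weights = fit_linear_readout(sx, sy, n_way=5)
accuracy = float(np.mean(predict(weights, qx) == qy))
print(f"episode accuracy: {accuracy:.2f}")
```

In practice, accuracy would be averaged over many such episodes, and a meta-learning approach would instead train the encoder itself across episodes drawn from the primary dataset.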

Benchmarks

These leaderboards are used to track progress in Few-Shot Audio Classification.

Dataset	Best Model
NSynth	Meta-Curvature (CRNN)
FSDKaggle2018	MAML (CRNN)
VoxCeleb1	Meta-Curvature (CRNN)
BirdClef 2020 (Pruned)	Meta-Curvature (CRNN)
ESC-50	Meta-Curvature (CRNN)
Watkins Marine Mammal Sounds	MT-SLVR (SimCLR + MLAP) w/ Parallel Adapters (FSD50K, RN18)
Speech Command v2	SimCLR (FSD50K, RN18)
CREMA-D	MT-SLVR (SimCLR + MLAP) w/ Parallel Adapters (FSD50K, RN18)
Speech Accent Archive	MT-SLVR (SimCLR + MLAP) w/ Parallel Adapters (FSD50K, RN18)
Common Voice	MT-SLVR (SimCLR + MLAP) w/ Parallel Adapters (FSD50K, RN18)

Most implemented papers

MetaAudio: A Few-Shot Audio Classification Benchmark

cheggan/metaaudio-a-few-shot-audio-classification-benchmark • 5 Apr 2022

Currently available benchmarks for few-shot learning (machine learning with few training examples) are limited in the domains they cover, primarily focusing on image classification.

MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) Representations

cheggan/mt-slvr • 29 May 2023

Contrastive self-supervised learning has gained attention for its ability to create high-quality representations from large unlabelled data sets.

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

jinhualiang/apt • 30 Nov 2023

Moreover, we improve the framework of audio language model by using interleaved audio-text embeddings as the input sequence.

