TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Audio Classification	ICBHI Respiratory Sound Database	AST (Patch-Mix CL)	ICBHI Score	62.37	# 1
Audio Classification	ICBHI Respiratory Sound Database	AST (Patch-Mix CL)	Sensitivity	43.07	# 3
Audio Classification	ICBHI Respiratory Sound Database	AST (Patch-Mix CL)	Specificity	81.66	# 1
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	Sensitivity	41.97	# 6
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	Specificity	77.14	# 5
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	ICBHI Score	59.55	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/patch-mix-contrastive-learning-with-audio/audio-classification-on-icbhi-respiratory)](https://paperswithcode.com/sota/audio-classification-on-icbhi-respiratory?p=patch-mix-contrastive-learning-with-audio)`

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

23 May 2023 · Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun ·

Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.

PDF Abstract

Code

Add Remove Mark official

raymin0223/patch-mix_contrastive_le… official

Tasks

Add Remove

Audio Classification

Contrastive Learning

Sound Classification

Datasets

ImageNet

AudioSet

ICBHI Respiratory Sound Database

Results from the Paper

Edit

Ranked #1 on Audio Classification on ICBHI Respiratory Sound Database (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Audio Classification	ICBHI Respiratory Sound Database	AST (Patch-Mix CL)	ICBHI Score	62.37	# 1	Compare
			Sensitivity	43.07	# 3	Compare
			Specificity	81.66	# 1	Compare
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	Sensitivity	41.97	# 6	Compare
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	Specificity	77.14	# 5	Compare
Audio Classification	ICBHI Respiratory Sound Database	AST (fine-tuning)	ICBHI Score	59.55	# 5	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Contrastive Learning • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove