CNN Architectures for Large-Scale Audio Classification

29 Sep 2016Shawn HersheySourish ChaudhuriDaniel P. W. EllisJort F. GemmekeAren JansenR. Channing MooreManoj PlakalDevin PlattRif A. SaurousBryan SeyboldMalcolm SlaneyRon J. WeissKevin Wilson

Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 video-level labels... (read more)

PDF Abstract

Evaluation results from the paper


  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.