About

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Subtasks

Datasets

Greatest papers with code

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights

ICLR 2021 rwightman/pytorch-image-models

Because of the scale invariance, this modification only alters the effective step sizes without changing the effective update directions, thus enjoying the original convergence properties of GD optimizers.

AUDIO CLASSIFICATION IMAGE CLASSIFICATION LANGUAGE MODELLING OBJECT DETECTION

Self-Supervised MultiModal Versatile Networks

NeurIPS 2020 deepmind/deepmind-research

In particular, we explore how best to combine the modalities, such that fine-grained representations of the visual and audio modalities can be maintained, whilst also integrating text into a common embedding.

ACTION RECOGNITION IN VIDEOS AUDIO CLASSIFICATION SELF-SUPERVISED ACTION RECOGNITION SELF-SUPERVISED AUDIO CLASSIFICATION

Perceiver: General Perception with Iterative Attention

4 Mar 2021lucidrains/perceiver-pytorch

The perception models used in deep learning on the other hand are designed for individual modalities, often relying on domain-specific assumptions such as the local grid structures exploited by virtually all existing vision models.

3D POINT CLOUD CLASSIFICATION AUDIO CLASSIFICATION IMAGE CLASSIFICATION

Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

9 Jul 2018soerenab/AudioMNIST

Interpretability of deep neural networks is a recently emerging area of machine learning research targeting a better understanding of how models perform feature selection and derive their classification decisions.

AUDIO CLASSIFICATION DECISION MAKING FEATURE SELECTION

CNN Architectures for Large-Scale Audio Classification

29 Sep 2016harritaylor/torchvggish

Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio.

4 AUDIO CLASSIFICATION

Ubicoustics: Plug-and-Play Acoustic Activity Recognition

14 Oct 2018FIGLAB/ubicoustics

Despite sound being a rich source of information, computing devices with microphones do not leverage audio to glean useful insights about their physical and social context.

ACTIVITY RECOGNITION DATA AUGMENTATION ENVIRONMENTAL SOUND CLASSIFICATION SOUND EVENT DETECTION

Look, Listen and Learn

ICCV 2017 marl/l3embedding

We consider the question: what can be learnt by looking at and listening to a large number of unlabelled videos?

AUDIO CLASSIFICATION

Unified Probabilistic Deep Continual Learning through Generative Replay and Open Set Recognition

28 May 2019MrtnMndt/OCDVAEContinualLearning

We introduce a probabilistic approach to unify open set recognition with the prevention of catastrophic forgetting in deep continual learning, based on variational Bayesian inference.

AUDIO CLASSIFICATION BAYESIAN INFERENCE CONTINUAL LEARNING OPEN SET LEARNING

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

NeurIPS 2020 HumamAlwassel/XDC

To the best of our knowledge, XDC is the first self-supervised learning method that outperforms large-scale fully-supervised pretraining for action recognition on the same architecture.

AUDIO CLASSIFICATION DEEP CLUSTERING REPRESENTATION LEARNING SELF-SUPERVISED ACTION RECOGNITION SELF-SUPERVISED AUDIO CLASSIFICATION SELF-SUPERVISED LEARNING