Unsupervised Cross-lingual Representation Learning for Speech Recognition

facebookresearch/fairseq 24 Jun 2020

This paper presents XLSR which learns cross-lingual speech representations by pretraining a single model from the raw waveform of speech in multiple languages.

Quantization Representation Learning +1

16,956
8.28 stars / hour

PaddleNLP

PaddlePaddle/PaddleNLP ACL ARR January 2022

Easy-to-use and powerful NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.

Few-Shot Learning Link Prediction +2

3,599
1.49 stars / hour

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training

hpcaitech/colossalai 28 Oct 2021

The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing.

2D Human Pose Estimation

2,876
1.20 stars / hour

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

spotify/basic-pitch 18 Mar 2022

Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems.

Music Transcription

98
0.78 stars / hour

OPT: Open Pre-trained Transformer Language Models

facebookresearch/metaseq 2 May 2022

Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning.

Hate Speech Detection Language Modelling +1

2,872
0.60 stars / hour

Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems

shyamsn97/controllable-ncas 25 Apr 2022

Inspired by cellular growth and self-organization, Neural Cellular Automata (NCAs) have been capable of "growing" artificial cells into images, 3D structures, and even functional machines.

19
0.53 stars / hour

Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning

r-three/t-few 11 May 2022

ICL incurs substantial computational, memory, and storage costs because it involves processing all of the training examples every time a prediction is made.

Few-Shot Text Classification

84
0.53 stars / hour

Thin-Plate Spline Motion Model for Image Animation

yoyo-nb/thin-plate-spline-motion-model 27 Mar 2022

Firstly, we propose thin-plate spline motion estimation to produce a more flexible optical flow, which warps the feature maps of the source image to the feature domain of the driving image.

Image Animation Motion Estimation +1

367
0.47 stars / hour

Neo: Generalizing Confusion Matrix Visualization to Hierarchical and Multi-Output Labels

apple/ml-hierarchical-confusion-matrix 24 Oct 2021

The confusion matrix, a ubiquitous visualization for helping people evaluate machine learning models, is a tabular layout that compares predicted class labels against actual class labels over all data instances.

157
0.42 stars / hour

A Unified Framework for Implicit Sinkhorn Differentiation

marvin-eisenberger/implicit-sinkhorn 13 May 2022

The Sinkhorn operator has recently experienced a surge of popularity in computer vision and related fields.

17
0.40 stars / hour