Unsupervised Cross-lingual Representation Learning for Speech Recognition

facebookresearch/fairseq 24 Jun 2020

This paper presents XLSR which learns cross-lingual speech representations by pretraining a single model from the raw waveform of speech in multiple languages.

Quantization, Representation Learning

PaddlePaddle/PaddleNLP ACL ARR January 2022

An easy-to-use and powerful NLP library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications.

Few-Shot Learning, Link Prediction

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training

hpcaitech/colossalai 28 Oct 2021

The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing.

2D Human Pose Estimation

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

spotify/basic-pitch 18 Mar 2022

Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems.

Music Transcription

OPT: Open Pre-trained Transformer Language Models

facebookresearch/metaseq 2 May 2022

Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning.

Hate Speech Detection, Language Modelling

Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems

shyamsn97/controllable-ncas 25 Apr 2022

Inspired by cellular growth and self-organization, Neural Cellular Automata (NCAs) have been capable of "growing" artificial cells into images, 3D structures, and even functional machines.

Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning

r-three/t-few 11 May 2022

ICL incurs substantial computational, memory, and storage costs because it involves processing all of the training examples every time a prediction is made.

Few-Shot Text Classification

Thin-Plate Spline Motion Model for Image Animation

yoyo-nb/thin-plate-spline-motion-model 27 Mar 2022

Firstly, we propose thin-plate spline motion estimation to produce a more flexible optical flow, which warps the feature maps of the source image to the feature domain of the driving image.

Image Animation, Motion Estimation

Neo: Generalizing Confusion Matrix Visualization to Hierarchical and Multi-Output Labels

apple/ml-hierarchical-confusion-matrix 24 Oct 2021

The confusion matrix, a ubiquitous visualization for helping people evaluate machine learning models, is a tabular layout that compares predicted class labels against actual class labels over all data instances.
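
As a rough illustration of the tabular layout described above, the following minimal sketch builds a flat (single-level) confusion matrix in plain Python. It is not the hierarchical or multi-output visualization that Neo proposes, and the function name `confusion_matrix` and the toy labels are assumptions made only for this example.

```python
from collections import Counter


def confusion_matrix(actual, predicted, labels):
    """Return a table of counts: rows are actual labels, columns are predicted labels."""
    # Count how often each (actual, predicted) pair occurs over all data instances.
    counts = Counter(zip(actual, predicted))
    # Counter returns 0 for pairs that never occur, so every cell is filled.
    return [[counts[(a, p)] for p in labels] for a in labels]


# Toy data (hypothetical): five instances, two classes.
actual = ["cat", "cat", "dog", "dog", "dog"]
predicted = ["cat", "dog", "dog", "dog", "cat"]
labels = ["cat", "dog"]

matrix = confusion_matrix(actual, predicted, labels)
for label, row in zip(labels, matrix):
    print(label, row)
```

Each diagonal cell counts instances whose predicted label matches the actual label; off-diagonal cells count the misclassifications between a pair of classes.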

A Unified Framework for Implicit Sinkhorn Differentiation

marvin-eisenberger/implicit-sinkhorn 13 May 2022

The Sinkhorn operator has recently experienced a surge of popularity in computer vision and related fields.

