Improving the Cluster Structure Extracted from OPTICS Plots

scikit-learn/scikit-learn Lernen, Wissen, Daten, Analysen 2018

Density-based clustering is closely associated with the two algorithms DBSCAN and OPTICS.

Scikit-learn: Machine Learning in Python

scikit-learn/scikit-learn 2 Jan 2012

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems.

BIG-bench Machine Learning Clustering +3

SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives

scikit-learn/scikit-learn NeurIPS 2014

In this work we introduce a new optimisation method called SAGA in the spirit of SAG, SDCA, MISO and SVRG, a set of recently proposed incremental gradient algorithms with fast linear convergence rates.

Llama 2: Open Foundation and Fine-Tuned Chat Models

facebookresearch/llama 18 Jul 2023

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.

Arithmetic Reasoning +5

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints

facebookresearch/llama 22 May 2023

Multi-query attention (MQA), which only uses a single key-value head, drastically speeds up decoder inference.

Decoder Language Modelling

Efficient Neural Audio Synthesis

CorentinJ/Real-Time-Voice-Cloning ICML 2018

The small number of weights in a Sparse WaveRNN makes it possible to sample high-fidelity audio on a mobile CPU in real time.

Audio Synthesis Speech Synthesis +1

Generalized End-to-End Loss for Speaker Verification

CorentinJ/Real-Time-Voice-Cloning 28 Oct 2017

In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss, which makes the training of speaker verification models more efficient than our previous tuple-based end-to-end (TE2E) loss function.

Domain Adaptation Speaker Verification