Search Results

What you can cram into a single vector: Probing sentence embeddings for linguistic properties

6 code implementations3 May 2018

Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing.

General Classification Sentence +2

Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package

1 code implementation EMNLP 2018

Vector space embedding models like word2vec, GloVe, fastText, and ELMo are extremely popular representations in natural language processing (NLP) applications.

Word Embeddings

Document Embedding with Paragraph Vectors

5 code implementations29 Jul 2015

Paragraph Vectors has been recently proposed as an unsupervised method for learning distributed representations for pieces of texts.

Document Embedding Sentiment Analysis +1

Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs

45 code implementations30 Mar 2016

We present a new approach for the approximate K-nearest neighbor search based on navigable small world graphs with controllable hierarchy (Hierarchical NSW, HNSW).

Polysemous codes

11 code implementations7 Sep 2016

This paper considers the problem of approximate nearest neighbor search in the compressed domain.

Quantization

Billion-scale similarity search with GPUs

14 code implementations28 Feb 2017

Similarity search finds application in specialized database systems handling complex data such as images or videos, which are typically represented by high-dimensional features and require specific indexing structures.

Image Similarity Search Quantization

Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings

1 code implementation11 Jul 2021

The use of phonological features (PFs) potentially allows language-specific phones to remain linked in training, which is highly desirable for information sharing for multilingual and crosslingual speech recognition methods for low-resourced languages.

speech-recognition Speech Recognition

Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings

8 code implementations ICLR 2020

Our main insight is that queries can be embedded as boxes (i. e., hyper-rectangles), where a set of points inside the box corresponds to a set of answer entities of the query.

Complex Query Answering

Accelerating Large-Scale Inference with Anisotropic Vector Quantization

4 code implementations ICML 2020

Based on the observation that for a given query, the database points that have the largest inner products are more relevant, we develop a family of anisotropic quantization loss functions.

Benchmarking Quantization

Quicker ADC : Unlocking the hidden potential of Product Quantization with SIMD

3 code implementations21 Dec 2018

Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems.

Quantization Retrieval