3 code implementations • 2 Nov 2019 • Otmar Ertl
This paper introduces a class of one-pass locality-sensitive hash algorithms for the probability Jaccard similarity that are orders of magnitude faster than the original approach.
4 code implementations • 11 Feb 2019 • Ted Dunning, Otmar Ertl
We present on-line algorithms for computing approximations of rank-based statistics that give high accuracy, particularly near the tails of a distribution, with very small sketches.
Sequential Quantile Estimation • Computation • Data Structures and Algorithms
2 code implementations • 12 Feb 2018 • Otmar Ertl
Minwise hashing has become a standard tool to calculate signatures which allow direct estimation of Jaccard similarities.
Data Structures and Algorithms
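As a rough illustration of how minwise-hashing signatures allow direct estimation of Jaccard similarity, the sketch below implements the classic scheme with one hash function per signature component. It is not the algorithm proposed in the paper above. The function names, the use of salted BLAKE2b as the hash family, and the signature length of 256 are all assumptions made for this example.

```python
import hashlib


def hash64(item: str, seed: int) -> int:
    """Map an item to a 64-bit integer; the seed selects the hash function (assumed family: salted BLAKE2b)."""
    salt = seed.to_bytes(8, "little")
    digest = hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(items, num_hashes: int = 256):
    """Classic MinHash: for each hash function, keep the minimum hash value over the set."""
    unique_items = set(items)
    if not unique_items:
        raise ValueError("cannot compute a signature of an empty set")
    return [min(hash64(x, seed) for x in unique_items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b) -> float:
    """The fraction of equal signature components estimates the Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


if __name__ == "__main__":
    set_a = {f"item{i}" for i in range(100)}
    set_b = {f"item{i}" for i in range(50, 150)}
    # True Jaccard similarity is 50 / 150; the printed estimate varies around it.
    print(estimate_jaccard(minhash_signature(set_a), minhash_signature(set_b)))
```

Each signature component costs one pass over the whole set in this classic form, so the total work grows with the product of set size and signature length.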
1 code implementation • 20 Jun 2017 • Otmar Ertl
This work presents new cardinality estimation methods for data sets recorded by HyperLogLog sketches.
Data Structures and Algorithms
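For context on what a HyperLogLog sketch records, here is a minimal sketch of the data structure together with the original raw estimator and its small-range (linear counting) correction. It does not implement the new estimation methods presented in the paper above. The class name, the 64-bit BLAKE2b hash, and the default precision p = 12 are assumptions chosen for illustration.

```python
import hashlib
import math


class HyperLogLog:
    """Toy HyperLogLog sketch with m = 2**p registers (illustrative, not the paper's estimator)."""

    def __init__(self, p: int = 12):
        if not 7 <= p <= 16:
            raise ValueError("p must be in [7, 16] for the alpha approximation used here")
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m

    def add(self, item: str) -> None:
        digest = hashlib.blake2b(item.encode("utf-8"), digest_size=8).digest()
        h = int.from_bytes(digest, "big")
        # The top p bits select a register.
        index = h >> (64 - self.p)
        # The remaining bits determine the rank: position of the first 1-bit.
        rest = h & ((1 << (64 - self.p)) - 1)
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[index]:
            self.registers[index] = rank

    def estimate(self) -> float:
        m = self.m
        # Bias-correction constant, valid for m >= 128.
        alpha = 0.7213 / (1.0 + 1.079 / m)
        raw = alpha * m * m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        # Small-range correction: fall back to linear counting.
        if raw <= 2.5 * m and zeros > 0:
            return m * math.log(m / zeros)
        return raw


if __name__ == "__main__":
    sketch = HyperLogLog()
    for i in range(50000):
        sketch.add(f"user{i}")
    print(round(sketch.estimate()))
```

The sketch keeps only the maximum rank per register, so adding the same item again never changes its state.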
1 code implementation • 18 Jun 2017 • Otmar Ertl
This paper presents a new algorithm for calculating hash signatures of sets which can be directly used for Jaccard similarity estimation.
Data Structures and Algorithms
1 code implementation • 4 Feb 2017 • Otmar Ertl
This paper presents new methods to estimate the cardinalities of data sets recorded by HyperLogLog sketches.
Data Structures and Algorithms • MSC classes: 68W15, 68W25, 62-07 • ACM classes: E.1; I.1.2; H.2.8