Search Results for author: Owen Kaser

Found 12 papers, 11 papers with code

Faster Remainder by Direct Computation: Applications to Compilers and Software Libraries

2 code implementations • 5 Feb 2019 • Daniel Lemire, Owen Kaser, Nathan Kurz

Currently, the remainder of the division by a constant is computed from the quotient by a multiplication and a subtraction.

Mathematical Software • Performance

Roaring Bitmaps: Implementation of an Optimized Software Library

15 code implementations • 22 Sep 2017 • Daniel Lemire, Owen Kaser, Nathan Kurz, Luca Deri, Chris O'Hara, François Saint-Jacques, Gregory Ssi-Yan-Kai

Compressed bitmap indexes are used in systems such as Git or Oracle to accelerate queries.

Databases

Consistently faster and smaller compressed bitmaps with Roaring

13 code implementations • 21 Mar 2016 • Daniel Lemire, Gregory Ssi-Yan-Kai, Owen Kaser

To better handle these cases, we build a new Roaring hybrid that combines uncompressed bitmaps, packed arrays and RLE compressed segments.

Databases

Faster 64-bit universal hashing using carry-less multiplications

2 code implementations • 11 Mar 2015 • Daniel Lemire, Owen Kaser

Intel and AMD support the Carry-less Multiplication (CLMUL) instruction set in their x64 processors.

Data Structures and Algorithms • Performance

Better bitmap performance with Roaring bitmaps

14 code implementations • 26 Feb 2014 • Samy Chambi, Daniel Lemire, Owen Kaser, Robert Godin

On synthetic and real data, we find that Roaring bitmaps (1) often compress significantly better (e.g., 2 times) and (2) are faster than the compressed alternatives (up to 900 times faster for intersections).

Databases

Compressed bitmap indexes: beyond unions and intersections

4 code implementations • 18 Feb 2014 • Owen Kaser, Daniel Lemire

Compressed bitmap indexes are used to speed up simple aggregate queries in databases.

Databases • Data Structures and Algorithms

Reordering Rows for Better Compression: Beyond the Lexicographic Order

3 code implementations • 9 Jul 2012 • Daniel Lemire, Owen Kaser, Eduardo Gutarra

For minimizing the number of runs in a run-length encoding compression scheme, the best approaches to row-ordering are derived from traveling salesman heuristics, although there is a significant trade-off between running time and compression.

Databases • H.4.0

Strongly universal string hashing is fast

4 code implementations • 22 Feb 2012 • Owen Kaser, Daniel Lemire

Our tests include hash functions designed for processors with the Carry-Less Multiplication (CLMUL) instruction set.

Databases • Data Structures and Algorithms

Reordering Columns for Smaller Indexes

2 code implementations • 7 Sep 2009 • Daniel Lemire, Owen Kaser

Column-oriented indexes, such as projection or bitmap indexes, are compressed by run-length encoding to reduce storage and increase speed.

Databases

Sorting improves word-aligned bitmap indexes

6 code implementations • 23 Jan 2009 • Daniel Lemire, Owen Kaser, Kamel Aouiche

Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage.

Databases

Removing Manually-Generated Boilerplate from Electronic Texts: Experiments with Project Gutenberg e-Books

no code implementations • 13 Jul 2007 • Owen Kaser, Daniel Lemire

We investigate the case of the Project Gutenberg corpus, where most documents are in ASCII format with preambles and epilogues that are often copied and pasted or manually typed.

Recursive n-gram hashing is pairwise independent, at best

2 code implementations • 31 May 2007 • Daniel Lemire, Owen Kaser

We prove that recursive hash families cannot be more than pairwise independent.
