1 code implementation • 2 Sep 2023 • Domenico Amato, Giosuè Lo Bosco, Raffaele Giancarlo
We propose a novel paradigm that, complementing known specialized ones, can produce Learned versions of any Sorted Set Dictionary, for instance, Balanced Binary Search Trees or Binary Search on layouts other than sorted, i.e., Eytzinger.
1 code implementation • 28 Nov 2022 • Dario Malchiodi, Davide Raimondi, Giacomo Fumagalli, Raffaele Giancarlo, Marco Frasca
Learned Bloom Filters, i.e., models induced from data via machine learning techniques and solving the approximate set membership problem, have recently been introduced with the aim of enhancing the performance of standard Bloom Filters, with special focus on space occupancy.
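As a rough illustration of the idea, a Learned Bloom Filter is commonly described as a classifier paired with a small backup Bloom Filter that stores only the keys the classifier would reject, so that no key is ever reported as absent. The sketch below assumes that construction; it is not taken from the paper, and the class names, the `score` function, the threshold `tau`, and the default sizes are all hypothetical.

```python
import hashlib


class BloomFilter:
    """Plain Bloom filter over strings (illustrative, not tuned)."""

    def __init__(self, m_bits, k_hashes):
        self.m = m_bits
        self.k = k_hashes
        self.bits = bytearray((m_bits + 7) // 8)

    def _positions(self, item):
        # Derive k bit positions by salting SHA-256 with a seed.
        for seed in range(self.k):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, item):
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, item):
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(item))


class LearnedBloomFilter:
    """Classifier plus backup filter; `score` and `tau` are assumptions."""

    def __init__(self, keys, score, tau, m_bits=1024, k_hashes=3):
        self.score = score
        self.tau = tau
        self.backup = BloomFilter(m_bits, k_hashes)
        # Only keys the model would miss go into the backup filter.
        for key in keys:
            if score(key) < tau:
                self.backup.add(key)

    def __contains__(self, item):
        # Accept if the model is confident, otherwise ask the backup filter.
        return self.score(item) >= self.tau or item in self.backup
```

The space saving, when there is one, comes from the backup filter holding only the model's false negatives rather than the whole key set; the model itself also occupies space, which is the trade-off the entry above refers to.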
1 code implementation • 21 Feb 2022 • Domenico Amato, Giosuè Lo Bosco, Raffaele Giancarlo
In turn, that would favour the use of Neural Networks as building blocks of Classic Data Structures.
1 code implementation • 5 Jan 2022 • Domenico Amato, Giosuè Lo Bosco, Raffaele Giancarlo
With the use of the Searching on Sorted Sets SOSD Learned Indexing benchmarking software, we investigate how to choose a Search routine for the final stage of searching in a Learned Index.
no code implementations • 13 Dec 2021 • Giacomo Fumagalli, Davide Raimondi, Raffaele Giancarlo, Dario Malchiodi, Marco Frasca
Bloom Filters are a fundamental and pervasive data structure.
1 code implementation • 19 Jul 2021 • Domenico Amato, Giosuè Lo Bosco, Raffaele Giancarlo
In modern applications, model space is a key factor and, in fact, a major open question concerning this area is to assess to what extent one can enjoy the speed-up of Binary Search achieved by Learned Indexes while using constant or nearly constant space models.
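To make the notion of a constant-space model concrete, the following minimal sketch fits a single line mapping keys to ranks, records the largest prediction error, and then runs Binary Search only inside the resulting window. It is an illustration under stated assumptions rather than one of the models studied in the paper; the function names `fit_linear` and `learned_search` are hypothetical.

```python
from bisect import bisect_left


def fit_linear(keys):
    """Least-squares fit rank ~ a*key + b over a sorted list of numbers.

    Returns (a, b, eps), where eps is the largest absolute rank error
    over the keys. The model is three numbers regardless of len(keys).
    """
    n = len(keys)
    mean_x = sum(keys) / n
    mean_y = (n - 1) / 2
    var_x = sum((x - mean_x) ** 2 for x in keys)
    if var_x == 0:
        a = 0.0
    else:
        cov = sum((x - mean_x) * (i - mean_y) for i, x in enumerate(keys))
        a = cov / var_x
    b = mean_y - a * mean_x
    eps = max(abs(i - round(a * x + b)) for i, x in enumerate(keys))
    return a, b, eps


def learned_search(keys, model, q):
    """Return an index of q in keys, or -1 if q is absent."""
    a, b, eps = model
    n = len(keys)
    pred = round(a * q + b)
    # Clamp the window [pred - eps, pred + eps] to valid positions.
    lo = max(0, min(n, pred - eps))
    hi = max(0, min(n, pred + eps + 1))
    i = bisect_left(keys, q, lo, hi)
    return i if i < n and keys[i] == q else -1
```

For example, `learned_search(keys, fit_linear(keys), q)` inspects at most `2 * eps + 1` positions, so the benefit over plain Binary Search depends on how well a single line fits the key distribution.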
no code implementations • 27 Jun 2021 • Giuseppe Cattaneo, Umberto Ferraro Petrillo, Raffaele Giancarlo, Francesco Palini, Chiara Romualdi
Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error).
no code implementations • 20 Jul 2020 • Domenico Amato, Giosuè Lo Bosco, Raffaele Giancarlo
Here we study to what extent Machine Learning Techniques can contribute to obtaining such a speed-up via a systematic experimental comparison of known efficient implementations of Sorted Table Search procedures, with different Data Layouts, and their Learned counterparts developed here.
1 code implementation • 16 Jan 2017 • Umberto Ferraro Petrillo, Gianluca Roscigno, Giuseppe Cattaneo, Raffaele Giancarlo
We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files.