no code implementations • 18 Jan 2021 • Diego Santoro, Leonardo Pellegrina, Fabio Vandin
The extraction of $k$-mers is a fundamental component in many complex analyses of large next-generation sequencing datasets, including reads classification in genomics and the characterization of RNA-seq datasets.
no code implementations • 22 Oct 2020 • Leonardo Pellegrina
We derive sharper probabilistic concentration bounds for the Monte Carlo Empirical Rademacher Averages (MCERA), which are proved through recent results on the concentration of self-bounding functions.
1 code implementation • 16 Jun 2020 • Leonardo Pellegrina, Cyrus Cousins, Fabio Vandin, Matteo Riondato
To show the practical use of MCRapper, we employ it to develop an algorithm TFP-R for the task of True Frequent Pattern (TFP) mining.