1 code implementation • 1 Feb 2023 • Nicholas Matsumoto, Anil Kumar Saini, Pedro Ribeiro, Hyunjun Choi, Alena Orlenko, Leo-Pekka Lyytikäinen, Jari O Laurikka, Terho Lehtimäki, Sandra Batista, Jason H. Moore
In many evolutionary computation systems, parent selection methods can affect, among other things, convergence to a solution.
1 code implementation • 6 Dec 2022 • Pedro Henrique Ribeiro, Patryk Orzechowski, Joost Wagenaar, Jason H. Moore
Automated machine learning (AutoML) algorithms have grown in popularity due to their high performance and flexibility to adapt to different problems and data sets.
no code implementations • 24 Oct 2022 • Mateusz Godzik, Jacek Dajda, Marek Kisiel-Dorohinicki, Aleksander Byrski, Leszek Rutkowski, Patryk Orzechowski, Joost Wagenaar, Jason H. Moore
The discussed modifications are evaluated based on a number of difficult continuous-optimization benchmarks.
no code implementations • 30 Jun 2022 • Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz
When seeking a predictive model in biomedical data, one often has more than a single objective in mind, e. g., attaining both high accuracy and low complexity (to promote interpretability).
no code implementations • 25 Jun 2022 • Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz
We recently highlighted a fundamental problem recognized to confound algorithmic optimization, namely, \textit{conflating} the objective with the objective function.
no code implementations • 25 Jun 2022 • Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz
We have recently presented SAFE -- Solution And Fitness Evolution -- a commensalistic coevolutionary algorithm that maintains two coevolving populations: a population of candidate solutions and a population of candidate objective functions.
4 code implementations • 29 Jul 2021 • William La Cava, Patryk Orzechowski, Bogdan Burlacu, Fabrício Olivetti de França, Marco Virgolin, Ying Jin, Michael Kommenda, Jason H. Moore
We assess 14 symbolic regression methods and 7 machine learning methods on a set of 252 diverse regression problems.
no code implementations • 22 Jul 2021 • Roland Albert A. Romero, Mariefel Nicole Y. Deypalan, Suchit Mehrotra, John Titus Jungao, Natalie E. Sheils, Elisabetta Manduchi, Jason H. Moore
We ascertain and compare the performances of AutoML tools on large, highly imbalanced healthcare datasets.
1 code implementation • 14 Jul 2021 • Patryk Orzechowski, Jason H. Moore
Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial for determine their scope of application.
1 code implementation • 3 May 2021 • Paweł Renc, Patryk Orzechowski, Aleksander Byrski, Jarosław Wąs, Jason H. Moore
Biclustering is a data mining technique which searches for local patterns in numeric tabular data with main application in bioinformatics.
3 code implementations • 30 Nov 2020 • Joseph D. Romano, Trang T. Le, William La Cava, John T. Gregg, Daniel J. Goldberg, Natasha L. Ray, Praneel Chakraborty, Daniel Himmelstein, Weixuan Fu, Jason H. Moore
Motivation: Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets.
2 code implementations • 28 Aug 2020 • Ryan J. Urbanowicz, Pranshu Suri, Yuhan Cui, Jason H. Moore, Karen Ruth, Rachael Stolzenberg-Solomon, Shannon M. Lynch
Machine learning (ML) offers a collection of powerful approaches for detecting and modeling associations, often applied to data having a large number of features and/or complex associations.
no code implementations • 7 Jul 2020 • Thomas Bartz-Beielstein, Carola Doerr, Daan van den Berg, Jakob Bossek, Sowmya Chandrasekaran, Tome Eftimov, Andreas Fischbach, Pascal Kerschke, William La Cava, Manuel Lopez-Ibanez, Katherine M. Malan, Jason H. Moore, Boris Naujoks, Patryk Orzechowski, Vanessa Volz, Markus Wagner, Thomas Weise
This survey compiles ideas and recommendations from more than a dozen researchers with different backgrounds and from different institutes around the world.
2 code implementations • 11 Jun 2020 • Joseph D. Romano, Trang T. Le, Weixuan Fu, Jason H. Moore
providing a head-to-head comparison of AutoML and DL in the context of binary classification on 6 well-characterized public datasets, and (2.)
2 code implementations • 28 Apr 2020 • William La Cava, Jason H. Moore
In this paper, current approaches to fairness are discussed and used to motivate algorithmic proposals that incorporate fairness into genetic programming for classification.
no code implementations • 30 Jan 2020 • Stefano Ruberto, Valerio Terragni, Jason H. Moore
The evolution in each run is guided by a new (dynamic) target based on the residual errors.
1 code implementation • 15 Jan 2020 • Jaqueline J. Brito, Jun Li, Jason H. Moore, Casey S. Greene, Nicole A. Nogoy, Lana X. Garmire, Serghei Mangul
Computational methods have reshaped the landscape of modern biology.
1 code implementation • 22 May 2019 • William La Cava, Heather Williams, Weixuan Fu, Steve Vitale, Durga Srivatsan, Jason H. Moore
We have two goals in mind: first, to make it easy to construct sophisticated models of biomedical processes; and second, to provide a fully automated AI agent that can choose and conduct promising experiments for the user, based on the user's experiments as well as prior knowledge.
1 code implementation • 18 Apr 2019 • William La Cava, Jason H. Moore
Multidimensional genetic programming represents candidate solutions as sets of programs, and thereby provides an interesting framework for exploiting building block identification.
1 code implementation • 14 Mar 2019 • William La Cava, Christopher Bauer, Jason H. Moore, Sarah A Pendergrass
Electronic health records are an increasingly important resource for understanding the interactions between patient health, environment, and clinical decisions.
1 code implementation • 26 Jul 2018 • Patryk Orzechowski, Jason H. Moore
Motivation: In this paper we present the latest release of EBIC, a next-generation biclustering algorithm for mining genetic data.
3 code implementations • ICLR 2019 • William La Cava, Tilak Raj Singh, James Taggart, Srinivas Suri, Jason H. Moore
We propose and study a method for learning interpretable representations for the task of regression.
1 code implementation • 25 Apr 2018 • Patryk Orzechowski, William La Cava, Jason H. Moore
In this paper we provide a broad benchmarking of recent genetic programming approaches to symbolic regression in the context of state of the art machine learning approaches.
1 code implementation • 9 Jan 2018 • Patryk Orzechowski, Moshe Sipper, Xiuzhen Huang, Jason H. Moore
In this paper a novel biclustering algorithm based on artificial intelligence (AI) is introduced.
2 code implementations • 22 Nov 2017 • Ryan J. Urbanowicz, Randal S. Olson, Peter Schmitt, Melissa Meeker, Jason H. Moore
Modern biomedical data mining requires feature selection methods that can (1) be applied to large scale feature spaces (e. g. `omics' data), (2) function in noisy problems, (3) detect complex patterns of association (e. g. gene-gene interactions), (4) be flexibly adapted to various problem domains and data types (e. g. genetic variants, gene expression, and clinical data) and (5) are computationally tractable.
no code implementations • 22 Nov 2017 • Ryan J. Urbanowicz, Melissa Meeker, William LaCava, Randal S. Olson, Jason H. Moore
Feature selection plays a critical role in biomedical data mining, driven by increasing feature dimensionality in target problems and growing interest in advanced but computationally expensive methodologies able to model complex associations.
no code implementations • 9 Oct 2017 • Alena Orlenko, Jason H. Moore, Patryk Orzechowski, Randal S. Olson, Junmei Cairns, Pedro J. Caraballo, Richard M. Weinshilboum, Liewei Wang, Matthew K. Breitenstein
Automated Machine Learning (AutoML) approaches provide exciting opportunity to guide feature selection in agnostic metabolic profiling endeavors, where potentially thousands of independent data points must be evaluated.
1 code implementation • 15 Sep 2017 • William La Cava, Thomas Helmuth, Lee Spector, Jason H. Moore
Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection.
2 code implementations • 8 Aug 2017 • Randal S. Olson, William La Cava, Zairah Mustahsan, Akshay Varik, Jason H. Moore
As the bioinformatics field grows, it must keep pace not only with new data but with new algorithms.
no code implementations • 13 Jun 2017 • Moshe Sipper, Weixuan Fu, Karuna Ahuja, Jason H. Moore
The practice of evolutionary algorithms involves the tuning of many parameters.
1 code implementation • 1 May 2017 • Randal S. Olson, Moshe Sipper, William La Cava, Sharon Tartarone, Steven Vitale, Weixuan Fu, Patryk Orzechowski, Ryan J. Urbanowicz, John H. Holmes, Jason H. Moore
While artificial intelligence (AI) has become widespread, many commercial AI systems are not yet accessible to individual researchers nor the general public due to the deep knowledge of the systems required to use them.
1 code implementation • 20 Mar 2017 • William La Cava, Jason H. Moore
Recently we proposed a general, ensemble-based feature engineering wrapper (FEW) that was paired with a number of machine learning methods to solve regression problems.
1 code implementation • 1 Mar 2017 • Randal S. Olson, William La Cava, Patryk Orzechowski, Ryan J. Urbanowicz, Jason H. Moore
The selection, development, or comparison of machine learning methods in data mining can be a difficult task based on the target problem and goals of a particular study.
no code implementations • 6 Feb 2017 • Andrew Sohn, Randal S. Olson, Jason H. Moore
Machine learning has been gaining traction in recent years to meet the demand for tools that can efficiently analyze and make sense of the ever-growing databases of biomedical data in health care systems around the world.
1 code implementation • 29 Jul 2016 • Randal S. Olson, Jason H. Moore
In this chapter, we present a genetic programming-based AutoML system called TPOT that optimizes a series of feature preprocessors and machine learning models with the goal of maximizing classification accuracy on a supervised classification problem.
no code implementations • 27 Mar 2016 • Randal S. Olson, Jason H. Moore, Christoph Adami
Pattern recognition and classification is a central concern for modern information processing systems.
3 code implementations • 20 Mar 2016 • Randal S. Olson, Nathan Bartley, Ryan J. Urbanowicz, Jason H. Moore
As the field of data science continues to grow, there will be an ever-increasing demand for tools that make machine learning accessible to non-experts.
no code implementations • 29 Feb 2016 • Randal S. Olson, Arend Hintze, Fred C. Dyer, Jason H. Moore, Christoph Adami
In this model, we observe a coevolutionary cycle between prey swarming behavior and the predator's visual system, where the predator and prey continually adapt their visual system and behavior, respectively, over evolutionary time in reaction to one another due to the well-known "predator confusion effect."
1 code implementation • 28 Jan 2016 • Randal S. Olson, Ryan J. Urbanowicz, Peter C. Andrews, Nicole A. Lavender, La Creis Kidd, Jason H. Moore
Over the past decade, data science and machine learning has grown from a mysterious art form to a staple tool across a variety of fields in academia, business, and government.