AutoML

LightAutoML

Introduced by Vakhrushev et al. in LightAutoML: AutoML Solution for a Large Financial Services Ecosystem

LightAutoML is an AutoML solution targeted for financial services companies. A typical LightAutoML pipeline scheme is presented in the Figure, each pipeline containing:

  • Reader: object that receives raw data and task as input, calculates some useful metadata, performs initial data cleaning and decides about data manipulations that should be done before fitting different model types.

  • LightAutoML inner datasets that contains metadata and CV iterators that implements validation scheme for the datasets.

  • Multiple ML Pipelines that are stacked and/or blended to get a single prediction.

An ML pipeline in LightAutoML is one or multiple ML models that share a single data preprocessing and validation scheme. The preprocessing step may have up to two feature selection steps, a feature engineering step or even just be empty if no preprocessing is needed. The ML pipelines can be computed independently on the same datasets and then blended together using averaging (or weighted averaging). Alternatively, a stacking ensemble scheme can be used to build multi level ensemble architectures.

Source: LightAutoML: AutoML Solution for a Large Financial Services Ecosystem

Papers


Paper Code Results Date Stars

Components


Component Type
🤖 No Components Found You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories