Hyperparameter Optimization

278 papers with code • 1 benchmarks • 3 datasets

Hyperparameter Optimization is the problem of choosing a set of optimal hyperparameters for a learning algorithm. Whether the algorithm is suitable for the data directly depends on hyperparameters, which directly influence overfitting or underfitting. Each model requires different assumptions, weights or training speeds for different types of data under the conditions of a given loss function.

Source: Data-driven model for fracturing design optimization: focus on building digital database and production forecast

Benchmarks

Add a Result

These leaderboards are used to track progress in Hyperparameter Optimization

Trend	Dataset	Best Model	Paper	Code	Compare
	Bayesmark	HEBO			See all

Libraries

Use these libraries to find Hyperparameter Optimization models and implementations

jim-schwoebel/allie

5 papers

138

automl/auto-sklearn

3 papers

7,393

kubeflow/katib

3 papers

1,420

DataCanvasIO/Hypernets

3 papers

261

See all 14 libraries.

Datasets

Latest papers

Most implemented Social Latest No code

Intelligent Learning Rate Distribution to reduce Catastrophic Forgetting in Transformers

themody/nas-catastrophicforgetting • • 27 Mar 2024

We combine the learning rate distributions thus found and show that they generalize to better performance with respect to the problem of catastrophic forgetting.

27 Mar 2024

Paper
Code

Breast Cancer Classification Using Gradient Boosting Algorithms Focusing on Reducing the False Negative and SHAP for Explainability

joaomh/ieee-breast-cancer-classification-boosting • 14 Mar 2024

The main objective of this study is to use state-of-the-art boosting algorithms such as AdaBoost, XGBoost, CatBoost and LightGBM to predict and diagnose breast cancer and to find the most effective metric regarding recall, ROC-AUC, and confusion matrix.

14 Mar 2024

Paper
Code

FeatAug: Automatic Feature Augmentation From One-to-Many Relationship Tables

sfu-db/feataug • • 11 Mar 2024

To overcome this limitation, we propose FEATAUG, a new feature augmentation framework that automatically extracts predicate-aware SQL queries from one-to-many relationship tables.

11 Mar 2024

Paper
Code

Better Understandings and Configurations in MaxSAT Local Search Solvers via Anytime Performance Analysis

academicsubmission/ijcai2024-p3562 • 11 Mar 2024

Though numerous solvers have been proposed for the MaxSAT problem, and the benchmark environment such as MaxSAT Evaluations provides a platform for the comparison of the state-of-the-art solvers, existing assessments were usually evaluated based on the quality, e. g., fitness, of the best-found solutions obtained within a given running time budget.

11 Mar 2024

Paper
Code

Rethinking of Encoder-based Warm-start Methods in Hyperparameter Optimization

azoz01/liltab • • 7 Mar 2024

In this work, we evaluate Dataset2Vec and liltab on two common meta-tasks - representing entire datasets and hyperparameter optimization warm-start.

07 Mar 2024

Paper
Code

Hyperparameter Tuning MLPs for Probabilistic Time Series Forecasting

18kiran12/tsbench • • 7 Mar 2024

Time series forecasting attempts to predict future events by analyzing past trends and patterns.

07 Mar 2024

Paper
Code

Parallel Hyperparameter Optimization Of Spiking Neural Network

thomasfirmin/hpo_snn • • 1 Mar 2024

By defining an early stopping criterion detecting silent networks and by designing specific constraints, we were able to instantiate larger and more flexible search spaces.

01 Mar 2024

Paper
Code

Explainable Bayesian Optimization

tanmay-ty/tntrules • 24 Jan 2024

In industry, Bayesian optimization (BO) is widely applied in the human-AI collaborative parameter tuning of cyber-physical systems.

24 Jan 2024

Paper
Code

Bilevel Optimization under Unbounded Smoothness: A New Algorithm and Convergence Analysis

mingruiliu-ml-lab/bilevel-optimization-under-unbounded-smoothness • • 17 Jan 2024

When the upper-level problem is nonconvex and unbounded smooth, and the lower-level problem is strongly convex, we prove that our algorithm requires $\widetilde{\mathcal{O}}(1/\epsilon^4)$ iterations to find an $\epsilon$-stationary point in the stochastic setting, where each iteration involves calling a stochastic gradient or Hessian-vector product oracle.

17 Jan 2024

Paper
Code

Teaching Specific Scientific Knowledge into Large Language Models through Additional Training

kanhatakeyama/Additional-training-Llama2 • 6 Dec 2023

Through additional training, we explore embedding specialized scientific knowledge into the Llama 2 Large Language Model (LLM).

06 Dec 2023

Paper
Code

Hyperparameter Optimization

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result