Naive automated machine learning

Felix Mohr; Marcel Wever

doi:10.1007/s10994-022-06200-0

Details

Originalsprache	Englisch
Seiten (von - bis)	1131-1170
Seitenumfang	40
Fachzeitschrift	Machine learning
Jahrgang	112
Ausgabenummer	4
Publikationsstatus	Veröffentlicht - Apr. 2023
Extern publiziert	Ja

Abstract

An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

ASJC Scopus Sachgebiete

Informatik (insg.)
Software
Informatik (insg.)
Artificial intelligence

Zitieren

Naive automated machine learning. / Mohr, Felix; Wever, Marcel.
in: Machine learning, Jahrgang 112, Nr. 4, 04.2023, S. 1131-1170.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Mohr, F & Wever, M 2023, 'Naive automated machine learning', Machine learning, Jg. 112, Nr. 4, S. 1131-1170. https://doi.org/10.1007/s10994-022-06200-0

Mohr, F., & Wever, M. (2023). Naive automated machine learning. Machine learning, 112(4), 1131-1170. https://doi.org/10.1007/s10994-022-06200-0

Mohr F, Wever M. Naive automated machine learning. Machine learning. 2023 Apr;112(4):1131-1170. doi: 10.1007/s10994-022-06200-0

Mohr, Felix ; Wever, Marcel. / Naive automated machine learning. in: Machine learning. 2023 ; Jahrgang 112, Nr. 4. S. 1131-1170.

Download

@article{ac0870d820644e81a5ae39bea4c07c54,

title = "Naive automated machine learning",

abstract = "An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.",

keywords = "Automated Machine Learning, Black-Box Optimization, Data Science",

author = "Felix Mohr and Marcel Wever",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2023",

month = apr,

doi = "10.1007/s10994-022-06200-0",

language = "English",

volume = "112",

pages = "1131--1170",

journal = "Machine learning",

issn = "0885-6125",

publisher = "Springer Netherlands",

number = "4",

}

Download

TY - JOUR

T1 - Naive automated machine learning

AU - Mohr, Felix

AU - Wever, Marcel

PY - 2023/4

Y1 - 2023/4

N2 - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

AB - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

KW - Automated Machine Learning

KW - Black-Box Optimization

KW - Data Science

UR - http://www.scopus.com/inward/record.url?scp=85139115238&partnerID=8YFLogxK

U2 - 10.1007/s10994-022-06200-0

DO - 10.1007/s10994-022-06200-0

M3 - Article

AN - SCOPUS:85139115238

VL - 112

SP - 1131

EP - 1170

JO - Machine learning

JF - Machine learning

SN - 0885-6125

IS - 4

ER -

Research@Leibniz University

Naive automated machine learning

Autorschaft

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Position: Why We Must Rethink Empirical Research in Machine Learning

A Survey of Methods for Automated Algorithm Configuration (Extended Abstract)

Annotation uncertainty in the context of grammatical change

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO

Position: Why We Must Rethink Empirical Research in Machine Learning

A Survey of Methods for Automated Algorithm Configuration (Extended Abstract)

Annotation uncertainty in the context of grammatical change

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO

Position: Why We Must Rethink Empirical Research in Machine Learning