Naive automated machine learning

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

Externe Organisationen

  • Universidad de la Sabana
  • Universität Paderborn
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)1131-1170
Seitenumfang40
FachzeitschriftMachine learning
Jahrgang112
Ausgabenummer4
PublikationsstatusVeröffentlicht - Apr. 2023
Extern publiziertJa

Abstract

An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

ASJC Scopus Sachgebiete

Zitieren

Naive automated machine learning. / Mohr, Felix; Wever, Marcel.
in: Machine learning, Jahrgang 112, Nr. 4, 04.2023, S. 1131-1170.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Mohr F, Wever M. Naive automated machine learning. Machine learning. 2023 Apr;112(4):1131-1170. doi: 10.1007/s10994-022-06200-0
Mohr, Felix ; Wever, Marcel. / Naive automated machine learning. in: Machine learning. 2023 ; Jahrgang 112, Nr. 4. S. 1131-1170.
Download
@article{ac0870d820644e81a5ae39bea4c07c54,
title = "Naive automated machine learning",
abstract = "An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.",
keywords = "Automated Machine Learning, Black-Box Optimization, Data Science",
author = "Felix Mohr and Marcel Wever",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2023",
month = apr,
doi = "10.1007/s10994-022-06200-0",
language = "English",
volume = "112",
pages = "1131--1170",
journal = "Machine learning",
issn = "0885-6125",
publisher = "Springer Netherlands",
number = "4",

}

Download

TY - JOUR

T1 - Naive automated machine learning

AU - Mohr, Felix

AU - Wever, Marcel

N1 - Publisher Copyright: © 2022, The Author(s).

PY - 2023/4

Y1 - 2023/4

N2 - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

AB - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

KW - Automated Machine Learning

KW - Black-Box Optimization

KW - Data Science

UR - http://www.scopus.com/inward/record.url?scp=85139115238&partnerID=8YFLogxK

U2 - 10.1007/s10994-022-06200-0

DO - 10.1007/s10994-022-06200-0

M3 - Article

AN - SCOPUS:85139115238

VL - 112

SP - 1131

EP - 1170

JO - Machine learning

JF - Machine learning

SN - 0885-6125

IS - 4

ER -

Von denselben Autoren