AutoML for Multi-Label Classification: Overview and Empirical Evaluation

Marcel Wever; Alexander Tornede; Felix Mohr; Eyke Hullermeier

doi:10.1109/TPAMI.2021.3051276

Details

Originalsprache	Englisch
Aufsatznummer	9321731
Seiten (von - bis)	3037-3054
Seitenumfang	18
Fachzeitschrift	IEEE Transactions on Pattern Analysis and Machine Intelligence
Jahrgang	43
Ausgabenummer	9
Publikationsstatus	Veröffentlicht - 1 Sept. 2021
Extern publiziert	Ja

Abstract

Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

ASJC Scopus Sachgebiete

Informatik (insg.)
Software
Informatik (insg.)
Maschinelles Sehen und Mustererkennung
Informatik (insg.)
Theoretische Informatik und Mathematik
Informatik (insg.)
Artificial intelligence
Mathematik (insg.)
Angewandte Mathematik

Zitieren

AutoML for Multi-Label Classification: Overview and Empirical Evaluation. / Wever, Marcel; Tornede, Alexander; Mohr, Felix et al.
in: IEEE Transactions on Pattern Analysis and Machine Intelligence, Jahrgang 43, Nr. 9, 9321731, 01.09.2021, S. 3037-3054.

Publikation: Beitrag in Fachzeitschrift › Übersichtsarbeit › Forschung › Peer-Review

Wever, M, Tornede, A, Mohr, F & Hullermeier, E 2021, 'AutoML for Multi-Label Classification: Overview and Empirical Evaluation', IEEE Transactions on Pattern Analysis and Machine Intelligence, Jg. 43, Nr. 9, 9321731, S. 3037-3054. https://doi.org/10.1109/TPAMI.2021.3051276

Wever, M., Tornede, A., Mohr, F., & Hullermeier, E. (2021). AutoML for Multi-Label Classification: Overview and Empirical Evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(9), 3037-3054. Artikel 9321731. https://doi.org/10.1109/TPAMI.2021.3051276

Wever M, Tornede A, Mohr F, Hullermeier E. AutoML for Multi-Label Classification: Overview and Empirical Evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021 Sep 1;43(9):3037-3054. 9321731. doi: 10.1109/TPAMI.2021.3051276

Wever, Marcel ; Tornede, Alexander ; Mohr, Felix et al. / AutoML for Multi-Label Classification : Overview and Empirical Evaluation. in: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021 ; Jahrgang 43, Nr. 9. S. 3037-3054.

Download

@article{ed7a6bd9a72343e2ab5b55081b37625a,

title = "AutoML for Multi-Label Classification: Overview and Empirical Evaluation",

abstract = "Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.",

keywords = "Automated machine learning, Bayesian optimization, hierarchical planning, multi-label classification",

author = "Marcel Wever and Alexander Tornede and Felix Mohr and Eyke Hullermeier",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2021",

month = sep,

day = "1",

doi = "10.1109/TPAMI.2021.3051276",

language = "English",

volume = "43",

pages = "3037--3054",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "9",

}

Download

TY - JOUR

T1 - AutoML for Multi-Label Classification

T2 - Overview and Empirical Evaluation

AU - Wever, Marcel

AU - Tornede, Alexander

AU - Mohr, Felix

AU - Hullermeier, Eyke

PY - 2021/9/1

Y1 - 2021/9/1

N2 - Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

AB - Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

KW - Automated machine learning

KW - Bayesian optimization

KW - hierarchical planning

KW - multi-label classification

UR - http://www.scopus.com/inward/record.url?scp=85099549846&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2021.3051276

DO - 10.1109/TPAMI.2021.3051276

M3 - Review article

C2 - 33439834

VL - 43

SP - 3037

EP - 3054

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

SN - 0162-8828

IS - 9

M1 - 9321731

ER -

Research@Leibniz University

AutoML for Multi-Label Classification: Overview and Empirical Evaluation

Autorschaft

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Position: Why We Must Rethink Empirical Research in Machine Learning

A Survey of Methods for Automated Algorithm Configuration (Extended Abstract)

Annotation uncertainty in the context of grammatical change

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO

Position: Why We Must Rethink Empirical Research in Machine Learning

A Survey of Methods for Automated Algorithm Configuration (Extended Abstract)

Annotation uncertainty in the context of grammatical change

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO

Position: Why We Must Rethink Empirical Research in Machine Learning