AutoML for Multi-Label Classification: Overview and Empirical Evaluation

Marcel Wever; Alexander Tornede; Felix Mohr; Eyke Hullermeier

doi:10.1109/TPAMI.2021.3051276

Details

Original language	English
Article number	9321731
Pages (from-to)	3037-3054
Number of pages	18
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume	43
Issue number	9
Publication status	Published - 1 Sept 2021
Externally published	Yes

Abstract

Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

Keywords

Automated machine learning, Bayesian optimization, hierarchical planning, multi-label classification

ASJC Scopus subject areas

Computer Science(all) › Software
Computer Science(all) › Computer Vision and Pattern Recognition
Computer Science(all) › Computational Theory and Mathematics
Computer Science(all) › Artificial Intelligence
Mathematics(all) › Applied Mathematics

Cite this

AutoML for Multi-Label Classification: Overview and Empirical Evaluation. / Wever, Marcel; Tornede, Alexander; Mohr, Felix et al.
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 43, No. 9, 9321731, 01.09.2021, p. 3037-3054.

Research output: Contribution to journal › Review article › Research › peer review

Wever, M, Tornede, A, Mohr, F & Hullermeier, E 2021, 'AutoML for Multi-Label Classification: Overview and Empirical Evaluation', IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 9, 9321731, pp. 3037-3054. https://doi.org/10.1109/TPAMI.2021.3051276

Wever, M., Tornede, A., Mohr, F., & Hullermeier, E. (2021). AutoML for Multi-Label Classification: Overview and Empirical Evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(9), 3037-3054. Article 9321731. https://doi.org/10.1109/TPAMI.2021.3051276

Wever M, Tornede A, Mohr F, Hullermeier E. AutoML for Multi-Label Classification: Overview and Empirical Evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021 Sept 1;43(9):3037-3054. 9321731. doi: 10.1109/TPAMI.2021.3051276

Wever, Marcel ; Tornede, Alexander ; Mohr, Felix et al. / AutoML for Multi-Label Classification : Overview and Empirical Evaluation. In: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021 ; Vol. 43, No. 9. pp. 3037-3054.

@article{ed7a6bd9a72343e2ab5b55081b37625a,

title = "AutoML for Multi-Label Classification: Overview and Empirical Evaluation",

abstract = "Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.",

keywords = "Automated machine learning, Bayesian optimization, hierarchical planning, multi-label classification",

author = "Marcel Wever and Alexander Tornede and Felix Mohr and Eyke Hullermeier",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2021",

month = sep,

day = "1",

doi = "10.1109/TPAMI.2021.3051276",

language = "English",

volume = "43",

pages = "3037--3054",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "9",

}

TY - JOUR

T1 - AutoML for Multi-Label Classification

T2 - Overview and Empirical Evaluation

AU - Wever, Marcel

AU - Tornede, Alexander

AU - Mohr, Felix

AU - Hullermeier, Eyke

PY - 2021/9/1

Y1 - 2021/9/1

N2 - Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

AB - Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

KW - Automated machine learning

KW - Bayesian optimization

KW - hierarchical planning

KW - multi-label classification

UR - http://www.scopus.com/inward/record.url?scp=85099549846&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2021.3051276

DO - 10.1109/TPAMI.2021.3051276

M3 - Review article

C2 - 33439834

VL - 43

SP - 3037

EP - 3054

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

SN - 0162-8828

IS - 9

M1 - 9321731

ER -

By the same author(s)

Position: Why We Must Rethink Empirical Research in Machine Learning

A Survey of Methods for Automated Algorithm Configuration (Extended Abstract)

Annotation uncertainty in the context of grammatical change

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO
