Bayesian reinforcement learning reliability analysis

Tong Zhou; Tong Guo; Chao Dang; Michael Beer

doi:10.1016/j.cma.2024.116902

Details

Originalsprache	Englisch
Aufsatznummer	116902
Seitenumfang	35
Fachzeitschrift	Computer Methods in Applied Mechanics and Engineering
Jahrgang	424
Frühes Online-Datum	12 März 2024
Publikationsstatus	Veröffentlicht - 1 Mai 2024

Abstract

A Bayesian reinforcement learning reliability method that combines Bayesian inference for the failure probability estimation and reinforcement learning-guided sequential experimental design is proposed. The reliability-oriented sequential experimental design is framed as a finite-horizon Markov decision process (MDP), with the associated utility function defined by a measure of epistemic uncertainty about Kriging-estimated failure probability, referred to as integrated probability of misclassification (IPM). On this basis, a one-step Bayes optimal learning function termed integrated probability of misclassification reduction (IPMR), along with a compatible convergence criterion, is defined. Three effective strategies are implemented to accelerate IPMR-informed sequential experimental design: (i) Analytical derivation of the inner expectation in IPMR, simplifying it to a single expectation. (ii) Substitution of IPMR with its upper bound IPMR^U to avoid element-wise computation of its integrand. (iii) Rational pruning of both quadrature set and candidate pool in IPMR^U to alleviate computer memory constraint. The efficacy of the proposed approach is demonstrated on two benchmark examples and two numerical examples. Results indicate that IPMR^U facilitates a much more rapid reduction of IPM compared to other existing learning functions, while requiring much less computational time than IPMR itself. Therefore, the proposed reliability method offers a substantial advantage in both computational efficiency and accuracy, especially in complex dynamic reliability problems.

ASJC Scopus Sachgebiete

Ingenieurwesen (insg.)
Numerische Mechanik
Ingenieurwesen (insg.)
Werkstoffmechanik
Ingenieurwesen (insg.)
Maschinenbau
Physik und Astronomie (insg.)
Allgemeine Physik und Astronomie
Informatik (insg.)
Angewandte Informatik

Zitieren

Bayesian reinforcement learning reliability analysis. / Zhou, Tong; Guo, Tong; Dang, Chao et al.
in: Computer Methods in Applied Mechanics and Engineering, Jahrgang 424, 116902, 01.05.2024.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Zhou, T, Guo, T, Dang, C & Beer, M 2024, 'Bayesian reinforcement learning reliability analysis', Computer Methods in Applied Mechanics and Engineering, Jg. 424, 116902. https://doi.org/10.1016/j.cma.2024.116902

Zhou, T., Guo, T., Dang, C., & Beer, M. (2024). Bayesian reinforcement learning reliability analysis. Computer Methods in Applied Mechanics and Engineering, 424, Artikel 116902. https://doi.org/10.1016/j.cma.2024.116902

Zhou T, Guo T, Dang C, Beer M. Bayesian reinforcement learning reliability analysis. Computer Methods in Applied Mechanics and Engineering. 2024 Mai 1;424:116902. Epub 2024 Mär 12. doi: 10.1016/j.cma.2024.116902

Zhou, Tong ; Guo, Tong ; Dang, Chao et al. / Bayesian reinforcement learning reliability analysis. in: Computer Methods in Applied Mechanics and Engineering. 2024 ; Jahrgang 424.

Download

@article{354d3bd78f9f479bb5f55778e37ec3a5,

title = "Bayesian reinforcement learning reliability analysis",

abstract = "A Bayesian reinforcement learning reliability method that combines Bayesian inference for the failure probability estimation and reinforcement learning-guided sequential experimental design is proposed. The reliability-oriented sequential experimental design is framed as a finite-horizon Markov decision process (MDP), with the associated utility function defined by a measure of epistemic uncertainty about Kriging-estimated failure probability, referred to as integrated probability of misclassification (IPM). On this basis, a one-step Bayes optimal learning function termed integrated probability of misclassification reduction (IPMR), along with a compatible convergence criterion, is defined. Three effective strategies are implemented to accelerate IPMR-informed sequential experimental design: (i) Analytical derivation of the inner expectation in IPMR, simplifying it to a single expectation. (ii) Substitution of IPMR with its upper bound IPMRU to avoid element-wise computation of its integrand. (iii) Rational pruning of both quadrature set and candidate pool in IPMRU to alleviate computer memory constraint. The efficacy of the proposed approach is demonstrated on two benchmark examples and two numerical examples. Results indicate that IPMRU facilitates a much more rapid reduction of IPM compared to other existing learning functions, while requiring much less computational time than IPMR itself. Therefore, the proposed reliability method offers a substantial advantage in both computational efficiency and accuracy, especially in complex dynamic reliability problems.",

keywords = "Bayesian inference, Integrated probability of misclassification reduction, One-step Bayes optimal learning function, Reinforcement learning, Reliability analysis",

author = "Tong Zhou and Tong Guo and Chao Dang and Michael Beer",

note = "Funding Information: The support of the National Natural Science Foundation of China (Grant No. 52125802 ) is highly appreciated. ",

year = "2024",

month = may,

day = "1",

doi = "10.1016/j.cma.2024.116902",

language = "English",

volume = "424",

journal = "Computer Methods in Applied Mechanics and Engineering",

issn = "0045-7825",

publisher = "Elsevier BV",

}

Download

TY - JOUR

T1 - Bayesian reinforcement learning reliability analysis

AU - Zhou, Tong

AU - Guo, Tong

AU - Dang, Chao

AU - Beer, Michael

N1 - Funding Information: The support of the National Natural Science Foundation of China (Grant No. 52125802 ) is highly appreciated.

PY - 2024/5/1

Y1 - 2024/5/1

N2 - A Bayesian reinforcement learning reliability method that combines Bayesian inference for the failure probability estimation and reinforcement learning-guided sequential experimental design is proposed. The reliability-oriented sequential experimental design is framed as a finite-horizon Markov decision process (MDP), with the associated utility function defined by a measure of epistemic uncertainty about Kriging-estimated failure probability, referred to as integrated probability of misclassification (IPM). On this basis, a one-step Bayes optimal learning function termed integrated probability of misclassification reduction (IPMR), along with a compatible convergence criterion, is defined. Three effective strategies are implemented to accelerate IPMR-informed sequential experimental design: (i) Analytical derivation of the inner expectation in IPMR, simplifying it to a single expectation. (ii) Substitution of IPMR with its upper bound IPMRU to avoid element-wise computation of its integrand. (iii) Rational pruning of both quadrature set and candidate pool in IPMRU to alleviate computer memory constraint. The efficacy of the proposed approach is demonstrated on two benchmark examples and two numerical examples. Results indicate that IPMRU facilitates a much more rapid reduction of IPM compared to other existing learning functions, while requiring much less computational time than IPMR itself. Therefore, the proposed reliability method offers a substantial advantage in both computational efficiency and accuracy, especially in complex dynamic reliability problems.

AB - A Bayesian reinforcement learning reliability method that combines Bayesian inference for the failure probability estimation and reinforcement learning-guided sequential experimental design is proposed. The reliability-oriented sequential experimental design is framed as a finite-horizon Markov decision process (MDP), with the associated utility function defined by a measure of epistemic uncertainty about Kriging-estimated failure probability, referred to as integrated probability of misclassification (IPM). On this basis, a one-step Bayes optimal learning function termed integrated probability of misclassification reduction (IPMR), along with a compatible convergence criterion, is defined. Three effective strategies are implemented to accelerate IPMR-informed sequential experimental design: (i) Analytical derivation of the inner expectation in IPMR, simplifying it to a single expectation. (ii) Substitution of IPMR with its upper bound IPMRU to avoid element-wise computation of its integrand. (iii) Rational pruning of both quadrature set and candidate pool in IPMRU to alleviate computer memory constraint. The efficacy of the proposed approach is demonstrated on two benchmark examples and two numerical examples. Results indicate that IPMRU facilitates a much more rapid reduction of IPM compared to other existing learning functions, while requiring much less computational time than IPMR itself. Therefore, the proposed reliability method offers a substantial advantage in both computational efficiency and accuracy, especially in complex dynamic reliability problems.

KW - Bayesian inference

KW - Integrated probability of misclassification reduction

KW - One-step Bayes optimal learning function

KW - Reinforcement learning

KW - Reliability analysis

UR - http://www.scopus.com/inward/record.url?scp=85187206497&partnerID=8YFLogxK

U2 - 10.1016/j.cma.2024.116902

DO - 10.1016/j.cma.2024.116902

M3 - Article

AN - SCOPUS:85187206497

VL - 424

JO - Computer Methods in Applied Mechanics and Engineering

JF - Computer Methods in Applied Mechanics and Engineering

SN - 0045-7825

M1 - 116902

ER -

Research@Leibniz University

Bayesian reinforcement learning reliability analysis

Autorschaft

Organisationseinheiten

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Multi-point Bayesian active learning reliability analysis

How Shapley Value and Its Generalizations Can Help in the Analysis of Complex Engineering Systems and What Next

What Is Optimal Granularity When Estimating Reliability of a Complex Engineering Systems

Assessing seismic vulnerability of modular buildings under earthquake ground motions

Output probability distribution estimation of stochastic static and dynamic systems using Laplace transform and maximum entropy

Multi-point Bayesian active learning reliability analysis

How Shapley Value and Its Generalizations Can Help in the Analysis of Complex Engineering Systems and What Next

What Is Optimal Granularity When Estimating Reliability of a Complex Engineering Systems

Assessing seismic vulnerability of modular buildings under earthquake ground motions

Output probability distribution estimation of stochastic static and dynamic systems using Laplace transform and maximum entropy

Multi-point Bayesian active learning reliability analysis