Loading [MathJax]/extensions/tex2jax.js

Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

Details

Original languageEnglish
Title of host publicationComputer Vision
Subtitle of host publicationECCV 2024 Workshops, Proceedings
EditorsAlessio Del Bue, Cristian Canton, Jordi Pont-Tuset, Tatiana Tommasi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages100-116
Number of pages17
ISBN (electronic)978-3-031-92591-7
ISBN (print)9783031925900
Publication statusPublished - 12 May 2025
Event18th European Conference on Computer Vision, ECCV 2024 - Milan, Italy
Duration: 29 Sept 20244 Oct 2024

Publication series

NameLecture Notes in Computer Science
Volume15634 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Training Reinforcement Learning agents directly in any real-world environment remains difficult, as such scenarios entail the risk of damaging the training setup or violating other safety constraints. The training process itself further requires extensive human supervision and intervention to reset the environment after each episode. Thus, we propose an innovative Safe Reinforcement Learning framework that combines Safe and Resetless RL to autonomously reset environments, while also reducing the number of safety constraint violations. In this context, we develop a novel risk-averse RL agent suitable for stringent safety constraints by combining Safe RL, Distributional RL, and Randomized Ensembled Double Q-Learning. Experiments conducted in a novel mobile robotics scenario indicate that our Safe Resetless RL framework reduces the number of human interactions required during training compared to state-of-the-art methods, filling a gap in current problem formulations and enhancing the autonomy of RL training processes in real-world settings.

Keywords

    Autonomous Navigation, Distributional Reinforcement Learning, Mobile Service Robotics, Resetless Reinforcement Learning, Safe Reinforcement Learning

ASJC Scopus subject areas

Cite this

Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents. / Gottwald, Tristan; Schier, Maximilian; Rosenhahn, Bodo.
Computer Vision : ECCV 2024 Workshops, Proceedings. ed. / Alessio Del Bue; Cristian Canton; Jordi Pont-Tuset; Tatiana Tommasi. Springer Science and Business Media Deutschland GmbH, 2025. p. 100-116 (Lecture Notes in Computer Science; Vol. 15634 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Gottwald, T, Schier, M & Rosenhahn, B 2025, Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents. in A Del Bue, C Canton, J Pont-Tuset & T Tommasi (eds), Computer Vision : ECCV 2024 Workshops, Proceedings. Lecture Notes in Computer Science, vol. 15634 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 100-116, 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 Sept 2024. https://doi.org/10.1007/978-3-031-92591-7_7
Gottwald, T., Schier, M., & Rosenhahn, B. (2025). Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents. In A. Del Bue, C. Canton, J. Pont-Tuset, & T. Tommasi (Eds.), Computer Vision : ECCV 2024 Workshops, Proceedings (pp. 100-116). (Lecture Notes in Computer Science; Vol. 15634 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-92591-7_7
Gottwald T, Schier M, Rosenhahn B. Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents. In Del Bue A, Canton C, Pont-Tuset J, Tommasi T, editors, Computer Vision : ECCV 2024 Workshops, Proceedings. Springer Science and Business Media Deutschland GmbH. 2025. p. 100-116. (Lecture Notes in Computer Science). doi: 10.1007/978-3-031-92591-7_7
Gottwald, Tristan ; Schier, Maximilian ; Rosenhahn, Bodo. / Safe Resetless Reinforcement Learning : Enhancing Training Autonomy with Risk-Averse Agents. Computer Vision : ECCV 2024 Workshops, Proceedings. editor / Alessio Del Bue ; Cristian Canton ; Jordi Pont-Tuset ; Tatiana Tommasi. Springer Science and Business Media Deutschland GmbH, 2025. pp. 100-116 (Lecture Notes in Computer Science).
Download
@inproceedings{1153c5238cba476c9cfe434b06042273,
title = "Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents",
abstract = "Training Reinforcement Learning agents directly in any real-world environment remains difficult, as such scenarios entail the risk of damaging the training setup or violating other safety constraints. The training process itself further requires extensive human supervision and intervention to reset the environment after each episode. Thus, we propose an innovative Safe Reinforcement Learning framework that combines Safe and Resetless RL to autonomously reset environments, while also reducing the number of safety constraint violations. In this context, we develop a novel risk-averse RL agent suitable for stringent safety constraints by combining Safe RL, Distributional RL, and Randomized Ensembled Double Q-Learning. Experiments conducted in a novel mobile robotics scenario indicate that our Safe Resetless RL framework reduces the number of human interactions required during training compared to state-of-the-art methods, filling a gap in current problem formulations and enhancing the autonomy of RL training processes in real-world settings.",
keywords = "Autonomous Navigation, Distributional Reinforcement Learning, Mobile Service Robotics, Resetless Reinforcement Learning, Safe Reinforcement Learning",
author = "Tristan Gottwald and Maximilian Schier and Bodo Rosenhahn",
note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.; 18th European Conference on Computer Vision, ECCV 2024, ECCV 2024 ; Conference date: 29-09-2024 Through 04-10-2024",
year = "2025",
month = may,
day = "12",
doi = "10.1007/978-3-031-92591-7_7",
language = "English",
isbn = "9783031925900",
series = "Lecture Notes in Computer Science",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "100--116",
editor = "{Del Bue}, Alessio and Cristian Canton and Jordi Pont-Tuset and Tatiana Tommasi",
booktitle = "Computer Vision",
address = "Germany",

}

Download

TY - GEN

T1 - Safe Resetless Reinforcement Learning

T2 - 18th European Conference on Computer Vision, ECCV 2024

AU - Gottwald, Tristan

AU - Schier, Maximilian

AU - Rosenhahn, Bodo

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

PY - 2025/5/12

Y1 - 2025/5/12

N2 - Training Reinforcement Learning agents directly in any real-world environment remains difficult, as such scenarios entail the risk of damaging the training setup or violating other safety constraints. The training process itself further requires extensive human supervision and intervention to reset the environment after each episode. Thus, we propose an innovative Safe Reinforcement Learning framework that combines Safe and Resetless RL to autonomously reset environments, while also reducing the number of safety constraint violations. In this context, we develop a novel risk-averse RL agent suitable for stringent safety constraints by combining Safe RL, Distributional RL, and Randomized Ensembled Double Q-Learning. Experiments conducted in a novel mobile robotics scenario indicate that our Safe Resetless RL framework reduces the number of human interactions required during training compared to state-of-the-art methods, filling a gap in current problem formulations and enhancing the autonomy of RL training processes in real-world settings.

AB - Training Reinforcement Learning agents directly in any real-world environment remains difficult, as such scenarios entail the risk of damaging the training setup or violating other safety constraints. The training process itself further requires extensive human supervision and intervention to reset the environment after each episode. Thus, we propose an innovative Safe Reinforcement Learning framework that combines Safe and Resetless RL to autonomously reset environments, while also reducing the number of safety constraint violations. In this context, we develop a novel risk-averse RL agent suitable for stringent safety constraints by combining Safe RL, Distributional RL, and Randomized Ensembled Double Q-Learning. Experiments conducted in a novel mobile robotics scenario indicate that our Safe Resetless RL framework reduces the number of human interactions required during training compared to state-of-the-art methods, filling a gap in current problem formulations and enhancing the autonomy of RL training processes in real-world settings.

KW - Autonomous Navigation

KW - Distributional Reinforcement Learning

KW - Mobile Service Robotics

KW - Resetless Reinforcement Learning

KW - Safe Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=105006877519&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-92591-7_7

DO - 10.1007/978-3-031-92591-7_7

M3 - Conference contribution

AN - SCOPUS:105006877519

SN - 9783031925900

T3 - Lecture Notes in Computer Science

SP - 100

EP - 116

BT - Computer Vision

A2 - Del Bue, Alessio

A2 - Canton, Cristian

A2 - Pont-Tuset, Jordi

A2 - Tommasi, Tatiana

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 29 September 2024 through 4 October 2024

ER -

By the same author(s)