Loading [MathJax]/extensions/tex2jax.js

Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Martin Hoffmann
  • Christian Dietrich
  • Daniel Lohmann

External Research Organisations

  • Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU Erlangen-Nürnberg)

Details

Original languageEnglish
Title of host publicationInformatik 2013
Subtitle of host publicationInformatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany
Place of PublicationBonn
Number of pages15
Publication statusPublished - 2013
Externally publishedYes
EventTagung Informatik 2013 - Koblenz
Duration: 16 Sept 201320 Sept 2013

Abstract

Soft errors are emerging with the ongoing reduction of structure sizes in current and future hardware designs. This problematic is generally tackled by employing fault detection or tolerance measures from an applications' point of view. At the same time, research commences to harden the operating system, often considered as remaining single point of failure. Certainly, these measures can effectively treat the symptoms of hardware faults. However, we argue that the operating system design per se can offer an intrinsic resilience against errors. Dynamic operating system designs, often resembling Unix-like interfaces, are obliged to cope with pointers and list-based data structures to provide the demanded flexibility. In contrast, especially in the domain of embedded systems this flexibility is often not needed. Here, static system designs can be deployed, which allow to avoid error-prone pointer-based memory operations. We believe, that a fully static system design can enhance the resilience against memory errors solely by reduced memory consumption and inherently more robust data structures. This paper studies the influences of memory faults on both, a dynamic and a fully static embedded operating system. Extensive injection campaigns, covering the entire fault space within the kernel data structures, will show that even when applying hardware-based fault detection mechanisms to a dynamic kernel, a static kernel design is still more than 75 percent less susceptible to silent data corruptions.

ASJC Scopus subject areas

Cite this

Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience. / Hoffmann, Martin; Dietrich, Christian; Lohmann, Daniel.
Informatik 2013: Informatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany . Bonn, 2013.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Hoffmann, M, Dietrich, C & Lohmann, D 2013, Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience. in Informatik 2013: Informatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany . Bonn, Tagung Informatik 2013, Koblenz, 16 Sept 2013. <https://dl.gi.de/items/53256b70-60c5-4d29-b222-b578637a201f>
Hoffmann, M., Dietrich, C., & Lohmann, D. (2013). Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience. In Informatik 2013: Informatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany https://dl.gi.de/items/53256b70-60c5-4d29-b222-b578637a201f
Hoffmann M, Dietrich C, Lohmann D. Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience. In Informatik 2013: Informatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany . Bonn. 2013
Hoffmann, Martin ; Dietrich, Christian ; Lohmann, Daniel. / Failure by Design: Influence of the RTOS Interface on Memory Fault Resilience. Informatik 2013: Informatik angepasst an Mensch, Organisation und Umwelt : Tagung vom 16. - 20. September 2013 in Koblenz, Germany . Bonn, 2013.
Download
@inproceedings{d1707d32ed3a461d9a2f11f94e4a57f7,
title = "Failure by Design:: Influence of the RTOS Interface on Memory Fault Resilience",
abstract = "Soft errors are emerging with the ongoing reduction of structure sizes in current and future hardware designs. This problematic is generally tackled by employing fault detection or tolerance measures from an applications' point of view. At the same time, research commences to harden the operating system, often considered as remaining single point of failure. Certainly, these measures can effectively treat the symptoms of hardware faults. However, we argue that the operating system design per se can offer an intrinsic resilience against errors. Dynamic operating system designs, often resembling Unix-like interfaces, are obliged to cope with pointers and list-based data structures to provide the demanded flexibility. In contrast, especially in the domain of embedded systems this flexibility is often not needed. Here, static system designs can be deployed, which allow to avoid error-prone pointer-based memory operations. We believe, that a fully static system design can enhance the resilience against memory errors solely by reduced memory consumption and inherently more robust data structures. This paper studies the influences of memory faults on both, a dynamic and a fully static embedded operating system. Extensive injection campaigns, covering the entire fault space within the kernel data structures, will show that even when applying hardware-based fault detection mechanisms to a dynamic kernel, a static kernel design is still more than 75 percent less susceptible to silent data corruptions.",
author = "Martin Hoffmann and Christian Dietrich and Daniel Lohmann",
year = "2013",
language = "English",
isbn = "978-3-88579-614-5",
booktitle = "Informatik 2013",
note = "Tagung Informatik 2013 ; Conference date: 16-09-2013 Through 20-09-2013",

}

Download

TY - GEN

T1 - Failure by Design:

T2 - Tagung Informatik 2013

AU - Hoffmann, Martin

AU - Dietrich, Christian

AU - Lohmann, Daniel

PY - 2013

Y1 - 2013

N2 - Soft errors are emerging with the ongoing reduction of structure sizes in current and future hardware designs. This problematic is generally tackled by employing fault detection or tolerance measures from an applications' point of view. At the same time, research commences to harden the operating system, often considered as remaining single point of failure. Certainly, these measures can effectively treat the symptoms of hardware faults. However, we argue that the operating system design per se can offer an intrinsic resilience against errors. Dynamic operating system designs, often resembling Unix-like interfaces, are obliged to cope with pointers and list-based data structures to provide the demanded flexibility. In contrast, especially in the domain of embedded systems this flexibility is often not needed. Here, static system designs can be deployed, which allow to avoid error-prone pointer-based memory operations. We believe, that a fully static system design can enhance the resilience against memory errors solely by reduced memory consumption and inherently more robust data structures. This paper studies the influences of memory faults on both, a dynamic and a fully static embedded operating system. Extensive injection campaigns, covering the entire fault space within the kernel data structures, will show that even when applying hardware-based fault detection mechanisms to a dynamic kernel, a static kernel design is still more than 75 percent less susceptible to silent data corruptions.

AB - Soft errors are emerging with the ongoing reduction of structure sizes in current and future hardware designs. This problematic is generally tackled by employing fault detection or tolerance measures from an applications' point of view. At the same time, research commences to harden the operating system, often considered as remaining single point of failure. Certainly, these measures can effectively treat the symptoms of hardware faults. However, we argue that the operating system design per se can offer an intrinsic resilience against errors. Dynamic operating system designs, often resembling Unix-like interfaces, are obliged to cope with pointers and list-based data structures to provide the demanded flexibility. In contrast, especially in the domain of embedded systems this flexibility is often not needed. Here, static system designs can be deployed, which allow to avoid error-prone pointer-based memory operations. We believe, that a fully static system design can enhance the resilience against memory errors solely by reduced memory consumption and inherently more robust data structures. This paper studies the influences of memory faults on both, a dynamic and a fully static embedded operating system. Extensive injection campaigns, covering the entire fault space within the kernel data structures, will show that even when applying hardware-based fault detection mechanisms to a dynamic kernel, a static kernel design is still more than 75 percent less susceptible to silent data corruptions.

UR - http://www.scopus.com/inward/record.url?scp=85043976342&partnerID=8YFLogxK

M3 - Conference contribution

SN - 978-3-88579-614-5

BT - Informatik 2013

CY - Bonn

Y2 - 16 September 2013 through 20 September 2013

ER -