OSIRIS: Efficient and consistent recovery of compartmentalized operating systems

Koustubha Bhat, Dirk Vogt, Erik Van Der Kouwe, Ben Gras, Lionel Sambuc, Andrew S. Tanenbaum, Herbert Bos, Cristiano Giuffrida

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Much research has gone into making operating systems more amenable to recovery and more resilient to crashes. Traditional solutions rely on partitioning the operating system (OS) to contain the effects of crashes within compartments and facilitate modular recovery. However, state dependencies among the compartments hinder recovery that is globally consistent. Such recovery typically requires expensive runtime dependency tracking which results in high performance overhead, highcomplexity and a large Reliable Computing Base (RCB). We propose a lightweight strategy that limits recovery to cases where we can statically and conservatively prove that compartment recovery leads to a globally consistent state - trading recoverable surface for a simpler and smaller RCB with lower performance overhead and maintenance cost. We present OSIRIS, a research OS design prototype that demonstrates efficient and consistent crash recovery. Our evaluation shows that OSIRIS effectively recovers from important classes of real-world software bugs with a modest RCB and low overheads.

LanguageEnglish
Title of host publicationProceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages25-36
Number of pages12
ISBN (Electronic)9781467388917
DOIs
Publication statusPublished - 29 Sep 2016
Event46th IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016 - Toulouse, France
Duration: 28 Jun 20161 Jul 2016

Conference

Conference46th IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016
CountryFrance
CityToulouse
Period28/06/161/07/16

Fingerprint

Recovery
Systems analysis
Costs

Keywords

  • Crash recovery
  • Fault tolerance
  • Operating systems
  • Reliability
  • Static analysis

Cite this

Bhat, K., Vogt, D., Kouwe, E. V. D., Gras, B., Sambuc, L., Tanenbaum, A. S., ... Giuffrida, C. (2016). OSIRIS: Efficient and consistent recovery of compartmentalized operating systems. In Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016 (pp. 25-36). [7579727] Institute of Electrical and Electronics Engineers, Inc.. https://doi.org/10.1109/DSN.2016.12
Bhat, Koustubha ; Vogt, Dirk ; Kouwe, Erik Van Der ; Gras, Ben ; Sambuc, Lionel ; Tanenbaum, Andrew S. ; Bos, Herbert ; Giuffrida, Cristiano. / OSIRIS : Efficient and consistent recovery of compartmentalized operating systems. Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016. Institute of Electrical and Electronics Engineers, Inc., 2016. pp. 25-36
@inproceedings{6e5f4092a0e44214bcfb1c48ba9a72ae,
title = "OSIRIS: Efficient and consistent recovery of compartmentalized operating systems",
abstract = "Much research has gone into making operating systems more amenable to recovery and more resilient to crashes. Traditional solutions rely on partitioning the operating system (OS) to contain the effects of crashes within compartments and facilitate modular recovery. However, state dependencies among the compartments hinder recovery that is globally consistent. Such recovery typically requires expensive runtime dependency tracking which results in high performance overhead, highcomplexity and a large Reliable Computing Base (RCB). We propose a lightweight strategy that limits recovery to cases where we can statically and conservatively prove that compartment recovery leads to a globally consistent state - trading recoverable surface for a simpler and smaller RCB with lower performance overhead and maintenance cost. We present OSIRIS, a research OS design prototype that demonstrates efficient and consistent crash recovery. Our evaluation shows that OSIRIS effectively recovers from important classes of real-world software bugs with a modest RCB and low overheads.",
keywords = "Crash recovery, Fault tolerance, Operating systems, Reliability, Static analysis",
author = "Koustubha Bhat and Dirk Vogt and Kouwe, {Erik Van Der} and Ben Gras and Lionel Sambuc and Tanenbaum, {Andrew S.} and Herbert Bos and Cristiano Giuffrida",
year = "2016",
month = "9",
day = "29",
doi = "10.1109/DSN.2016.12",
language = "English",
pages = "25--36",
booktitle = "Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016",
publisher = "Institute of Electrical and Electronics Engineers, Inc.",

}

Bhat, K, Vogt, D, Kouwe, EVD, Gras, B, Sambuc, L, Tanenbaum, AS, Bos, H & Giuffrida, C 2016, OSIRIS: Efficient and consistent recovery of compartmentalized operating systems. in Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016., 7579727, Institute of Electrical and Electronics Engineers, Inc., pp. 25-36, 46th IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016, Toulouse, France, 28/06/16. https://doi.org/10.1109/DSN.2016.12

OSIRIS : Efficient and consistent recovery of compartmentalized operating systems. / Bhat, Koustubha; Vogt, Dirk; Kouwe, Erik Van Der; Gras, Ben; Sambuc, Lionel; Tanenbaum, Andrew S.; Bos, Herbert; Giuffrida, Cristiano.

Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016. Institute of Electrical and Electronics Engineers, Inc., 2016. p. 25-36 7579727.

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - OSIRIS

T2 - Efficient and consistent recovery of compartmentalized operating systems

AU - Bhat, Koustubha

AU - Vogt, Dirk

AU - Kouwe, Erik Van Der

AU - Gras, Ben

AU - Sambuc, Lionel

AU - Tanenbaum, Andrew S.

AU - Bos, Herbert

AU - Giuffrida, Cristiano

PY - 2016/9/29

Y1 - 2016/9/29

N2 - Much research has gone into making operating systems more amenable to recovery and more resilient to crashes. Traditional solutions rely on partitioning the operating system (OS) to contain the effects of crashes within compartments and facilitate modular recovery. However, state dependencies among the compartments hinder recovery that is globally consistent. Such recovery typically requires expensive runtime dependency tracking which results in high performance overhead, highcomplexity and a large Reliable Computing Base (RCB). We propose a lightweight strategy that limits recovery to cases where we can statically and conservatively prove that compartment recovery leads to a globally consistent state - trading recoverable surface for a simpler and smaller RCB with lower performance overhead and maintenance cost. We present OSIRIS, a research OS design prototype that demonstrates efficient and consistent crash recovery. Our evaluation shows that OSIRIS effectively recovers from important classes of real-world software bugs with a modest RCB and low overheads.

AB - Much research has gone into making operating systems more amenable to recovery and more resilient to crashes. Traditional solutions rely on partitioning the operating system (OS) to contain the effects of crashes within compartments and facilitate modular recovery. However, state dependencies among the compartments hinder recovery that is globally consistent. Such recovery typically requires expensive runtime dependency tracking which results in high performance overhead, highcomplexity and a large Reliable Computing Base (RCB). We propose a lightweight strategy that limits recovery to cases where we can statically and conservatively prove that compartment recovery leads to a globally consistent state - trading recoverable surface for a simpler and smaller RCB with lower performance overhead and maintenance cost. We present OSIRIS, a research OS design prototype that demonstrates efficient and consistent crash recovery. Our evaluation shows that OSIRIS effectively recovers from important classes of real-world software bugs with a modest RCB and low overheads.

KW - Crash recovery

KW - Fault tolerance

KW - Operating systems

KW - Reliability

KW - Static analysis

UR - http://www.scopus.com/inward/record.url?scp=84994246883&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84994246883&partnerID=8YFLogxK

U2 - 10.1109/DSN.2016.12

DO - 10.1109/DSN.2016.12

M3 - Conference contribution

SP - 25

EP - 36

BT - Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016

PB - Institute of Electrical and Electronics Engineers, Inc.

ER -

Bhat K, Vogt D, Kouwe EVD, Gras B, Sambuc L, Tanenbaum AS et al. OSIRIS: Efficient and consistent recovery of compartmentalized operating systems. In Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2016. Institute of Electrical and Electronics Engineers, Inc. 2016. p. 25-36. 7579727 https://doi.org/10.1109/DSN.2016.12