Technical Reports


A Formal Foundation for Failure Avoidance and Diagnosis

Kelly, Terence; Wang, Yin; Lafortune, Stephane; Welsh, Matt
HP Laboratories


Keyword(s): fault tolerance, failure avoidance, concurrent programming, sensor networks, discrete control theory

Abstract: This paper argues that Discrete Control Theory (DCT) provides a useful formal foundation for failure avoidance and diagnosis in a wide variety of computing systems. Our experience applying DCT to several difficult systems problems during the past three years convinces us that this powerful, general, mature, and rigorous body of theory belongs in the standard dependability toolbox. It is particularly valuable in new contexts thrust upon us by recent technology trends, including sensor networks and the multicore revolution.

5 Pages

External Posting Date: August 21, 2009. Approved for External Publication.
Internal Posting Date: August 21, 2009
