Multi-perspective Evaluation of Self-Healing Systems Using Simple Probabilistic Models

Rean Griffith; Gail E. Kaiser; Javier Alonso López

Technical reports
Computer Science
Columbia University Computer Science Technical Reports
Department of Computer Science, Columbia University
Publisher Location: New York
Quantifying the efficacy of self-healing systems is a challenging but important task, with implications for increasing designer, operator and end-user confidence in these systems. During design, system architects benefit from tools and techniques that enhance their understanding of the system, allowing them to reason about the tradeoffs of proposed or existing self-healing mechanisms and about the overall effectiveness of the system under different compositions of those mechanisms. At deployment time, system integrators and operators need to understand how the self-healing mechanisms work and how their operation impacts the system's reliability, availability and serviceability (RAS), so that they can cope with any limitations of these mechanisms once the system is placed into production. In this paper we construct an evaluation framework for self-healing systems around simple, yet powerful, probabilistic models that capture the behavior of the system's self-healing mechanisms from multiple perspectives (designer, operator, and end-user). We combine these analytical models with runtime fault-injection to study the operation of VM-Rejuv, a virtual-machine-based rejuvenation scheme for web-application servers. We use the results from the fault-injection experiments and model analysis to reason about the efficacy of VM-Rejuv, its limitations, and strategies for managing or mitigating these limitations in system deployments. Although we use VM-Rejuv as the subject of our evaluation in this paper, our main contribution is a practical evaluation approach that can be generalized to other self-healing systems.
Computer science
Suggested Citation:
Rean Griffith, Gail E. Kaiser, Javier Alonso López, Multi-perspective Evaluation of Self-Healing Systems Using Simple Probabilistic Models, Columbia University Academic Commons.
