Home

ASSURE: Automatic Software Self-healing Using Rescue Points

Stelios Sidiroglou; Oren Laadan; Carlos R. Perez; Nicolas Viennot; Jason Nieh; Angelos D. Keromytis

Title:
ASSURE: Automatic Software Self-healing Using Rescue Points
Author(s):
Sidiroglou, Stelios
Laadan, Oren
Perez, Carlos R.
Viennot, Nicolas
Nieh, Jason
Keromytis, Angelos D.
Date:
Type:
Articles
Department:
Computer Science
Permanent URL:
Book/Journal Title:
ASPLOS XIV: Fourteenth International Conference on Architectural Support for Programming Languages and Operating Systems: March 7-11, 2009, Washington, D.C., USA: Proceedings
Book Author:
Soffa, Mary Lou
Publisher:
ACM
Publisher Location:
New York
Abstract:
Software failures in server applications are a significant problem for preserving system availability. We present ASSURE, a system that introduces rescue points that recover software from unknown faults while maintaining both system integrity and availability, by mimicking system behavior under known error conditions. Rescue points are locations in existing application code for handling a given set of programmer-anticipated failures, which are automatically repurposed and tested for safely enabling fault recovery from a larger class of (unanticipated) faults. When a fault occurs at an arbitrary location in the program, ASSURE restores execution to an appropriate rescue point and induces the program to recover execution by virtualizing the program's existing error-handling facilities. Rescue points are identified using fuzzing, implemented using a fast coordinated checkpoint-restart mechanism that handles multi-process and multi-threaded applications, and, after testing, are injected into production code using binary patching. We have implemented an ASSURE Linux prototype that operates without application source code and without base operating system kernel changes. Our experimental results on a set of real-world server applications and bugs show that ASSURE enabled recovery for all of the bugs tested with fast recovery times, has modest performance overhead, and provides automatic self-healing orders of magnitude faster than current human-driven patch deployment methods.
Subject(s):
Computer science
Publisher DOI:
http://dx.doi.org/10.1145/1508244.1508250
Item views:
130
Metadata:
text | xml

In Partnership with the Center for Digital Research and Scholarship at Columbia University Libraries/Information Services | Terms of Use