Home // International Journal On Advances in Systems and Measurements, volume 6, numbers 3 and 4, 2013 // View article
Sick But Not Dead Failures - Adaptive Testing, Evaluation and Design Methodologies
Authors:
Tara Astigarraga
Michael Browne
Lou Dickens
Ian MacQuarrie
Keywords: Keywords- Software Testing, Sick but not Dead, Software Engineering, Partial Failure, Transient Error, Soft Failure, SAN Test, Storage Area Network Test, System Test.
Abstract:
Abstract- Enterprise data center implementations make significant investments in high availability configurations, redundant hardware, software and Input / Output (I/O) paths that are in many failure scenarios quite successful. However, in spite of all that investment clients are still facing unexpected outages and performance impacts related to a phenomenon referred to as Sick but not Dead (SBND) errors. SBND errors are sometimes lumped together in a category with other related errors including transient errors, partial failure scenarios and soft errors. While SBND errors do have many common characteristics with the errors described above, there are key differences and environment impacts which we will explore further in this paper. We will also present new proactive techniques, inject scenarios and methods to identify, characterize and address SBND failures including cross-component impacts and failures.
Pages: 324 to 334
Copyright: Copyright (c) to authors, 2013. Used with permission.
Publication date: December 31, 2013
Published in: journal
ISSN: 1942-261x