Using Fault Injection and Modeling to Evaluate the Performability of Cluster-Based Services

Kiran Nagaraja, Xiaoyan Li, Ricardo Bianchini, Richard P. Martin, Thu D. Nguyen

Research output: Contribution to conferencePaperpeer-review

26 Scopus citations

Abstract

We propose a two-phase methodology for quantifying the performability (performance and availability) of cluster-based Internet services. In the first phase, evaluators use a fault-injection infrastructure to measure the impact of faults on the server’s performance. In the second phase, evaluators use an analytical model to combine an expected fault load with measurements from the first phase to assess the server’s performability. Using this model, evaluators can study the server’s sensitivity to different design decisions, fault rates, and environmental factors. To demonstrate our methodology, we study the performability of 4 versions of the PRESS Web server against 5 classes of faults, quantifying the effects of different design decisions on performance and availability. Finally, to further show the utility of our model, we also quantify the impact of two hypothetical changes, reduced human operator response time and the use of RAIDs.

Original languageEnglish (US)
StatePublished - 2003
Externally publishedYes
Event4th USENIX Symposium on Internet Technologies and Systems, USITS 2003 - Seattle, United States
Duration: Mar 26 2003Mar 28 2003

Conference

Conference4th USENIX Symposium on Internet Technologies and Systems, USITS 2003
CountryUnited States
CitySeattle
Period3/26/033/28/03

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Using Fault Injection and Modeling to Evaluate the Performability of Cluster-Based Services'. Together they form a unique fingerprint.

Cite this