Abstract
We propose a two-phase methodology for quantifying the performability (performance and availability) of cluster-based Internet services. In the first phase, evaluators use a fault-injection infrastructure to measure the impact of faults on the server’s performance. In the second phase, evaluators use an analytical model to combine an expected fault load with measurements from the first phase to assess the server’s performability. Using this model, evaluators can study the server’s sensitivity to different design decisions, fault rates, and environmental factors. To demonstrate our methodology, we study the performability of 4 versions of the PRESS Web server against 5 classes of faults, quantifying the effects of different design decisions on performance and availability. Finally, to further show the utility of our model, we also quantify the impact of two hypothetical changes, reduced human operator response time and the use of RAIDs.
Original language | English (US) |
---|---|
State | Published - 2003 |
Externally published | Yes |
Event | 4th USENIX Symposium on Internet Technologies and Systems, USITS 2003 - Seattle, United States Duration: Mar 26 2003 → Mar 28 2003 |
Conference
Conference | 4th USENIX Symposium on Internet Technologies and Systems, USITS 2003 |
---|---|
Country/Territory | United States |
City | Seattle |
Period | 3/26/03 → 3/28/03 |
All Science Journal Classification (ASJC) codes
- Computer Networks and Communications
- Computer Science Applications