By Topic

Modeling and analysis of computer system availability

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $33
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Ambuj Goyal ; IBM Research Division, P.O. Box 704, Yorktown Heights, New York 10598, USA ; Stephen S. Lavenberg

The quantitative evaluation of computer-system availability is becoming increasingly important in the design and configuration of commercial computer systems. This paper deals with methods for constructing and solving large Markov-chain models of computer-system availability. A set of powerful high-level modeling constructs is discussed that can be used to represent the failure and repair behavior of the components that comprise a system, including important component interactions, and the repair actions that are taken when components fail. If time-independent failure and repair rates are assumed, then a time-homogeneous continuous-time Markov chain can be constructed automatically from the modeling constructs used to describe the system. Markov chains having tens of thousands of states can be readily constructed in this manner. Therefore, techniques that are particularly suitable for numerically solving such large Markov chains are also discussed, including techniques for computing the sensitivities of availability measures with respect to model parameters. A computer system modeling example is presented to illustrate the use of these modeling and analysis techniques. The modeling constructs, automatic Markov-chain construction, and model-solution methods have been implemented in a program package called the System Availability Estimator (SAVE).

Note: The Institute of Electrical and Electronics Engineers, Incorporated is distributing this Article with permission of the International Business Machines Corporation (IBM) who is the exclusive owner. The recipient of this Article may not assign, sublicense, lease, rent or otherwise transfer, reproduce, prepare derivative works, publicly display or perform, or distribute the Article.  

Published in:

IBM Journal of Research and Development  (Volume:31 ,  Issue: 6 )