A methodology for detection and estimation of software aging
Garg, S.
van Moorsel, A.
Vaidyanathan, K.
Trivedi, K.S.
AT&T Bell Labs., Murray Hill, NJ;
This paper appears in: Software Reliability Engineering, 1998. Proceedings. The Ninth International Symposium on
Publication Date: 4-7 Nov 1998
On page(s): 283-292
Meeting Date: 11/04/1998 - 11/07/1998
Location: Paderborn, Germany
ISBN: 0-8186-8991-9
References Cited: 21
INSPEC Accession Number: 6104431
Digital Object Identifier: 10.1109/ISSRE.1998.730892
Current Version Published: 2002-08-06
Abstract
The phenomenon of software aging refers to the accumulation of
errors during the execution of the software which eventually results in
it's crash/hang failure. A gradual performance degradation may also
accompany software aging. Pro-active fault management techniques such as
“software rejuvenation” (Y. Huang et al., 1995) may be used
to counteract aging if it exists. We propose a methodology for detection
and estimation of aging in the UNIX operating system. First, we present
the design and implementation of an SNMP based, distributed monitoring
tool used to collect operating system resource usage and system activity
data at regular intervals, from networked UNIX workstations. Statistical
trend detection techniques are applied to this data to detect/validate
the existence of aging. For quantifying the effect of aging in operating
system resources, we propose a metric: “estimated time to
exhaustion”, which is calculated using well known slope estimation
techniques. Although the distributed data collection tool is specific to
UNIX, the statistical techniques can be used for detection and
estimation of aging in other software as well
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.