Reproducing and learning from failures in deployed software is costly and difficult. Those activities can be facilitated, however, if the circumstances leading to a failure are properly captured. In this paper, we empirically investigate how various anomaly detection schemes can serve to identify the conditions that precede failures in deployed software. Our results expose the tradeoffs between different detection algorithms applied to several types of events under varying levels of in-house testing.
Published in:
Software Reliability Engineering, 2003. ISSRE 2003. 14th International Symposium on
Date of Conference: 17-20 Nov. 2003