Discovering Neglected Conditions in Software by Mining Dependence Graphs
Ray-Yaung Chang
Podgurski, A.
Jiong Yang
Case Western Reserve Univ., Cleveland, OH;
This paper appears in: Software Engineering, IEEE Transactions on
Publication Date: Sept.-Oct. 2008
Volume: 34,
Issue: 5
On page(s): 579-596
Location: San Francisco, CA, USA,
ISSN: 0098-5589
Digital Object Identifier: 10.1109/TSE.2008.24
First Published: 2008-04-18
Current Version Published: 2008-09-30
Abstract
Neglected conditions are an important but difficult-to-find class of software defects. This paper presents a novel approach to revealing neglected conditions that integrates static program analysis and advanced data mining techniques to discover implicit conditional rules in a code base and to discover rule violations that indicate neglected conditions. The approach requires the user to indicate minimal constraints on the context of the rules to be sought, rather than specific rule templates. To permit this generality, rules are modeled as graph minors of enhanced procedure dependence graphs (EPDGs), in which control and data dependence edges are augmented by edges representing shared data dependences. A heuristic maximal frequent subgraph mining algorithm is used to extract candidate rules from EPDGs, and a heuristic graph matching algorithm is used to identify rule violations. We also report the results of an empirical study in which the approach was applied to four open source projects (openssl, make, procmail, amaya). These results indicate that the approach is effective and reasonably efficient.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.