Concept discovery in structural data requires the identification of repetitive substructures in the data. A method for discovering substructures in data using an inexact graph match is described. An implementation of the authors' SUBDUE system that employs an inexact graph match to discover substructures which occur often in the data, but not always in the same form, is described. This inexact substructure discovery can be used to formulate fuzzy concepts, compress the data description, and discover interesting structures in data that are found either in an identical or in a slightly convoluted form. Examples from the domains of scene analysis and chemical compound analysis demonstrate the benefits of the inexact discovery technique
Published in:
Knowledge and Data Engineering, IEEE Transactions on
(Volume:5
,
Issue:
6
)
Date of Publication: Dec 1993