By Topic

A Robust Clustering Method for XML Documents

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Bin Zhao ; Coll. of Inf. Sci. & Eng., Shandong Normal Univ., Jinan, China ; Yong-Sheng Zhang ; Hua-Xiang Zhang

With the increase of XML data over the Internet, managing and analyzing huge amount of XML documents has played an important role for information management. This paper addresses the problem of clustering XML documents. Borrowing the idea of semi-clustering, it proposes a robust clustering method through a combination of single partitional and hierarchical clustering algorithms, which can eliminate the defects of single clustering algorithms. Experiments on real XML documents collection show that our method can group large collection of XML documents into appropriate clusters efficiently without fixed number of clusters. Moreover, our method is less sensitive to noises than single clustering algorithm.

Published in:

Information Management, Innovation Management and Industrial Engineering, 2008. ICIII '08. International Conference on  (Volume:1 )

Date of Conference:

19-21 Dec. 2008