
Research and Application of Improved K-Means Algorithm Based on Fuzzy Feature Selection

5 Author(s)

K-means is a widely used clustering algorithm in data mining. The traditional algorithm treats every feature equally, so each one contributes equally to the clustering. In practice, redundant and irrelevant features can distort the clustering result. This paper proposes an improved K-means algorithm based on a fuzzy feature selection strategy. The method is based on measuring a 'feature important factor' (FIF). First, an initial clustering run supplies class labels; second, a decision tree is built on those labels to calculate the FIF; third, the clustering algorithm is run again with the FIF modifying the similarity measure, which yields the modified clustering result. Experiments on UCI datasets show that the fuzzy feature selection strategy can effectively improve the clustering result. Finally, the algorithm is applied to a human resources dataset from a domestic university to further demonstrate its effectiveness and practicality.
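
The three steps in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's FIF formula or the exact form of the modified similarity measure, so both are assumptions here: the FIF is approximated by the normalized Gini impurity reduction of a one-level decision tree (a stump) fitted per feature, and it enters the distance as a per-feature weight on squared Euclidean distance. All function names (`kmeans`, `stump_gain`, `feature_importance`, `fif_kmeans`) are hypothetical, and the deterministic farthest-point initialization is chosen only to make the example reproducible.

```python
from collections import Counter


def weighted_sq_dist(a, b, w):
    """Squared Euclidean distance with one weight per feature."""
    return sum(wi * (ai - bi) ** 2 for ai, bi, wi in zip(a, b, w))


def kmeans(points, k, weights, iters=50):
    """Lloyd's k-means using the feature-weighted distance above."""
    # Assumption: deterministic farthest-point initialization for reproducibility.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(
            points,
            key=lambda p: min(weighted_sq_dist(p, c, weights) for c in centers),
        ))
    labels = None
    for _ in range(iters):
        new_labels = [
            min(range(k), key=lambda j: weighted_sq_dist(p, centers[j], weights))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster empties
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def gini(labels):
    """Gini impurity of a non-empty list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def stump_gain(values, labels):
    """Best Gini reduction from a single threshold split on one feature."""
    base, n = gini(labels), len(labels)
    best = 0.0
    for t in sorted(set(values))[:-1]:  # skip the max so both sides are non-empty
        left = [lab for v, lab in zip(values, labels) if v <= t]
        right = [lab for v, lab in zip(values, labels) if v > t]
        gain = base - (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, gain)
    return best


def feature_importance(points, labels):
    """Stand-in for the FIF (assumption): normalized per-feature stump gains."""
    gains = [stump_gain(col, labels) for col in zip(*points)]
    total = sum(gains)
    d = len(gains)
    return [g / total for g in gains] if total > 0 else [1.0 / d] * d


def fif_kmeans(points, k):
    """Step 1: plain k-means. Step 2: importance from labels. Step 3: re-cluster."""
    d = len(points[0])
    first = kmeans(points, k, [1.0] * d)
    fif = feature_importance(points, first)
    second = kmeans(points, k, fif)
    return first, fif, second
```

Calling `fif_kmeans(points, k)` on a list of equal-length numeric rows returns the initial labels, the importance weights (which sum to 1), and the labels from the weighted re-clustering. Because the weights are derived from the first clustering, a poor initial result can reinforce the wrong features; the sketch does not guard against that.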

Published in:

Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on (Volume: 1)

Date of Conference:

18-20 Oct. 2008