Text clustering is a difficult and active research area in Internet search engine research. This paper presents a new text clustering algorithm that builds on and improves the K-means clustering technique. First, the texts are preprocessed to prepare them for the subsequent steps. Second, the paper improves both the centroid (gravity center) calculation method and the algorithm flow of K-means in order to improve the efficiency and stability of the original K-means algorithm. The experimental results indicate that the improved algorithm achieves higher accuracy and better stability than the original algorithm.
Published in:
Future Computer and Communication, 2009. FCC '09. International Conference on
Date of Conference: 6-7 June 2009
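
For orientation, the baseline that the abstract refers to as the original K-means algorithm can be sketched as follows. This is a minimal illustration only, not the paper's improved method: the abstract does not specify the modified centroid calculation or algorithm flow, so neither is reproduced here. The whitespace tokenizer, the smoothed TF-IDF weighting, the squared Euclidean distance, and the helper names `tfidf_vectors` and `kmeans` are all assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Assumed preprocessing: whitespace tokens -> L2-normalized TF-IDF vectors."""
    tokenized = [d.lower().split() for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    index = {t: i for i, t in enumerate(vocab)}
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for toks in tokenized for t in set(toks))
    vectors = []
    for toks in tokenized:
        vec = [0.0] * len(vocab)
        for term, count in Counter(toks).items():
            # Smoothed IDF keeps terms found in every document at a nonzero weight.
            vec[index[term]] = count * (math.log((1 + n) / (1 + df[term])) + 1.0)
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


def kmeans(vectors, k, init_indices, max_iter=100):
    """Standard (unimproved) K-means: each centroid is the mean of its members."""
    centroids = [list(vectors[i]) for i in init_indices]
    labels = [-1] * len(vectors)
    for _ in range(max_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_labels = []
        for v in vectors:
            dists = [sum((a - b) ** 2 for a, b in zip(v, c)) for c in centroids]
            new_labels.append(dists.index(min(dists)))
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: recompute each centroid; keep the old one if a cluster is empty.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Toy corpus (invented for illustration, not data from the paper).
docs = [
    "search engine ranking query",
    "query ranking search index",
    "football match goal score",
    "goal score football league",
]
labels, centroids = kmeans(tfidf_vectors(docs), k=2, init_indices=[0, 2])
print(labels)
```

The initial centroids are passed in explicitly so the run is deterministic. In standard K-means they are usually chosen at random, which is one common source of the instability that the abstract says the improved algorithm addresses.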