Skip to Main Content
In this paper, we present a text segmentation method using wavelet packet analysis and k-means clustering algorithm. This approach assumes that the text and non-text regions are considered as two different texture regions. The text segmentation is achieved by using wavelet packet analysis as a feature analysis. The wavelet packet analysis is a method of wavelet decomposition that offers a richer range of possibilities for document image. From these multiscale features, we compute the local energy and intensify the features before adapting the k-means clustering algorithm based on the unsupervised learning rule. The results show that our text segmentation method is effective for document images scanned from newspapers and journals.