Skip to Main Content
In this paper, we present a text segmentation method using wavelet packet analysis and k-means clustering algorithm. This approach assumes that the text and non-text regions are considered as two different texture regions. The text segmentation is achieved by using wavelet packet analysis as a feature analysis. The wavelet packet analysis is a method of wavelet decomposition that offers a richer range of possibilities for document image. From these multiscale features, we compute the local energy and intensify the features before adapting the k-means clustering algorithm based on the unsupervised learning rule. The results show that our text segmentation method is effective for document images scanned from newspapers and journals.
Advanced Communication Technology, The 9th International Conference on (Volume:1 )
Date of Conference: 12-14 Feb. 2007