Experimental comparisons of binarization and multi-thresholding methods on document images
O'Gorman, L.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994. Vol. 2 - Conference B: Computer Vision & Image Processing
Volume 2, 9-13 Oct 1994, Page(s): 395-398
Digital Object Identifier 10.1109/ICPR.1994.576954
Summary: Thresholding methods are applied here to document images and their
experimental results compared. In one set of tests, different
thresholding methods are used to binarize document images, then optical
character recognition (OCR) is performed on the resulting text and the
recognition results are compared. In the other set of tests,
multi-thresholding is performed on document images (to obtain three or
more levels for images with more than binary levels) and the results are
compared. Four thresholding methods are compared in the experiments: a
discriminant analysis method, a maximum entropy method, a
moment-preserving method, and a connectivity-preserving method. A method
using a minimum-error criterion is also commented upon. The
moment-preserving and connectivity-preserving methods are found to yield
the best OCR results from the binarized images, and the
connectivity-preserving method yields the fewest binarization and
multi-thresholding failures.
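
As a rough illustration of what binarization by a discriminant analysis criterion involves, the sketch below picks the gray level that maximizes the between-class variance of the histogram and then thresholds the image. It assumes the discriminant analysis method in the summary is of this between-class-variance (Otsu-style) kind, which the summary does not state. The function names, the 256-level default, and the tiny sample image are all hypothetical and are not taken from the paper.

```python
def gray_histogram(image, levels=256):
    """Count how many pixels fall on each gray level (image is a list of rows)."""
    hist = [0] * levels
    for row in image:
        for pixel in row:
            hist[pixel] += 1
    return hist


def discriminant_threshold(histogram):
    """Return the level t maximizing between-class variance.

    Pixels with level <= t form one class, pixels with level > t the other.
    Assumption: this is an Otsu-style criterion, used here for illustration only.
    """
    total = sum(histogram)
    weighted_total = sum(level * count for level, count in enumerate(histogram))

    best_t, best_score = 0, -1.0
    count_low = 0  # number of pixels at or below the candidate threshold
    sum_low = 0    # sum of gray levels at or below the candidate threshold

    for t, count in enumerate(histogram):
        count_low += count
        sum_low += t * count
        count_high = total - count_low
        if count_low == 0 or count_high == 0:
            continue  # one class is empty, so no split is defined here
        mean_low = sum_low / count_low
        mean_high = (weighted_total - sum_low) / count_high
        # Proportional to between-class variance (constant 1/total^2 dropped).
        score = count_low * count_high * (mean_low - mean_high) ** 2
        if score > best_score:
            best_t, best_score = t, score
    return best_t


def binarize(image, threshold):
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if pixel > threshold else 0 for pixel in row] for row in image]


# Hypothetical 2x3 image: dark levels near 10, bright levels near 200.
sample = [[10, 12, 200],
          [11, 210, 205]]
t = discriminant_threshold(gray_histogram(sample))
binary = binarize(sample, t)
```

Working from the histogram rather than from the pixels keeps the search to a single pass over the gray levels. A multi-threshold variant would search over several cut points instead of one.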