A robust algorithm for text string separation from mixed text/graphics images
Fletcher, L.A.
Kasturi, R.
Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA
This paper appears in: Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publication Date: Nov 1988
Volume: 10
Issue: 6
On page(s): 910-918
ISSN: 0162-8828
References Cited: 13
CODEN: ITPIDJ
INSPEC Accession Number: 3322969
Digital Object Identifier: 10.1109/34.9112
Current Version Published: 2002-08-06
Abstract
The development and implementation of an algorithm for automated
text string separation that is relatively independent of changes in text
font style and size and of string orientation are described. It is
intended for use in an automated system for document analysis. The
principal parts of the algorithm are the generation of connected
components and the application of the Hough transform in order to group
components into logical character strings that can then be separated
from the graphics. The algorithm outputs two images, one containing text
strings and the other graphics. These images can then be processed by
suitable character recognition and graphics recognition systems. The
algorithm was evaluated for both effectiveness and computational
efficiency on several test images, and it performed better than other
techniques.
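
The two principal parts named in the abstract, connected component generation and Hough-transform grouping, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the use of 8-connectivity, voting with component centroids, and the parameter values (theta_step_deg, rho_step, min_votes) are all assumptions chosen for illustration. The size-based filtering a practical system would need is left out.

```python
import math
from collections import defaultdict, deque


def connected_components(image):
    """Return the 8-connected groups of foreground (1) pixels as pixel lists."""
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            queue = deque([(r, c)])
            seen[r][c] = True
            pixels = []
            while queue:  # breadth-first flood fill from the seed pixel
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def centroid(pixels):
    """Mean (y, x) position of a component."""
    n = len(pixels)
    return (sum(y for y, _ in pixels) / n, sum(x for _, x in pixels) / n)


def hough_groups(points, theta_step_deg=5, rho_step=1.0, min_votes=4):
    """Vote each point into (theta, rho) cells; keep cells with enough votes.

    Points sharing a cell lie close to the line
    rho = x * cos(theta) + y * sin(theta), so they are roughly collinear.
    """
    accumulator = defaultdict(set)
    for idx, (y, x) in enumerate(points):
        for theta_deg in range(0, 180, theta_step_deg):
            theta = math.radians(theta_deg)
            rho = x * math.cos(theta) + y * math.sin(theta)
            accumulator[(theta_deg, round(rho / rho_step))].add(idx)
    return [(cell, sorted(members))
            for cell, members in accumulator.items()
            if len(members) >= min_votes]


def separate(image):
    """Split a binary image into a text image and a graphics image."""
    components = connected_components(image)
    points = [centroid(pixels) for pixels in components]
    text_ids = set()
    for _, members in hough_groups(points):
        text_ids.update(members)  # components on a common line count as text
    rows, cols = len(image), len(image[0])
    text_img = [[0] * cols for _ in range(rows)]
    graphics_img = [[0] * cols for _ in range(rows)]
    for idx, pixels in enumerate(components):
        target = text_img if idx in text_ids else graphics_img
        for y, x in pixels:
            target[y][x] = 1
    return text_img, graphics_img
```

Calling separate on a binary image given as a list of rows of 0 and 1 returns two images of the same size. Components whose centroids share a Hough cell with at least min_votes members go to the first image, and everything else goes to the second.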