The document spectrum for page layout analysis
O'Gorman, L.
AT&T Bell Labs., Murray Hill, NJ;
This paper appears in: Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publication Date: Nov 1993
Volume: 15,
Issue: 11
On page(s): 1162-1173
ISSN: 0162-8828
References Cited: 20
CODEN: ITPIDJ
INSPEC Accession Number: 4580748
Digital Object Identifier: 10.1109/34.244677
Current Version Published: 2002-08-06
Abstract
Page layout analysis is a document processing technique used to
determine the format of a page. This paper describes the document
spectrum (or docstrum), which is a method for structural page layout
analysis based on bottom-up, nearest-neighbor clustering of page
components. The method yields an accurate measure of skew, within-line,
and between-line spacings and locates text lines and text blocks. It is
advantageous over many other methods in three main ways: independence
from skew angle, independence from different text spacings, and the
ability to process local regions of different text orientations within
the same image. Results of the method shown for several different page
formats and for randomly oriented subpages on the same image illustrate
the versatility of the method. We also discuss the differences,
advantages, and disadvantages of the docstrum with respect to other
lay-out methods
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.