Skew angle detection of digitized Indian script documents
Chaudhuri, B.B.
Pal, U.
Comput. Vision & Pattern Recognition, Indian Stat. Inst., Calcutta;
This paper appears in: Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publication Date: Feb 1997
Volume: 19,
Issue: 2
On page(s): 182-186
ISSN: 0162-8828
References Cited: 17
CODEN: ITPIDJ
INSPEC Accession Number: 5529633
Digital Object Identifier: 10.1109/34.574803
Current Version Published: 2002-08-06
Abstract
Skew angle detection of scanned documents containing most popular
Indian scripts (Devnagari and Bangla) is considered. Most characters in
these scripts have horizontal lines at the top, called head lines. The
character head lines mostly join one another in a word and the word
appears as a single component. In the proposed method the components are
at first labeled. The upper envelope of a component is found by
columnwise scanning from an imaginary line above the component. Portions
of upper envelope satisfying the properties of digital straight line are
detected. They are clustered as belonging to single text lines.
Estimates from individual clusters are combined to get the skew angle.
Apart from accuracy and efficiency, an advantage of the method is that
character segmentation and zone detection can be readily done from head
line information, which is useful in optical character recognition
approaches of these scripts
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.