Abstract:
Digitization of document images using OCR based systems is adversely affected if the image of the document contains distortion (warping). Often, costly and precisely cali...Show MoreMetadata
Abstract:
Digitization of document images using OCR based systems is adversely affected if the image of the document contains distortion (warping). Often, costly and precisely calibrated special hardware such as stereo cameras, laser scanners, etc. are used to infer the 3D model of the distorted image which is used to remove the distortion. Recent methods focus on creating a 3D shape model based on the 2D document image. The performance of these methods is highly dependent on estimating an accurate 2D distortion grid. In the domain of printed document images, the white space between the text lines carries as much information about the 2D distortion as the text lines themselves. Based on this intuitive idea, we build a 2D distortion grid from white space lines, which can be used to rectify a printed document image by a Dewar ping algorithm. These white space lines are extracted using a propagation technique on the distance transform of the binarized document image, guided by an open active contour algorithm. We compare our proposed method against a state-of-the-art 2D distortion grid construction method and obtain better results. We also present qualitative and quantitative evaluations for the proposed method.
Date of Conference: 05-09 January 2015
Date Added to IEEE Xplore: 23 February 2015
Electronic ISBN:978-1-4799-6683-7
Print ISSN: 1550-5790