Abstract:
Digitization of document images using OCR-based systems is adversely affected if the image of the document contains distortion (warping). Often, costly and precisely calibrated special hardware, such as stereo cameras or laser scanners, is used to infer a 3D model of the distorted image, which is then used to remove the distortion. Recent methods focus on creating a 3D shape model from the 2D document image alone. The performance of these methods depends heavily on estimating an accurate 2D distortion grid. In printed document images, the white space between the text lines carries as much information about the 2D distortion as the text lines themselves. Based on this intuitive idea, we build a 2D distortion grid from white space lines, which can be used to rectify a printed document image with a dewarping algorithm. These white space lines are extracted using a propagation technique on the distance transform of the binarized document image, guided by an open active contour algorithm. We compare our proposed method against a state-of-the-art 2D distortion grid construction method and obtain better results. We also present qualitative and quantitative evaluations of the proposed method.
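To make the white space line idea concrete, here is a minimal sketch under stated assumptions, not the authors' implementation. It computes a city-block distance transform of a binarized image (1 = ink, 0 = background) and then follows the ridge of that distance map from left to right. The greedy per-column step is a simplified stand-in for the propagation guided by an open active contour described in the abstract, and the names `distance_transform` and `trace_white_line`, the choice of city-block distance, and the toy image are all hypothetical.

```python
def distance_transform(binary):
    """City-block distance from every pixel to the nearest ink pixel (value 1)."""
    h, w = len(binary), len(binary[0])
    inf = h + w  # larger than any possible city-block distance in the image
    dist = [[0 if binary[y][x] else inf for x in range(w)] for y in range(h)]
    # Forward pass: propagate distances from the top and left neighbours.
    for y in range(h):
        for x in range(w):
            if y > 0:
                dist[y][x] = min(dist[y][x], dist[y - 1][x] + 1)
            if x > 0:
                dist[y][x] = min(dist[y][x], dist[y][x - 1] + 1)
    # Backward pass: propagate distances from the bottom and right neighbours.
    for y in range(h - 1, -1, -1):
        for x in range(w - 1, -1, -1):
            if y < h - 1:
                dist[y][x] = min(dist[y][x], dist[y + 1][x] + 1)
            if x < w - 1:
                dist[y][x] = min(dist[y][x], dist[y][x + 1] + 1)
    return dist


def trace_white_line(dist, start_row):
    """Greedy left-to-right trace along the ridge of the distance map.

    Assumption: this replaces the active-contour guidance with a simple rule.
    At each column the trace moves to whichever of the rows (row, row - 1,
    row + 1) is farthest from any ink, preferring to stay on the same row.
    """
    h, w = len(dist), len(dist[0])
    row = start_row
    path = [row]
    for x in range(1, w):
        candidates = [r for r in (row, row - 1, row + 1) if 0 <= r < h]
        row = max(candidates, key=lambda r: dist[r][x])
        path.append(row)
    return path


# Toy "warped" page: two text lines that drop by one row halfway across.
image = [[0] * 6 for _ in range(8)]
for x in range(6):
    top, bottom = (1, 5) if x < 3 else (2, 6)
    image[top][x] = 1
    image[bottom][x] = 1

dist = distance_transform(image)
print(trace_white_line(dist, start_row=3))  # prints [3, 3, 3, 4, 4, 4]
```

The traced row indices follow the gap between the two text lines as it bends, which is the kind of curve a 2D distortion grid could be built from. A real pipeline would need binarization, a robust choice of starting rows, and smoothness constraints such as those an active contour provides.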
Date of Conference: 05-09 January 2015
Date Added to IEEE Xplore: 23 February 2015
Electronic ISBN:978-1-4799-6683-7
Print ISSN: 1550-5790
Index Terms:
- Distance Map
- Image Correction
- Active Contour
- Document Images
- White Line
- 3D Shape
- Shape Model
- Text Lines
- Optical Character Recognition
- White Space
- Stereo Camera
- Solid Line
- Body Height
- Vertical Direction
- Binary Image
- Red Box
- Green Curve
- External Energy
- Rigid Model
- Cylindrical Surface
- Connected Component Analysis
- Red Dashed Box
- Gabor Filters
- Distortion Problem
- Line Tracing
- Parallelogram