Abstract:
Document layout helps users focus on the important content of a document while neglecting the rest whenever possible. This paper presents a novel Optical Character Recognition (OCR) algorithm whose performance is enhanced by post-processing based on information collected from document layout analysis. Initial OCR results are used for text block classification, and the classification results are then used to fine-tune the final OCR output. Experimental results show that the proposed algorithm outperforms state-of-the-art OCR algorithms.
Date of Conference: 16-19 December 2016
Date Added to IEEE Xplore: 19 January 2017