Page segmentation and classification are important parts of the document analysis process. The aim is to extract and classify different parts of the page. This paper proposes an approach in which these two phases are combined. The integration process includes fast feature extraction with rule-based classification and label propagation using connectivity analysis providing classified areas in three categories: background, text and picture
Published in:
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
(Volume:2
)
Date of Conference: 14-16 Aug 1995