By Topic

A Content-Based Retrieval Algorithm for Document Image Database

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Dewen Hou ; Key Lab. for Distrib. Comput. Software, Shandong Normal Univ., Jinan, China ; Xichang Wang ; Jiang Liu

This paper makes a study on content-based image retrieval algorithm for document image database. Given a query image the system returns overall similar images in database. For document images, we propose the algorithm based on hierarchical matching tree. First segment an image into several regions with paragraph marking based on paragraph height estimation, and then segment the region into line blocks, the algorithm for document image retrieval by regions and line blocks with hierarchical matching tree is presented. Also we describe the matching model and the texture character strings for indexing. This algorithm is tested through trials. The experiment results indicate this algorithm is accuracy and effective. The response time of retrieval is strongly reduced by image scaling. The efficiency of retrieval is highly valuable in document image database.

Published in:

Multimedia Technology (ICMT), 2010 International Conference on

Date of Conference:

29-31 Oct. 2010