By Topic

A Web-based System for Retrieving Document Images from Digital Library

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Zhang, L. ; National University of Singapore, Kent Ridge ; Yue Lu ; Tan, C.L.

A web-based system for retrieving imaged documents from a digital library is described in this paper. First, some image preprocessing is performed off-line on the underlying imaged document to extract its word objects. Then, each word object is represented by a string known as its feature code, based on which a feature code file of the corresponding document is constructed. On the web interface side, the system allows the user to input a set of query words and indicate either to perform "AND" or "OR" operation on them. Once receiving user's request, the system will process each query word and combine the results based on the "AND" or "OR" operation the user has chosen. As for each query word, it is first looked up in an index table that stores words being queried before. If matches are found, results will be retrieved from the index table directly and stored temporarily for subsequent merging. This speeds up searching and makes the system an incremental intelligence system. Otherwise, the system will convert the query word to a feature code string and employ a partial word matching approach to perform search on the pre-generated feature code files. Preliminary experimental results with the imaged documents of students' theses provided by our digital library show that the proposed system is efficient and promising for document image retrieval, and thus has potential applications to digital libraries.

Published in:

Computer Vision and Pattern Recognition Workshop, 2003. CVPRW '03. Conference on  (Volume:3 )

Date of Conference:

16-22 June 2003