By Topic

Web Page Downloading and Classification

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Tran, L.Q. ; Nat. Libr. of Med., Bethesda, MD, USA ; Moon, C.W. ; Le, D.X. ; Thoma, G.R.

Describes the processes of downloading and classifying Web-based articles in online medical journals as a preliminary step to extracting bibliographic data to populate MEDLINE(R), the widely-used database of the National Library of Medicine (NLM). The processes are combined to develop an automated system named WPDC (“Web Page Downloading and Classification”). The system downloads the Web pages using Microsoft's Windows Internet API tool WinInet, and a combination of several artificial intelligence (AI) techniques, including the breadth-first search algorithm and the constraint satisfaction method. The breadth-first search algorithm and the constraint satisfaction method are then used to traverse the Web page's links, identify these pages as abstract, full text, PDF or image files, and recognize and generate the successors of the downloading pages

Published in:

Computer-Based Medical Systems, 2001. CBMS 2001. Proceedings. 14th IEEE Symposium on

Date of Conference: