Design and implementation of a web structure mining algorithm using breadth first search strategy for academic search application

2 Author(s)
Jeyalatha, S. ; Dept. of Comput. Sci., BITS Pilani, Dubai, United Arab Emirates ; Vijayakumar, B.

This paper deals with Web Structure Mining using the Breadth First Search strategy. While browsing the web, the user has to go through many pages, filter data, and download the required information. This task of searching and downloading is time consuming. Sometimes a search query calls for a specific option, such as limiting the search to a few links. To reduce the time spent by users, a web link extraction tool has been designed and implemented in Java; it analyzes ways of extracting web link information using a standard interface. A test scenario is presented with keywords such as Higher Education, Conference Alerts, and Special Interest Group. The present work can be a useful input to web users, faculty, students, and web administrators in a university environment. The web extraction tool helps save time in searching for and downloading files from the web. Another strong requirement is to verify whether the search keywords entered by the user give accurate and relevant results. This is made possible by performing a quick check on the search links. The user can also view the internal links present in the selected HTML files and the adjacency list of the crawled files.
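
The abstract does not give the tool's algorithm in detail, so the following is only a minimal sketch of how breadth-first link extraction with a page limit and an adjacency list might look in Java. Everything in it is an assumption made for illustration: the class name `BfsLinkCrawler`, the methods `extractLinks` and `crawl`, the `maxPages` parameter, the simple `href` regular expression, and the in-memory map of page names to HTML that stands in for real downloads.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration; not the authors' implementation.
class BfsLinkCrawler {
    // Assumption: links appear as double-quoted href attributes.
    // A real tool would more likely use a proper HTML parser.
    private static final Pattern HREF =
        Pattern.compile("href\\s*=\\s*\"([^\"]+)\"", Pattern.CASE_INSENSITIVE);

    // Returns the link targets found in one HTML document, in document order.
    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    // Breadth-first traversal starting at seed. The pages map (name -> HTML)
    // stands in for fetching over the network. At most maxPages pages are
    // crawled, mirroring the "limit the search to a few links" option.
    // The result is an adjacency list: crawled page -> links found on it.
    static Map<String, List<String>> crawl(Map<String, String> pages,
                                           String seed, int maxPages) {
        Map<String, List<String>> adjacency = new LinkedHashMap<>();
        Set<String> seen = new HashSet<>();
        Deque<String> queue = new ArrayDeque<>();
        queue.add(seed);
        seen.add(seed);
        while (!queue.isEmpty() && adjacency.size() < maxPages) {
            String page = queue.poll();
            String html = pages.get(page);
            if (html == null) {
                continue; // page unavailable; skip it
            }
            List<String> links = extractLinks(html);
            adjacency.put(page, links);
            for (String link : links) {
                if (seen.add(link)) { // enqueue each page only once
                    queue.add(link);
                }
            }
        }
        return adjacency;
    }

    public static void main(String[] args) {
        Map<String, String> pages = new HashMap<>();
        pages.put("a.html", "<a href=\"b.html\">B</a> <a href=\"c.html\">C</a>");
        pages.put("b.html", "<a href=\"d.html\">D</a>");
        pages.put("c.html", "<a href=\"a.html\">A</a>");
        pages.put("d.html", "no links here");
        // Crawl at most three pages, then print the adjacency list.
        crawl(pages, "a.html", 3)
            .forEach((page, links) -> System.out.println(page + " -> " + links));
    }
}
```

In this sketch the queue yields pages level by level, so pages linked directly from the seed are processed before pages two links away, and the `seen` set keeps a cyclic link (here `c.html` back to `a.html`) from being crawled twice.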

Published in:

Internet Technology and Secured Transactions (ICITST), 2011 International Conference for

Date of Conference:

11-14 Dec. 2011