By Topic

A Method for Web Data Collection for Pervasive Computing

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Lihong Wang ; School of Computer Science and Technology, Shandong University, Jinan, P. R. China., ; Qingzhong Li ; Deng Li

A new method for web data collection for pervasive computing is proposed by this paper. With the fast expansion of World Wide Web, dynamic web pages become more important. They are usually generated from a database through a common template. The structured data extracted from these pages with semantic annotation are valuable for information system. In this paper, we study how to label attribute on data value, to automatically detect the template behind these pages and extract embedded data. To label attribute on data value, we rely on the fact that the label text is visually closed to the data element. And we propose a bootstrapping method for learning label. A novel algorithm is presented to detect template and construct wrapper. Experimental results obtained using a large number of pages show that the proposed technique is highly effective.

Published in:

Pervasive Computing and Applications, 2008. ICPCA 2008. Third International Conference on  (Volume:2 )

Date of Conference:

6-8 Oct. 2008