By Topic

Web object information extraction based on generalized hidden Markov model

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Jing Wang ; Xidian Univ., Xi''an ; Yong Yao ; Zhijing Liu

Due to the differences between Web page and plain text document, the concept of Web object is introduced in this paper. Besides, the supposed state transition and the emission symbol conditions are improved based on generalized hidden Markov model (GHMM), and a novel web objects information extraction method is proposed. Finally, through an example, it shows that the proposed method has a very high precision for Web objects information extraction.

Published in:

Communications and Information Technologies, 2007. ISCIT '07. International Symposium on

Date of Conference:

17-19 Oct. 2007