A novel web page text information extraction method | IEEE Conference Publication | IEEE Xplore