A method of automatic web information extraction based on page clustering | IEEE Conference Publication | IEEE Xplore