Structural analysis and regular expressions based noise elimination from web pages for web content mining | IEEE Conference Publication | IEEE Xplore