HTML web content extraction using paragraph tags | IEEE Conference Publication | IEEE Xplore