A novel web page duplication detection framework | IEEE Conference Publication | IEEE Xplore