An approach to identify duplicated web pages | IEEE Conference Publication | IEEE Xplore