Detecting near-replicas on the Web by content and hyperlink analysis | IEEE Conference Publication | IEEE Xplore