Near-duplicate web page detection by enhanced TDW and simHash technique | IEEE Conference Publication | IEEE Xplore