Parallelized Near-Duplicate Document Detection Algorithm for Large Scale Chinese Web Pages | IEEE Conference Publication | IEEE Xplore