Classification & detection of near duplicate web pages using five stage algorithm | IEEE Conference Publication | IEEE Xplore