Leveraging Hadoop framework to develop duplication detector and analysis using MapReduce, Hive and Pig | IEEE Conference Publication | IEEE Xplore