A Duplication Reduction Approach for Unstructured Data Using Machine Learning Method | IEEE Conference Publication | IEEE Xplore