Similarity detection among data files - a machine learning approach | IEEE Conference Publication | IEEE Xplore