An improved K-means algorithm using modified cosine distance measure for document clustering using Mahout with Hadoop | IEEE Conference Publication | IEEE Xplore