Fast and scalable protein motif sequence clustering based on Hadoop framework | IEEE Conference Publication | IEEE Xplore