An efficient parallel approach for identifying protein families in large-scale metagenomic data sets | IEEE Conference Publication | IEEE Xplore