An efficient algorithm for computing communication sets for data parallel programs with block-cyclic distribution | IEEE Conference Publication | IEEE Xplore