Vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines | IEEE Conference Publication | IEEE Xplore