Optimizing Process-to-Core Mappings for Two Dimensional Broadcast/Reduce on Multicore Architectures | IEEE Conference Publication | IEEE Xplore