Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer | IEEE Conference Publication | IEEE Xplore