Improving the performance of MPI derived datatypes by optimizing memory-access cost | IEEE Conference Publication