Accelerating NoC-Based MPI Primitives via Communication Architecture Customization | IEEE Conference Publication | IEEE Xplore