Optimizing non-blocking collective operations for infiniband | IEEE Conference Publication | IEEE Xplore