Efficient RDMA-based multi-port collectives on multi-rail QsNetII clusters | IEEE Conference Publication | IEEE Xplore