A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters | IEEE Conference Publication | IEEE Xplore