Using overdecomposition to overlap communication latencies with computation and take advantage of SMT processors | IEEE Conference Publication | IEEE Xplore