Reducing overhead in implementing fine-grain parallel data-structures of a dataflow language on off-the-shelf distributed-memory parallel computers | IEEE Conference Publication | IEEE Xplore