Efficient Implementation of the Overlap Operator on Multi-GPUs | IEEE Conference Publication | IEEE Xplore