DBCSR: A Library for Dense Matrix Multiplications on Distributed GPU-Accelerated Systems | IEEE Conference Publication | IEEE Xplore