Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputers | IEEE Conference Publication