Evaluating Multi-Level Checkpointing for Distributed Deep Neural Network Training | IEEE Conference Publication | IEEE Xplore