Towards Low-Overhead Resilience for Data Parallel Deep Learning | IEEE Conference Publication | IEEE Xplore