Entropy-Aware I/O Pipelining for Large-Scale Deep Learning on HPC Systems | IEEE Conference Publication | IEEE Xplore

Entropy-Aware I/O Pipelining for Large-Scale Deep Learning on HPC Systems


Abstract:

Deep neural networks have recently gained tremendous interest due to their capabilities in a wide variety of application areas such as computer vision and speech recognit...Show More

Abstract:

Deep neural networks have recently gained tremendous interest due to their capabilities in a wide variety of application areas such as computer vision and speech recognition. Thus it is important to exploit the unprecedented power of leadership High-Performance Computing (HPC) systems for greater potential of deep learning. While much attention has been paid to leverage the latest processors and accelerators, I/O support also needs to keep up with the growth of computing power for deep neural networks. In this research, we introduce an entropy-aware I/O framework called DeepIO for large-scale deep learning on HPC systems. Its overarching goal is to coordinate the use of memory, communication, and I/O resources for efficient training of datasets. DeepIO features an I/O pipeline that utilizes several novel optimizations: RDMA (Remote Direct Memory Access)-assisted in-situ shuffling, input pipelining, and entropy-aware opportunistic ordering. In addition, we design a portable storage interface to support efficient I/O on any underlying storage system. We have implemented DeepIO as a prototype for the popular TensorFlow framework and evaluated it on a variety of different storage systems. Our evaluation shows that DeepIO delivers significantly better performance than existing memory-based storage systems.
Date of Conference: 25-28 September 2018
Date Added to IEEE Xplore: 08 November 2018
ISBN Information:

ISSN Information:

Conference Location: Milwaukee, WI, USA

Contact IEEE to Subscribe

References

References is not available for this document.