Scheduling the I/O of HPC Applications Under Congestion | IEEE Conference Publication | IEEE Xplore

Scheduling the I/O of HPC Applications Under Congestion


Abstract:

A significant percentage of the computing capacity of large-scale platforms is wasted because of interference incurred by multiple applications that access a shared parallel file system concurrently. One solution to handling I/O bursts in large-scale HPC systems is to absorb them at an intermediate storage layer consisting of burst buffers. However, our analysis of Argonne's Mira system shows that burst buffers cannot prevent congestion at all times. Consequently, I/O performance is dramatically degraded, showing in some cases a decrease in I/O throughput of 67%. In this paper, we analyze the effects of interference on application I/O bandwidth and propose several scheduling techniques to mitigate congestion. We show through extensive experiments that our global I/O scheduler is able to reduce the effects of congestion, even on systems where burst buffers are used, and can increase the overall system throughput by up to 56%. We also show that it outperforms current Mira I/O schedulers.
Date of Conference: 25-29 May 2015
Date Added to IEEE Xplore: 20 July 2015
Electronic ISBN: 978-1-4799-8649-1
Print ISSN: 1530-2075
Conference Location: Hyderabad, India
