Abstract:
Moving toward exascale computing, the size of data stored and accessed by applications is ever increasing. However, traditional disk-based storage has not seen improvemen...Show MoreMetadata
Abstract:
Moving toward exascale computing, the size of data stored and accessed by applications is ever increasing. However, traditional disk-based storage has not seen improvements that keep up with the explosion of data volume or the speed of processors. Multiple levels of non-volatile storage devices are being added to handle bursty I/O, however, moving data across the storage hierarchy can take longer than the data generation or analysis. Asynchronous I/O can reduce the impact of I/O latency as it allows applications to schedule I/O early and to check their status later. I/O is thus overlapped with application communication or computation or both, effectively hiding some or all of the I/O latency. POSIX and MPI-I/O provide asynchronous read and write operations, but lack the support for non-data operations such as file open and close. Users also have to manually manage data dependencies and use low-level byte offsets, which requires significant effort and expertise to adopt. In this article, we present an asynchronous I/O framework that supports all types of I/O operations, manages data dependencies transparently and automatically, provides implicit and explicit modes for application flexibility, and error information retrieval. We implemented these techniques in HDF5. Our evaluation of several benchmarks and application workloads demonstrates it effectiveness on hiding the I/O cost from the application.
Published in: IEEE Transactions on Parallel and Distributed Systems ( Volume: 33, Issue: 4, 01 April 2022)
Funding Agency:

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Houjun Tang received the PhD degree in computer science from NC State University in 2016 and the BEng degree in computer science and technology from Shenzhen University, China, in 2012. He is currently a computer research scientist with Lawrence Berkeley National Laboratory. His research interests include data management, parallel I/O, and storage systems in HPC.
Houjun Tang received the PhD degree in computer science from NC State University in 2016 and the BEng degree in computer science and technology from Shenzhen University, China, in 2012. He is currently a computer research scientist with Lawrence Berkeley National Laboratory. His research interests include data management, parallel I/O, and storage systems in HPC.View more

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Quincey Koziol received the BS degree in electrical engineering from the University of Illinois. He is currently a principal data architect with Lawrence Berkeley National Laboratory, where he drives scientific data architecture discussions and participates in NERSC system design activities. He is currently a principal architect for the HDF5 Project and a founding member of the HDF Group.
Quincey Koziol received the BS degree in electrical engineering from the University of Illinois. He is currently a principal data architect with Lawrence Berkeley National Laboratory, where he drives scientific data architecture discussions and participates in NERSC system design activities. He is currently a principal architect for the HDF5 Project and a founding member of the HDF Group.View more

NC State University, Raleigh, NC, USA
John Ravi received the BS and MS degrees in computer engineering from NC State University in 2018 and 2019, respectively. He is currently working toward the PhD degree in computer engineering from his alma mater. His research focuses on GPU utilization in HPC environments.
John Ravi received the BS and MS degrees in computer engineering from NC State University in 2018 and 2019, respectively. He is currently working toward the PhD degree in computer engineering from his alma mater. His research focuses on GPU utilization in HPC environments.View more

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Suren Byna received the PhD degree in computer science from the Illinois Institute of Technology, Chicago, in 2006. He is currently a staff scientist with Scientific Data Management Group, CRD, Lawrence Berkeley National Laboratory. He works on optimizing parallel I/O and developing systems for managing scientific data. He leads the ECP funded ExaIO Project that is developing features in HDF5 and UnifyFS, and various proj...Show More
Suren Byna received the PhD degree in computer science from the Illinois Institute of Technology, Chicago, in 2006. He is currently a staff scientist with Scientific Data Management Group, CRD, Lawrence Berkeley National Laboratory. He works on optimizing parallel I/O and developing systems for managing scientific data. He leads the ECP funded ExaIO Project that is developing features in HDF5 and UnifyFS, and various proj...View more

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Houjun Tang received the PhD degree in computer science from NC State University in 2016 and the BEng degree in computer science and technology from Shenzhen University, China, in 2012. He is currently a computer research scientist with Lawrence Berkeley National Laboratory. His research interests include data management, parallel I/O, and storage systems in HPC.
Houjun Tang received the PhD degree in computer science from NC State University in 2016 and the BEng degree in computer science and technology from Shenzhen University, China, in 2012. He is currently a computer research scientist with Lawrence Berkeley National Laboratory. His research interests include data management, parallel I/O, and storage systems in HPC.View more

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Quincey Koziol received the BS degree in electrical engineering from the University of Illinois. He is currently a principal data architect with Lawrence Berkeley National Laboratory, where he drives scientific data architecture discussions and participates in NERSC system design activities. He is currently a principal architect for the HDF5 Project and a founding member of the HDF Group.
Quincey Koziol received the BS degree in electrical engineering from the University of Illinois. He is currently a principal data architect with Lawrence Berkeley National Laboratory, where he drives scientific data architecture discussions and participates in NERSC system design activities. He is currently a principal architect for the HDF5 Project and a founding member of the HDF Group.View more

NC State University, Raleigh, NC, USA
John Ravi received the BS and MS degrees in computer engineering from NC State University in 2018 and 2019, respectively. He is currently working toward the PhD degree in computer engineering from his alma mater. His research focuses on GPU utilization in HPC environments.
John Ravi received the BS and MS degrees in computer engineering from NC State University in 2018 and 2019, respectively. He is currently working toward the PhD degree in computer engineering from his alma mater. His research focuses on GPU utilization in HPC environments.View more

Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Suren Byna received the PhD degree in computer science from the Illinois Institute of Technology, Chicago, in 2006. He is currently a staff scientist with Scientific Data Management Group, CRD, Lawrence Berkeley National Laboratory. He works on optimizing parallel I/O and developing systems for managing scientific data. He leads the ECP funded ExaIO Project that is developing features in HDF5 and UnifyFS, and various projects on managing scientific data.
Suren Byna received the PhD degree in computer science from the Illinois Institute of Technology, Chicago, in 2006. He is currently a staff scientist with Scientific Data Management Group, CRD, Lawrence Berkeley National Laboratory. He works on optimizing parallel I/O and developing systems for managing scientific data. He leads the ECP funded ExaIO Project that is developing features in HDF5 and UnifyFS, and various projects on managing scientific data.View more