Conferences >2019 19th IEEE/ACM Internatio...

Data Transfer between Scientific Facilities – Bottleneck Analysis, Insights and Optimizations

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Wide area file transfers play an important role in many science applications. File transfer tools typically deliver the highest performance for datasets with a small numb...Show More

Metadata

Abstract:

Wide area file transfers play an important role in many science applications. File transfer tools typically deliver the highest performance for datasets with a small number of large files, but many science datasets consist of many small files. Thus it is important to understand the factors that contribute to the decrease in wide area data transfer performance for datasets with many small files. To this end, we (i) benchmark the performance of subsystems involved in end-to-end file transfer between two HPC facilities for a many-file dataset that is representative of production science transfers; (ii) characterize the per-file overhead introduced by different subsystems; (iii) identify potential dependencies and bottlenecks; (iv) study the effectiveness of transferring many files concurrently as a means of reducing per-file overheads; and (v) prototype a prefetching mechanism as an alternative of concurrency to reduce the per-file overhead on source storage system. We show that both concurrency and prefetching can help reduce the per-file overhead significantly. A reasonable level of concurrency combined with prefetching can bring the per-file overhead down to a negligible level.

Published in: 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)

Date of Conference: 14-17 May 2019

Date Added to IEEE Xplore: 08 July 2019

ISBN Information:

DOI: 10.1109/CCGRID.2019.00023

Conference Location: Larnaca, Cyprus

Contents

References is not available for this document.

Data Transfer between Scientific Facilities – Bottleneck Analysis, Insights and Optimizations

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Data Transfer between Scientific Facilities – Bottleneck Analysis, Insights and Optimizations

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?