Abstract:
Recent studies have shown that using fine-grained peer-to-peer (P2P) stores to communicate among devices in multi-GPU systems is a promising path to achieve strong performance scaling. In many irregular applications, such as graph algorithms and sparse linear algebra, small sub-cache-line (4-32B) stores arise naturally when using the P2P paradigm. This is particularly problematic in multi-GPU systems because inter-GPU interconnects are optimized for bulk transfers rather than small operations. As a consequence, application developers either resort to complex programming techniques to work around this small-transfer inefficiency or fall back to bulk inter-GPU DMA transfers that have limited performance scalability. We propose FinePack, a set of limited I/O interconnect and GPU hardware enhancements that enable small peer-to-peer stores to achieve interconnect efficiency that rivals bulk transfers while maintaining the simplicity of a peer-to-peer memory access programming model. Exploiting the GPU's weak memory model, FinePack dynamically coalesces and compresses small writes into a larger I/O message that reduces link-level protocol overhead. FinePack is fully transparent to software and requires no changes to the GPU's virtual memory system. We evaluate FinePack on a system comprising 4 Volta GPUs on a PCIe 4.0 interconnect to show FinePack improves interconnect efficiency for small peer-to-peer stores by 3×. This results in 4-GPU strong scaling performance 1.4× better than traditional DMA-based multi-GPU programming and comes within 71% of the maximum achievable strong scaling performance.
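For context, the sketch below illustrates the fine-grained P2P store pattern the abstract describes: one GPU writes small (8B) updates directly into a peer GPU's memory, as a graph or sparse-matrix kernel might when pushing values to remotely owned elements. This is an illustrative assumption, not code from the paper; the kernel name, buffer names, and sizes are hypothetical, and FinePack's coalescing would happen transparently in hardware beneath such stores.

```cuda
// Hypothetical example of fine-grained P2P stores (not from the paper).
// GPU 0 writes 8-byte values directly into a buffer owned by GPU 1;
// each store becomes a small inter-GPU transaction that FinePack-style
// hardware could coalesce into larger I/O messages.
#include <cuda_runtime.h>
#include <cstdio>

__global__ void push_updates(const int *dst_ids, const double *vals,
                             double *peer_buf, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        // Sub-cache-line store landing in the peer GPU's memory (P2P store).
        peer_buf[dst_ids[i]] = vals[i];
    }
}

int main() {
    const int n = 1 << 20;

    // GPU 1 owns the destination buffer.
    double *peer_buf;
    cudaSetDevice(1);
    cudaMalloc(&peer_buf, n * sizeof(double));

    // GPU 0 maps GPU 1's memory and writes into it directly.
    cudaSetDevice(0);
    cudaDeviceEnablePeerAccess(1, 0);

    int *dst_ids;
    double *vals;
    cudaMalloc(&dst_ids, n * sizeof(int));
    cudaMalloc(&vals, n * sizeof(double));
    // (an application would fill dst_ids and vals here)

    push_updates<<<(n + 255) / 256, 256>>>(dst_ids, vals, peer_buf, n);
    cudaDeviceSynchronize();
    printf("done\n");
    return 0;
}
```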
Date of Conference: 25 February 2023 - 01 March 2023
Date Added to IEEE Xplore: 24 March 2023