Conferences >2022 IEEE International Paral...

Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Disaggregated architecture brings new opportunities to memory -consuming applications like graph processing. It allows one to outspread memory access pressure from local ...Show More

Metadata

Abstract:

Disaggregated architecture brings new opportunities to memory -consuming applications like graph processing. It allows one to outspread memory access pressure from local to far memory, providing an attractive alternative to disk-based processing. Although existing works on general-purpose far mem-ory platforms show great potentials for application expansion, it is unclear how graph processing applications could benefit from disaggregated architecture, and how different optimization methods influence the overall performance. In this paper, we take the first step to analyze the impact of graph processing workload on disaggregated architecture by extending the GridGraph framework on top of the RDMA-based far memory system. We design Fargraph, a far memory coordi-nation strategy for enhancing graph processing workload. Specif-ically, Fargraph reduces the overall data movement through a well-crafted, graph-aware data segment offloading mechanism. In addition, we use optimal data segment splitting and asynchronous data buffering to achieve graph iteration-friendly far memory access. We show that Fargraph achieves near-oracle performance for typical in-local-memory graph processing systems. Fargraph shows up to 8.3 x speedup compared to Fastswap, the state-of-the-art, general-purpose far memory platform.

Published in: 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Date of Conference: 30 May 2022 - 03 June 2022

Date Added to IEEE Xplore: 15 July 2022

ISBN Information:

ISSN Information:

DOI: 10.1109/IPDPS53621.2022.00104

Conference Location: Lyon, France

Funding Agency:

Jing Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Chao Li

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Shanghai Qi Zhi Institute, Shanghai, China

Taolei Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Lu Zhang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Pengyu Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Junyi Mei

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Minyi Guo

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Shanghai Qi Zhi Institute, Shanghai, China

Contents

I. Introduction

Today's various graph applications demand better memory performance at different graph scales [18], [32], [36], [39], [40]. In the past, most graph applications can be processed by a single-node system given the relatively small size of the graph in existing in-memory graph frameworks [29], [39], [40]. Distributed graph frameworks are required only for very large-scale data analytic problems due to the communication overhead [24], [35], [41]. Nevertheless, as shown in Figure 1-(a), many graph frameworks mainly focus on medium-sized graphs (from 1GB to several hundreds of GB) [17], [36], [42]. Although current out-of-core graph computing frameworks could handle medium-sized graphs with external storage, they suffer performance degradation due to the I/O bottleneck.

Jing Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Chao Li

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Shanghai Qi Zhi Institute, Shanghai, China

Taolei Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Lu Zhang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Pengyu Wang

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Junyi Mei

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Minyi Guo

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

Shanghai Qi Zhi Institute, Shanghai, China

References is not available for this document.

Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?