Conferences >2016 18th International Confe...

ASC: Improving spark driver performance with automatic spark checkpoint

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Many great big data processing platforms, for example Hadoop Map Reduce, are keeping improving large-scale data processing performance which make big data processing focu...Show More

Metadata

Abstract:

Many great big data processing platforms, for example Hadoop Map Reduce, are keeping improving large-scale data processing performance which make big data processing focus of IT industry. Among them Spark has become increasingly popular big data processing framework since it was presented in 2010 first time. Spark use RDD for its data abstraction, targeting at the multiple iteration large-scale data processing with reuse of data, the in-memory feature of RDD make Spark faster than many other non-in-memory big data processing platform. However in-memory feature also bring the volatile problem, a failure or a missing RDD will cause Spark to recompute all the missing RDD on the lineage. And a long lineage will also increasing the time cost and memory usage of Driver analysing the lineage. A checkpoint will cut off the lineage and save the data which is required in the coming computing, the frequency to make a checkpoint and the RDDs which are selected to save will significantly influence the performance. In this paper, we are presenting an automatic checkpoint algorithm on Spark to help solve the long lineage problem with less influence on the performance. The automatic checkpoint will select the necessary RDD to save and bring an acceptable overhead and improve the time performance for multiple iteration.

Published in: 2016 18th International Conference on Advanced Communication Technology (ICACT)

Date of Conference: 31 January 2016 - 03 February 2016

Date Added to IEEE Xplore: 03 March 2016

ISBN Information:

DOI: 10.1109/ICACT.2016.7423490

Conference Location: PyeongChang, Korea (South)

Contents

References is not available for this document.

ASC: Improving spark driver performance with automatic spark checkpoint

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

ASC: Improving spark driver performance with automatic spark checkpoint

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?