Conferences >2019 IEEE 18th International ...

Nap: Network-Aware Data Partitions for Efficient Distributed Processing

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In order to support emerging data-intensive applications, many clever frameworks have been developed over the last years to efficiently and distributedly process big data...Show More

Metadata

Abstract:

In order to support emerging data-intensive applications, many clever frameworks have been developed over the last years to efficiently and distributedly process big data sets, such as MapReduce. However, these frameworks are often optimized for relatively homogeneous environments, and accounting, e.g., for the varying connectivity of wide-area network infrastructure, may require complex placement algorithms. In this paper, we present Nap, which allows optimizing distributed data processing frameworks such as MapReduce for heterogeneous environments. Nap allows adapting resources dynamically, without requiring complex placement or migration algorithms, or modifications to the logic of the mappers and reducers. Rather, Nap simply changes the data partition, by spawning virtual nodes (e.g., reducers) depending on the demand. To this end, Nap leverages a connection to integer partition problems and employs Young lattices to guarantee minimal completion times (i.e., the makespan). In fact, Nap comes with provable performance guarantees and also supports applications that leverage redundancy to speed up executions further. In particular, to demonstrate our framework, as a case study, we show how to execute multiway joins across wide-area networks with limited bandwidth efficiently. Our experiments, based on a proof-of-concept prototype implementation, confirm the potential of Nap to reduce completion times.

Published in: 2019 IEEE 18th International Symposium on Network Computing and Applications (NCA)

Date of Conference: 26-28 September 2019

Date Added to IEEE Xplore: 19 December 2019

ISBN Information:

ISSN Information:

DOI: 10.1109/NCA.2019.8935013

Conference Location: Cambridge, MA, USA

Contents

References is not available for this document.

Nap: Network-Aware Data Partitions for Efficient Distributed Processing

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Nap: Network-Aware Data Partitions for Efficient Distributed Processing

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?