Conferences >2017 IEEE 14th International ...

Indexing for Large Scale Data Querying Based on Spark SQL

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Spark SQL lets spark programmers query structured data inside Spark programs using SQL statements. It provides spark programmers with great convenience to leverage the be...Show More

Metadata

Abstract:

Spark SQL lets spark programmers query structured data inside Spark programs using SQL statements. It provides spark programmers with great convenience to leverage the benefits of relational processing, and its internal RDD distributed processing also accelerates query on large data sets. However, Spark SQL is not designed for long-run services and its built-in data source would load data from storage system, such as HDFS and local file system, in each table scan without cache mechanism. Although users could keep data in memory using "cache" command explicitly, the data cached in memory is coarse grained. In this paper, we present an indexing structure which is a pluggable component of Spark SQL based on Apache Spark. Compared with Spark SQL, it has some additional advantages. Firstly, it allows users to create index of structured data to be processed, which speeds up the query performance greatly. Secondly, it enables programmers to load fine-grained data file of structured data into memory, which is flexible to load "hot data" into memory and to evict "cold data" out of memory.

Published in: 2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)

Date of Conference: 04-06 November 2017

Date Added to IEEE Xplore: 23 November 2017

ISBN Information:

DOI: 10.1109/ICEBE.2017.25

Conference Location: Shanghai, China

Contents

References is not available for this document.

Indexing for Large Scale Data Querying Based on Spark SQL

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Indexing for Large Scale Data Querying Based on Spark SQL

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?