Establishment and Application of Big Data Processing Platform | IEEE Conference Publication | IEEE Xplore

Establishment and Application of Big Data Processing Platform


Abstract:

With the advent of the era of big data, more and more enterprises begin to use big data technology to deal with related analysis work. Hadoop is the dominant processing p...Show More

Abstract:

With the advent of the era of big data, more and more enterprises begin to use big data technology to deal with related analysis work. Hadoop is the dominant processing platform in the field of big data, an ecosystem that integrates distributed computing, storage, and management. The Spark framework, on the other hand, is a faster, more versatile distributed computing platform. However, it is only a computing platform and does not provide distributed storage and management per se, and computing remains dependent on distributed file system HDFS and cluster Resource Manager Yarn in the Hadoop ecosystem. Therefore, the combination of Spark and Hadoop to build a big data processing platform can better improve the algorithm efficiency and processing scale. This article explains the setup process and running state of Hadoop and Spark in detail, and verifies its feasibility through several ways.
Date of Conference: 25-27 September 2020
Date Added to IEEE Xplore: 09 November 2020
ISBN Information:
Conference Location: Xi'an, China

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.