Chapter Abstract:
In a big data storage architecture, data reaches users through multiple organizational data structures. Cluster computing is a distributed or parallel computing system comprising multiple stand‐alone PCs connected together and working as a single, integrated, highly available resource. The main reason for distributing data over a large cluster is to avoid the difficulty and cost of buying expensive servers. The key concept of a distributed file system is data replication: copies of the data, called replicas, are distributed across multiple cluster nodes so that there is no single point of failure, which increases reliability. Relational databases organize data into tables of rows and columns. With the advent of the big data era, there is an imperative need to scale data storage platforms so that they can store petabytes of data. Storage platforms can be scaled in two ways: scaling‐up and scaling‐out.
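The replication idea in the abstract can be illustrated with a minimal sketch. It is not taken from the chapter: the function names `place_replicas` and `readable_blocks`, the round-robin placement rule, and the default replication factor of 3 are all assumptions chosen for illustration. Real distributed file systems use more elaborate placement policies.

```python
def place_replicas(block_ids, nodes, replication_factor=3):
    """Assign each block to `replication_factor` distinct nodes.

    Hypothetical round-robin policy: block i goes to nodes i, i+1, ...
    (modulo the cluster size), so the replicas of one block never share a node.
    """
    if replication_factor > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    placement = {}
    for i, block in enumerate(block_ids):
        placement[block] = [
            nodes[(i + k) % len(nodes)] for k in range(replication_factor)
        ]
    return placement


def readable_blocks(placement, failed_nodes):
    """Return the blocks that still have at least one replica on a live node."""
    failed = set(failed_nodes)
    return {
        block
        for block, replicas in placement.items()
        if any(node not in failed for node in replicas)
    }


if __name__ == "__main__":
    nodes = ["n0", "n1", "n2", "n3"]
    blocks = ["b0", "b1", "b2", "b3"]
    placement = place_replicas(blocks, nodes, replication_factor=3)
    # With three replicas per block, losing any two nodes leaves every block readable.
    print(sorted(readable_blocks(placement, failed_nodes=["n0", "n1"])))
```

With a replication factor of 1, the same failure would make some blocks unreadable, which is the single point of failure that replication is meant to remove.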
Page(s): 31 - 52
Copyright Year: 2021
Edition: 1