The Hadoop Distributed File System | IEEE Conference Publication | IEEE Xplore

Abstract:

The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size. We describe the architecture of HDFS and report on experience using HDFS to manage 25 petabytes of enterprise data at Yahoo!.
Date of Conference: 03-07 May 2010
Date Added to IEEE Xplore: 28 June 2010
Conference Location: Incline Village, NV, USA
