This content is part of the Essential Guide: Complete guide to Hadoop technology and storage
Evaluate Weigh the pros and cons of technologies, products and projects you are considering.

What to expect when using HDFS in Hadoop architecture

Analyst John Webster details issues with Hadoop architecture and what users can expect from Hadoop Version 2.0.

Which issues can I expect to pop up with Hadoop Distributed File System (HDFS) and NameNode in Hadoop architecture?

According to advanced Hadoop users and vendors that offer alternatives to HDFS, there are a number of issues you can expect. The one cited most often is that NameNode represents a single point of failure. When it goes offline, the cluster shuts down and has to be restarted at the beginning of the process that was running at the time of the failure. The Apache Hadoop community is working to address this problem, and Version 2.0 of Hadoop includes manual failover to a standby NameNode without the need to restart the cluster. A newer release is expected to include automated NameNode failover. Vendors are also coming to market with fixes such as a NameNode failover mode in HDFS, as well as file-system alternatives that don't use a NameNode function (that means no NameNode to fail).

Next Steps

MapReduce waning with Hadoop system user

HDFS architecture features 

Dig Deeper on Big data storage