Apache Hadoop plays a key role in Apache Mahout's scalability, and this scalability is what differentiates Mahout from other machine learning libraries.
Apache Hadoop provides data processing (YARN) and data storage (HDFS) capabilities to Apache Mahout. The key components (daemons) of Apache Hadoop are the resource manager, node managers, name node, data nodes, and secondary name node.
Apache Hadoop can be installed in three different modes, namely local mode, pseudo-distributed mode, and fully-distributed mode.
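As a rough illustration of how the three modes differ in practice, the following sketch guesses the mode from the `fs.defaultFS` property in `core-site.xml`. The function names and the host-based rule are assumptions made for this example, not part of Hadoop. The rule is only a heuristic: a `file:` URI suggests local mode, an HDFS URI on the local host suggests pseudo-distributed mode, and anything else is treated as fully-distributed.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Host names treated as "this machine" (an assumption for this sketch).
LOCAL_HOSTS = {"localhost", "127.0.0.1", "0.0.0.0"}


def read_property(core_site_xml: str, name: str, default: str) -> str:
    """Return the value of a Hadoop property from core-site.xml text."""
    root = ET.fromstring(core_site_xml)
    for prop in root.findall("property"):
        if prop.findtext("name", "").strip() == name:
            return prop.findtext("value", default).strip()
    return default


def guess_mode(core_site_xml: str) -> str:
    """Heuristically classify the installation mode from fs.defaultFS."""
    # When the property is absent, Hadoop falls back to the local file system.
    uri = urlparse(read_property(core_site_xml, "fs.defaultFS", "file:///"))
    if uri.scheme == "file":
        return "local"
    if uri.hostname in LOCAL_HOSTS:
        return "pseudo-distributed"
    return "fully-distributed"


if __name__ == "__main__":
    sample = """<configuration>
      <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:9000</value>
      </property>
    </configuration>"""
    print(guess_mode(sample))
```

A real cluster may need further checks (for example, the number of worker hosts), so the result should be read as a hint rather than a definitive answer.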
Furthermore, Apache Hadoop provides scripts and Web UIs to monitor its daemons.
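A common quick check alongside those scripts and web UIs is the JDK's `jps` tool, which lists running Java processes by class name. The sketch below parses such a listing and reports which of the five daemons are missing. It assumes a single-node (pseudo-distributed) setup where all daemons run on one machine; the helper name and the sample listing, including its process IDs, are hypothetical.

```python
# Class names under which the Hadoop daemons appear in a jps listing.
EXPECTED_DAEMONS = {
    "ResourceManager",
    "NodeManager",
    "NameNode",
    "DataNode",
    "SecondaryNameNode",
}


def missing_daemons(jps_output: str) -> list:
    """Return the expected daemons that do not appear in the jps listing."""
    running = set()
    for line in jps_output.splitlines():
        parts = line.split()
        # Each jps line has the form "<pid> <class name>".
        if len(parts) >= 2:
            running.add(parts[1])
    return sorted(EXPECTED_DAEMONS - running)


if __name__ == "__main__":
    # Hypothetical listing; the process IDs are made up for illustration.
    sample = "4021 NameNode\n4150 DataNode\n4533 ResourceManager\n4790 Jps\n"
    print(missing_daemons(sample))
```

On a multi-node cluster the expected set differs per host, so the check would have to be adapted to each node's role.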
In the next chapter, we will discuss visualization techniques in Apache Mahout.