Guide How to Install Hadoop on Windows 10 for Beginners

Richards
By -
0
Guide How to Install Hadoop on Windows 10 for Beginners

Hadoop is a framework based on the Java programming language and is open source. Hadoop is useful for processing large amounts of data in a distributed manner using data storage capabilities that are also distributed.

Hadoop can store any amount of data and can process it using the MapReduce programming model. Hadoop has also been designed to be able to overcome a problem caused by the failure of a hardware function that can prevent loss of work processes and some data.

Hadoop will cut the data so that it can be distributed to each node in a cluster so that later it will be processed in parallel or locally on each interconnected node.

How to Install Hadoop on Windows 10


Hadoop is often used to handle a number of BigData processes. In this article you will learn how to install Hadoop on Windows 10. However, before that you have to download the components that will be used later when installing Hadoop on Windows 10.
  • Here are the components that you must download first before installing Hadoop on Windows 10.
  • Download and Install the JDK or Java Development Kit first. You can download it at the following link: https://www.oracle.com/java/technologies/javase/javase-jdk8-downloads.html
  • Download Apache Hadoop at the following link: https://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-2.9.2/hadoop-2.9.2.tar.gz
  • Download the Hadoop Patch File for the Windows operating system at the following link: https://github.com/wmwijaya/hadoop2-standalone-windows7-windows10

Well, if you have downloaded some of the files above, then you can take the steps or how to install Hadoop on Windows 10 as described below.

  • Download the Apache Hadoop file from the link above, then extract the file. Go to the bin folder and copy the location of the folder.
  • Go to control panel menu >> System and Security >> System >> Advanced system settings. Then the System Properties box will appear and click Environment Variables.
  • After that, the User Variables click the New button and enter the name Variable JAVA_HOME and paste the link to the bin folder location in the variable value. Then click OK.
  • Next in System Variables find the Path variable and click Edit. Click the New button and paste the link to the bin folder location. After that click OK.
  • Go to the etc folder and then enter the hadoop folder. Open core-site.xml, hadoop-env.cmd, hdfs-site.xml, yarn-site.xml, mapred-site.xml files using notepad or notepad++.
  • In the core-site.xml file write the script shown in the red box below. After that click the Save button.
  • In the mapred-site.xml file write the script shown in the red box below. After that click the Save button.
  • In the yarn-site.xml file write the script shown in the red box below. After that click the Save button.
  • Inside the hadoop folder, create a new folder named datanode and namenode. After that copy the link to the location of the two folders.
  • Back again to Notepad++. Open the hdfs-site.xml file and write the script as shown below. And remember, in the namenode and datanode value tags, fill in the namenode and datanode folder location links.
  • In the hadoop-env.cmd file, change %JAVA_HOME% to link the JDK folder location.
  • Re-open Environment Variables, and under use variables click the New button. After that, in the variable name, type HADOOP_HOME and in the variable value, enter the bin folder link in the hadoop folder. Then click OK.
  • In the system valiables click Path and click the Edit button. After that click the New button and paste the bin folder link.
  • Click New again and paste the sbin folder link in the hadoop folder. After that click OK.
  • Extract the hadoop patch file that you previously downloaded and then enter the bin menu. Copy the entire contents of the file in the bin folder in the hadoop patch file and paste it into the bin folder in the hadoop file. Previously, delete all files that are in the bin folder in the hadoop folder. So, the contents of the bin folder in the hadoop file will be filled with files in the bin folder belonging to the hadoop patch file.
  • Open Command prompt and type the command hdfs namenode -format. then look at the results, the Startup_msg will display the Starting NameNode.
  • Then you go to the hadoop folder and then go back to the sbin folder in CMD. After that type start-all.cmd. Then the result will be like this.

 
Thus the article that explains how to install hadoop on Windows 10. Hopefully this article can provide you benefits.
Tags:

Post a Comment

0Comments

Post a Comment (0)

#buttons=(Ok, Go it!) #days=(20)

Our website uses cookies to enhance your experience. Learn more
Ok, Go it!