Configuring Hadoop Cluster using Ansible Playbook…

Arjun Nigam
3 min readDec 8, 2020

Everyone is much aware about the Big data , its problems and the role Hadoop has played in tackling this problem.

If you have no clue what i just said . You can refer to my previous blog where i have talked about the same and explained this in a much simpler way:

https://arjunnigam10.medium.com/big-data-in-todays-world-e8778dd4743e

But What the World needs ?

The Answer to it is Automation

Sometimes you just want to sit back and just run a program which will do everything you want . So here’s my Article on Automating configuration of Hadoop Cluster using Ansible Playbook. Anyone can use this Playbook without knowing the actual Linux commands and set-up all of this with just a click of a button…

If you are a newbie and have no idea of those dark highlighted words above ,then lets give them some meaning first…

Ansible

Ansible is a software tool that provides simple but powerful automation for cross-platform computer support. It is primarily intended for IT professionals, who use it for application deployment, updates on workstations and servers, cloud provisioning, configuration management, intra-service orchestration, and nearly anything a systems administrator does on a weekly or daily basis. Ansible doesn’t depend on agent software and has no additional security infrastructure, so it’s easy to deploy.

Hadoop

I guess you have already referred to my previous blog the link to which i gave at the starting of this Article.

So now that you are well versed with those dark highlighted words :)

Lets get on with the showcase of my Playbook…

First things first…

Here are the pictures of my Ansible Playbook which i used to achieve the same…

Now that you have seen my Playbook.

Its time to run it…

Command used: ansible-playbook Your_Playbook_name

And as you can observe in the images below ,the playbook has been successfully executed…

Now its time to check whether we were successful or not in configuring Hadoop .

To check, command used: hadoop dfsadmin -report

And as you can observe in the image above , our Hadoop cluster has been successfully configured!!!!!!!!!

That was all from side , Hope this was beneficial for you guys…

View my Playbook on Github:

https://github.com/Arjun0071/hadoop_Playbook

Arjun Nigam

https://www.linkedin.com/in/arjun-nigam-4728b11b8

Signing off…

--

--