🔰 Integration of LVM with Hadoop-Cluster & providing Elasticity to Datanode Storage 🔰

rishabhsharma · 4 min read · Nov 4, 2020

Let’s understand a few concepts related to our task:

What is LVM?

Logical Volume Management (LVM) enables combining multiple individual hard drives or disk partitions into a single volume group (VG). That volume group can then be subdivided into logical volumes (LVs) or used as a single large volume. Regular filesystems, such as EXT3 or EXT4, can then be created on a logical volume.

The EXT3 and EXT4 filesystems allow both offline (unmounted) and online (mounted) resizing when growing a filesystem, but only offline resizing when shrinking it.

LVM provides elasticity to storage devices and is a more advanced, flexible alternative to static partitioning.

Task Description 📄

🌀 7.1: Elasticity Task

🔅Integrating LVM with Hadoop and providing Elasticity to DataNode Storage

🔅Increase or Decrease the Size of Static Partition in Linux.

👉🏻Let’s get started…😃

🎯Step 1: Attach physical hard disks to the datanode. Here I have attached two disks:

/dev/sdb (20 GiB) and /dev/sdc (20 GiB)

✔️To verify that the disks are attached successfully, run:

# fdisk -l
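✔️lsblk also gives a compact tree view of the attached disks, and after the later steps it shows the LVM volumes stacked on top of them:

# lsblk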

🎯Step 2: Initialize the hard disks as Physical Volumes (PV).

✔️To initialize both disks as physical volumes, run:

# pvcreate /dev/sdb /dev/sdc
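✔️To confirm the physical volumes were created, use pvdisplay for details or pvs for a one-line summary per PV:

# pvdisplay /dev/sdb /dev/sdc
# pvs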

🎯Step 3: Create a Volume Group (VG) from the physical volumes.

✔️To create the VG (vg_name is a name of your choice), run:

# vgcreate vg_name /dev/sdb /dev/sdc

✔️To check that the VG was created, run:

# vgdisplay vg_name
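✔️vgdisplay reports the total size of the VG (here roughly 40 GiB, the two disks pooled together) and the free space still available for logical volumes; vgs prints the same information as a compact summary:

# vgs vg_name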

🎯Step 4: Create a Logical Volume (LV) in the volume group, sized to whatever you want the datanode to contribute to the namenode. Here I am contributing 25 GB. Note that the LV can be bigger than either individual 20 GiB disk because the VG pools their capacity.

✔️To create the LV, run:

# lvcreate --size 25G --name lv_name vg_name
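✔️To confirm the LV and see its device path (used in all the following steps), run:

# lvdisplay /dev/vg_name/lv_name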

We know that before the new partition (LV) can store any data, it must first be formatted with a filesystem…

🎯Step 5: Format the partition:

# mkfs.ext4 /dev/vg_name/lv_name
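✔️blkid verifies that the filesystem was written, reporting its type and UUID:

# blkid /dev/vg_name/lv_name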

🎯Step 6: Mount the partition on the datanode directory (/dn):

# mount /dev/vg_name/lv_name /dn
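✔️This assumes /dn is already configured as the datanode’s data directory in hdfs-site.xml (the dfs.data.dir property in Hadoop 1.x, dfs.datanode.data.dir in later versions). To confirm the mount and its size:

# df -h /dn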

🎯Step 7: Start the datanode daemon and check the storage contributed to the namenode.

✔️To start the datanode service, run:

# hadoop-daemon.sh start datanode
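✔️jps lists the running Hadoop JVM processes; a DataNode entry confirms the daemon is up:

# jps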

✔️To check the storage contribution report, run:

# hadoop dfsadmin -report

On the fly we can increase the storage contributed to the namenode without unmounting the volume or stopping any services. (Decreasing is different: as noted above, shrinking an EXT4 filesystem requires unmounting it first; a sketch of the shrink procedure is given at the end.)

We can only increase the size up to the free space currently available in the volume group (here 40 GiB total across both disks), so check the free space with vgdisplay first.

🎯Step 8: To extend the volume contribution on the fly, run:

# lvextend --size +7G /dev/vg_name/lv_name
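✔️Alternatively, lvextend’s -r (--resizefs) flag resizes the filesystem in the same step, making the separate resize2fs in Step 9 unnecessary:

# lvextend -r --size +7G /dev/vg_name/lv_name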

🎯Step 9: Resize the filesystem to cover the extended space (an online grow; no reformatting or unmounting is needed):

# resize2fs /dev/vg_name/lv_name
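✔️df -h now shows the larger filesystem mounted on /dn:

# df -h /dn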

🎯Step 10: Check the datanode’s storage contribution to the namenode again.

✔️Run the report once more:

# hadoop dfsadmin -report

We can clearly see that, on the fly, we have increased the storage from 25 GB to 32 GB.
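✔️For the decrease half of the task: shrinking must be done offline. A minimal sketch, assuming the same vg_name/lv_name placeholders and a hypothetical 20 GiB target size; the e2fsck check is required before shrinking, and lvreduce’s --resizefs flag shrinks the filesystem before reducing the LV (any data beyond the new size would be lost, so back up first):

# hadoop-daemon.sh stop datanode
# umount /dn
# e2fsck -f /dev/vg_name/lv_name
# lvreduce --resizefs --size 20G /dev/vg_name/lv_name
# mount /dev/vg_name/lv_name /dn
# hadoop-daemon.sh start datanode

After the datanode restarts, hadoop dfsadmin -report will reflect the reduced contribution.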

TASK COMPLETED👨🏻‍💻

Thanks for reading !!!😊✨

🔰Keep Learning ❗❗🔰Keep Sharing ❗❗
