Saturday, November 15, 2014

HADOOP ADMINISTRATION - Backup and Restoration methodologies

Data Backup: 


Backup the Data using DISCP

hadoop distcp hdfs://nn1.cluster1.com:9000/ hdfs://nn1.cluster2.com:9000/

distcp will run a mapreduce job.


Application Backup : scripts/map reduce programs etc.

hadoop dfsadmin -saveNameSpace

hdfs dfsadmin -metasave filename.txt

hdfs oiv -i /data/namenode/current/fsimage -o fsimage.txt 

No comments:

Post a Comment