I. Cluster deployment plan
Component | s201 | s202 | s203 |
---|---|---|---|
HDFS | DataNode NameNode | DataNode | DataNode SecondaryNameNode |
YARN | ResourceManager NodeManager | NodeManager | NodeManager |
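II. Configure hostname and IP mapping
1. Map hostnames to IP addresses
Set the hostname on each node (s201, s202 or s203):
vim /etc/hostname
Add the following entries to the mapping file on every node:
vim /etc/hosts
192.168.5.130 s201
192.168.5.131 s202
192.168.5.132 s203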
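Editing /etc/hosts by hand on three machines is easy to get wrong, so the mapping can also be appended by a small helper. The following is a minimal sketch, not part of the original steps: the function name add_host_entry and the HOSTS_FILE override are assumptions made for illustration, and the helper only skips hostnames that are already present.

```shell
#!/bin/sh
# Append "ip hostname" to a hosts file unless the hostname is already mapped.
# HOSTS_FILE is a hypothetical override so the helper can be tried on a copy
# first; it defaults to the real /etc/hosts.
add_host_entry() {
    ip="$1"
    name="$2"
    file="${HOSTS_FILE:-/etc/hosts}"
    # -w matches the hostname as a whole word, so "s20" does not match "s201".
    if ! grep -qw "$name" "$file" 2>/dev/null; then
        printf '%s %s\n' "$ip" "$name" >> "$file"
    fi
}
```

Run as root on each node, `add_host_entry 192.168.5.130 s201` (and likewise for s202 and s203) leaves the file unchanged if the entry already exists, so the call can be repeated safely.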
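2. Set a static IP
cd /etc/sysconfig/network-scripts/
Edit the ifcfg-* file of your network interface. IPADDR must match the node's entry in /etc/hosts (shown here for s201; use .131 and .132 on s202 and s203). 255.255.255.0 is a netmask and belongs in NETMASK. GATEWAY must be your network's gateway address; 192.168.5.2 assumes the gateway is the same host as DNS1, as is typical for a VMware NAT network.
BOOTPROTO="static"
IPADDR="192.168.5.130"
NETMASK="255.255.255.0"
GATEWAY="192.168.5.2"
DNS1="192.168.5.2"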
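III. Configure passwordless SSH login
Generate a key pair (press Enter to accept the defaults):
ssh-keygen
Copy the public key to every node, including the local one:
ssh-copy-id s201
ssh-copy-id s202
ssh-copy-id s203
Test the login; it should not ask for a password:
ssh s202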
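The key distribution and the login test can be wrapped in a loop over the three hosts. This is a sketch under assumptions: the function names and the DRY_RUN switch are made up for illustration, and the host list is taken from the deployment plan. BatchMode and ConnectTimeout are standard OpenSSH client options.

```shell
#!/bin/sh
# Hosts from the deployment plan; adjust if your hostnames differ.
CLUSTER_HOSTS="s201 s202 s203"

# Copy the local public key to every node. With DRY_RUN=1 (a hypothetical
# switch for this sketch) the commands are printed instead of executed.
copy_keys() {
    for host in $CLUSTER_HOSTS; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "ssh-copy-id $host"
        else
            ssh-copy-id "$host"
        fi
    done
}

# Check that each node accepts the key. BatchMode=yes makes ssh fail
# instead of falling back to a password prompt.
check_logins() {
    for host in $CLUSTER_HOSTS; do
        if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true; then
            echo "$host: ok"
        else
            echo "$host: passwordless login failed"
        fi
    done
}
```

Calling `copy_keys` once and then `check_logins` on the node that will run the start scripts (s201 in this plan) shows which hosts still need attention.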
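IV. Disable the firewall
Stop the firewall, confirm that it is inactive, and keep it from starting at boot:
systemctl stop firewalld
systemctl status firewalld
systemctl disable firewalld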
V. Install the JDK
Omitted.
VI. Install Hadoop
1. Extract the archive
tar -zxvf hadoop-2.7.7.tar.gz -C /usr/local/
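2. Configure environment variables
Add the following lines to /etc/profile, then run source /etc/profile to apply them:
vim /etc/profile
export HADOOP_HOME=/usr/local/hadoop-2.7.7
export PATH=$HADOOP_HOME/bin:$HADOOP_HOME/sbin:$PATH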
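3. Edit the configuration files (important)
The files below are in /usr/local/hadoop-2.7.7/etc/hadoop/. Each <property> block goes inside the file's <configuration> element.
(1) Core configuration file core-site.xml: NameNode address and data directory
<property>
<name>fs.defaultFS</name>
<value>hdfs://s201:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/usr/local/hadoop-2.7.7/data/tmp</value>
</property>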
(2) Configure HDFS: runtime environment in hadoop-env.sh
export JAVA_HOME=/usr/local/jdk1.8.0_291
hdfs-site.xml: SecondaryNameNode address
<property>
<name>dfs.namenode.secondary.http-address</name>
<value>s203:50090</value>
</property>
(3) Configure YARN: runtime environment in yarn-env.sh
export JAVA_HOME=/usr/local/jdk1.8.0_291
yarn-site.xml: shuffle service and ResourceManager host
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.hostname</name>
<value>s201</value>
</property>
(4) Configure MapReduce: runtime environment in mapred-env.sh
export JAVA_HOME=/usr/local/jdk1.8.0_291
Create mapred-site.xml from the template and set the MapReduce framework to yarn (the alternative is the local mode, local):
cp mapred-site.xml.template mapred-site.xml
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<!-- Job history server -->
<property>
<name>mapreduce.jobhistory.address</name>
<value>s201:10020</value>
</property>
<property>
<name>mapreduce.jobhistory.webapp.address</name>
<value>s201:19888</value>
</property>
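A typo in a property name is silently ignored by Hadoop, so it can help to read the values back after editing. The helper below is a rough sketch rather than a real XML parser: get_prop is a hypothetical name, and it assumes each <name> and <value> element sits on its own line, as in the snippets above.

```shell
#!/bin/sh
# Print the <value> of one property from a Hadoop *-site.xml file.
# $1 = path to the XML file, $2 = property name.
# Assumes one <name> or <value> element per line; prints nothing if the
# property is not found.
get_prop() {
    awk -v key="$2" '
        /<name>/ {
            n = $0
            sub(/.*<name>/, "", n)     # drop everything up to the tag
            sub(/<\/name>.*/, "", n)   # drop the closing tag and the rest
            found = (n == key)
        }
        found && /<value>/ {
            v = $0
            sub(/.*<value>/, "", v)
            sub(/<\/value>.*/, "", v)
            print v
            exit
        }
    ' "$1"
}
```

For example, `get_prop /usr/local/hadoop-2.7.7/etc/hadoop/core-site.xml fs.defaultFS` should print the NameNode URI configured above. Once Hadoop is on the PATH, `hdfs getconf -confKey fs.defaultFS` reports the value that Hadoop itself resolves.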
(5) Configure the worker nodes in the slaves file, one hostname per line
vim slaves
s201
s202
s203
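All three nodes need the same Hadoop directory and configuration before the cluster is started. The steps above do not say how s202 and s203 get theirs; repeating every step on each node is one option, copying the finished directory from s201 is another. The sketch below shows the second option. It assumes the JDK and the /etc/profile entries are already in place on the other nodes, and the function name and the DRY_RUN switch are made up for illustration.

```shell
#!/bin/sh
HADOOP_DIR="/usr/local/hadoop-2.7.7"   # install path used in this guide
OTHER_HOSTS="s202 s203"                # every node except the one you edited

# Copy the configured Hadoop directory to the other nodes over SSH.
# With DRY_RUN=1 (a hypothetical switch) the commands are only printed.
sync_hadoop() {
    for host in $OTHER_HOSTS; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "scp -r $HADOOP_DIR $host:/usr/local/"
        else
            scp -r "$HADOOP_DIR" "$host:/usr/local/"
        fi
    done
}
```

Running `DRY_RUN=1; sync_hadoop` first prints the two scp commands, so the targets can be checked before anything is copied.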
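4. Format and start
Format the NameNode (on s201, before the first start only; formatting again erases the HDFS metadata)
hdfs namenode -format
Start the cluster (on s201)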
start-dfs.sh
start-yarn.sh
# Start the job history server
mr-jobhistory-daemon.sh start historyserver
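After the start scripts finish, the jps tool from the JDK lists the Java processes on a node, which can be compared with the deployment plan. The helper below is a small sketch with a hypothetical name, missing_daemons. It reads jps output on standard input and prints every expected daemon that is not running.

```shell
#!/bin/sh
# Print each expected daemon (given as arguments) that is absent from the
# `jps` output read on stdin.
missing_daemons() {
    jps_output=$(cat)
    for daemon in "$@"; do
        # jps prints "<pid> <ClassName>"; compare the second field exactly
        # so that "NameNode" does not also match "SecondaryNameNode".
        if ! printf '%s\n' "$jps_output" |
            awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }'; then
            echo "$daemon"
        fi
    done
}
```

On s201, `jps | missing_daemons NameNode DataNode ResourceManager NodeManager JobHistoryServer` should print nothing when everything is up. On s203 the expected list would be DataNode, SecondaryNameNode and NodeManager.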
Stop the cluster
stop-dfs.sh
stop-yarn.sh
5. Test the cluster
NameNode web UI: http://192.168.5.130:50070
Create a directory
hdfs dfs -mkdir -p /user/input
Upload a file
hdfs dfs -put /root/a.txt /user/input
Delete the directory
hdfs dfs -rm -r /user/input
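The three commands can be chained into one round-trip check that stops at the first failure. This is a sketch only: hdfs_smoke_test and the HDFS_CMD override are names introduced for illustration, and the function removes the target directory at the end, so it should point at a directory that holds nothing you want to keep.

```shell
#!/bin/sh
# One HDFS round trip: create a directory, upload a file, list it, remove it.
# $1 = local file to upload, $2 = HDFS directory to create and delete again.
# HDFS_CMD is a hypothetical override (for example "echo" to preview the
# commands); it defaults to the real hdfs binary.
hdfs_smoke_test() {
    local_file="$1"
    dir="$2"
    hdfs_cmd="${HDFS_CMD:-hdfs}"
    # && stops the chain at the first command that fails.
    $hdfs_cmd dfs -mkdir -p "$dir" &&
    $hdfs_cmd dfs -put "$local_file" "$dir" &&
    $hdfs_cmd dfs -ls "$dir" &&
    $hdfs_cmd dfs -rm -r "$dir"
}
```

With the paths used above the call is `hdfs_smoke_test /root/a.txt /user/input`. A non-zero exit status means one of the four steps failed.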