2025 年 Windows 10/11 最稳最全版
Hadoop 2.7.2 单机伪分布式完整安装教程(细节到每个配置文件)
适用于所有课程设计、毕业设计、期末大作业,复制粘贴即用,100%成功!
本来安装的2.7.1版,虽然启动成功了,但理论上还不是更好的,因为GitHub上没有找到对应的2.7.1版本的winutils和hadoop.dll,所以为了更好的兼容和适配,这里选择更高一点的2.7.2版本
一、下载地址(全部直链,2025年永久有效)
| 项目 | 版本 | 下载地址(点开直接下) |
|---|---|---|
| Hadoop 2.7.2 | 2.7.2 | https://archive.apache.org/dist/hadoop/common/hadoop-2.7.2/hadoop-2.7.2.tar.gz |
| winutils + hadoop.dll | hadoop-2.7.2 | https://github.com/cdarlint/winutils/raw/master/hadoop-2.7.2/bin/winutils.exe https://github.com/cdarlint/winutils/raw/master/hadoop-2.7.2/bin/hadoop.dll |
| JDK 8(推荐) | 8u422 | https://github.com/adoptium/temurin8-binaries/releases/download/jdk8u422-b05/OpenJDK8U-jdk_x64_windows_hotspot_8u422b05.msi |
二、最终目录结构(强烈建议这样放)
D:\
└─hadoop-2.7.2\
├─bin\
├─winutils.exe
└─hadoop.dll
├─etc\
├─logs\
└─data\
├─namenode\
└─datanode\
三、环境变量(永久生效)
右键「此电脑」→ 属性 → 高级系统设置 → 环境变量 → 系统变量
| 变量名 | 变量值 |
|---|---|
| HADOOP_HOME | D:\hadoop |
| JAVA_HOME | D:\jdk-8.0.422.5-hotspot |
| Path 追加 | %HADOOP_HOME%\bin |
重启所有 CMD 窗口使生效!
四、核心配置文件(全部精确内容,直接替换)
1. D:\hadoop-2.7.2\etc\hadoop\hadoop-env.cmd(加入或修改)
cmd
set JAVA_HOME=D:\jdk-8.0.422.5-hotspot
set HADOOP_HOME=D:\hadoop-2.7.2
set HADOOP_CONF_DIR=%HADOOP_HOME%\etc\hadoop
set HADOOP_IDENT_STRING=%USERNAME%
2. D:\hadoop-2.7.2\etc\hadoop\core-site.xml(全部替换)
xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>file:///D:/hadoop-2.7.2/data/tmp</value>
</property>
</configuration>
3. D:\hadoop-2.7.2\etc\hadoop\hdfs-site.xml(全部替换)
xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:///D:/hadoop-2.7.2/data/namenode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:///D:/hadoop-2.7.2/data/datanode</value>
</property>
<property>
<name>dfs.namenode.http-address</name>
<value>0.0.0.0:50070</value>
</property>
</configuration>
4. D:\hadoop-2.7.2\etc\hadoop\mapred-site.xml(复制 mapred-site.xml.template 重命名)
xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.application.classpath</name>
<value>D:/hadoop-2.7.2/etc/hadoop,D:/hadoop-2.7.2/share/hadoop/common/*,D:/hadoop-2.7.2/share/hadoop/common/lib/*,D:/hadoop-2.7.2/share/hadoop/hdfs/*,D:/hadoop-2.7.2/share/hadoop/hdfs/lib/*,D:/hadoop-2.7.2/share/hadoop/mapreduce/*,D:/hadoop-2.7.2/share/hadoop/mapreduce/lib/*,D:/hadoop-2.7.2/share/hadoop/yarn/*,D:/hadoop-2.7.2/share/hadoop/yarn/lib/*</value>
</property>
</configuration>
5. D:\hadoop-2.7.2\etc\hadoop\yarn-site.xml(全部替换)
xml
<?xml version="1.0"?>
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.hostname</name>
<value>localhost</value>
</property>
<property>
<name>yarn.nodemanager.local-dirs</name>
<value>file:///D:/hadoop-2.7.2/data/yarn/local</value>
</property>
<property>
<name>yarn.nodemanager.log-dirs</name>
<value>file:///D:/hadoop-2.7.2/data/yarn/logs</value>
</property>
</configuration>
五、创建必要目录(一次性执行)
cmd
mkdir D:\hadoop-2.7.2\data\namenode
mkdir D:\hadoop-2.7.2\data\datanode
mkdir D:\hadoop-2.7.2\data\tmp
mkdir D:\hadoop-2.7.2\data\yarn\local
mkdir D:\hadoop-2.7.2\data\yarn\logs
六、一键启动脚本(永久保存,以后每天双击)
新建文件 D:\hadoop-2.7.2\一键启动Hadoop2.7.2.bat
bat
@echo off
echo ================================
echo Hadoop 2.7.2 Windows 一键启动
echo ================================
:: 强制使用 UTF-8 编码(关键!)
chcp 65001 >nul
:: 设置控制台标题
title Hadoop 2.7.x 一键启动(中文无乱码)
cd /d D:\hadoop-2.7.2
:: 第一次运行请取消下面这行的注释(只执行一次)
:: bin\hdfs namenode -format -force
start "NameNode" cmd /k bin\hdfs namenode
timeout /t 12 >nul
start "DataNode" cmd /k bin\hdfs datanode
timeout /t 8 >nul
start "ResourceManager" cmd /k bin\yarn resourcemanager
timeout /t 8 >nul
start "NodeManager" cmd /k bin\yarn nodemanager
echo.
echo 全部启动成功!
echo HDFS地址: http://localhost:50070
echo YARN地址: http://localhost:8088
echo.
jps
pause
七、验证成功(必须全部绿灯)
cmd
jps
# 必须看到:
# NameNode
# DataNode
# ResourceManager
# NodeManager
# 测试 HDFS
hdfs dfs -mkdir /test
hdfs dfs -put %windir%\win.ini /test/
hdfs dfs -cat /test/win.ini
# 测试 MapReduce
hadoop jar share\hadoop\mapreduce\hadoop-mapreduce-examples-2.7.2.jar pi 10 100
八、停止脚本(新建停止.bat)
bat
@echo off
taskkill /f /fi "windowtitle eq NameNode*"
taskkill /f /fi "windowtitle eq DataNode*"
taskkill /f /fi "windowtitle eq ResourceManager*"
taskkill /f /fi "windowtitle eq NodeManager*"
echo Hadoop 已全部关闭
pause
现在你拥有了 Windows 上最干净、最稳定的 Hadoop 2.7.2 + YARN 环境!