prometheus+node_exporter+grafana监控K8S
1.prometheus部署
包下载地址:https://prometheus.io/download/
将包传至/opt
解压
tar xf prometheus-2.53.3.linux-amd64.tar.gz
移动到 /usr/local/prometheus
mv prometheus-2.53.3.linux-amd64 /usr/local/prometheus
prometheus配置文件,可以根据需求更改配置
/usr/local/prometheus/prometheus.yml
配置systemd,用systemctl来管理prometheus
cat > /usr/lib/systemd/system/prometheus.service <<'EOF'
[Unit]
Description=Prometheus Server
Documentation=https://prometheus.io
After=network.target
[Service]
Type=simple
ExecStart=/usr/local/prometheus/prometheus \
--config.file=/usr/local/prometheus/prometheus.yml \
--storage.tsdb.path=/usr/local/prometheus/data/ \
--storage.tsdb.retention=15d \
--web.enable-lifecycle
ExecReload=/bin/kill -HUP $MAINPID
Restart=on-failure
[Install]
WantedBy=multi-user.target
EOF
启动prometheus
systemctl start prometheus
systemctl enable prometheus
netstat -lntp | grep 9090
在Status-Targets 下看到prometheus状态为up
代表能正常采集到数据
2.node_exporter部署
在集群中每个需要监控的K8s节点上部署node_exporter
下载node_exporter包
地址:https://prometheus.io/download/
data:image/s3,"s3://crabby-images/4d449/4d4490bbc44895dace9799a7308cac71f6adb7bb" alt=""
上传包至/opt
data:image/s3,"s3://crabby-images/76a59/76a5932473918f4508f3d79f4343e218d120eb4a" alt=""
解压包
tar xf node_exporter-1.8.2.linux-amd64.tar.gz
移动到 /usr/local/bin
mv node_exporter-1.8.2.linux-amd64 /usr/local/bin
配置systemd,用systemctl来管理node_exporter
cat > /usr/lib/systemd/system/node_exporter.service <<'EOF'
[Unit]
Description=node_exporter
Documentation=https://prometheus.io/
After=network.target
[Service]
Type=simple
ExecStart=/usr/local/bin/node_exporter \
--collector.ntp \
--collector.mountstats \
--collector.systemd \
--collector.tcpstat
ExecReload=/bin/kill -HUP $MAINPID
Restart=on-failure
[Install]
WantedBy=multi-user.target
EOF
启动node_exporter
systemctl start node_exporter
systemctl enable node_exporter
netstat -lntp | grep :9100
data:image/s3,"s3://crabby-images/a91f4/a91f4cd054798df717c96bd487989dfb8afdbb55" alt=""
3.修改prometheus配置文件
修改prometheus配置文件,在末尾加入被监控的k8s集群信息
vim /usr/local/prometheus/prometheus.yml
- job_name: nodes
metrics_path: "/metrics"
static_configs:
- targets:
- 192.168.xx.x:9100
- 192.168.xx.x:9100
- 192.168.xx.x:9100
labels:
service: kubernetes
data:image/s3,"s3://crabby-images/e6dc6/e6dc6cbd1bcb7e045ee74dce006b1ef316f33307" alt=""
重载prometheus服务
systemctl reload prometheus
再次打开prometheus,验证k8s集群数据是否采集到
data:image/s3,"s3://crabby-images/dd4e5/dd4e5e3f92f2e14f2694816019167448b2a3b9d7" alt=""
4.grafana部署
grafana官网:https://grafana.com/grafana/download/11.5.0
安装grafana
sudo yum install -y https://dl.grafana.com/enterprise/release/grafana-enterprise-11.5.0-1.x86_64.rpm
启动grafana
systemctl start grafana-server
systemctl enable grafana-server
netstat -lntp | grep 3000
data:image/s3,"s3://crabby-images/4d85e/4d85ec952ac9a604d18f8fa2436ac735347671d8" alt=""
用户名/密码:admin/admin
data:image/s3,"s3://crabby-images/2056a/2056a315de2449713be650efaa2a1b02831edf6e" alt=""
进入主界面
data:image/s3,"s3://crabby-images/7d2f8/7d2f8e0d70c158a0a3282539107b166ecf085d5d" alt=""
配置数据源
data:image/s3,"s3://crabby-images/a68a9/a68a9e6b9cad2eee61b2742a9e053e491c2b1830" alt=""
添加新的数据源
data:image/s3,"s3://crabby-images/aad9e/aad9e36b482ad064091d4bfde049738a0071f99a" alt=""
选择prometheus
data:image/s3,"s3://crabby-images/1b146/1b1462f9a639a6f76ef25fd1b69896ca4d874d41" alt=""
配置prometheus地址
data:image/s3,"s3://crabby-images/802b0/802b073a84b4255495df8250627324c0d4784042" alt=""
在最下方点保存
data:image/s3,"s3://crabby-images/b37f4/b37f4e53a452c71935df373a06be89be96be8677" alt=""
添加模版
data:image/s3,"s3://crabby-images/6f560/6f560f05bac0f283ca51bbfedca521d1829da3ad" alt=""
导入模版
在模版网站,选择模版,复制ID
我用的模版地址:
https://grafana.com/grafana/dashboards/16098-node-exporter-dashboard-20240520-job/
data:image/s3,"s3://crabby-images/72494/72494cf0ef964614973bf939f75fba1e594b7180" alt=""
导入ID
最后在面板上查看监控的k8s的信息即可
data:image/s3,"s3://crabby-images/995e4/995e46a7d12019f462d0663df57e23b9e6136f4d" alt=""
参考文档: