Prerequisites
Grafana is already deployed and the dashboard is receiving data.
Dashboard ID/URL: Node Exporter Full | Grafana Labs
data:image/s3,"s3://crabby-images/df380/df380c077cba1f17806daae9377b37405e9fb523" alt=""
Create a WeCom (企业微信) group robot
data:image/s3,"s3://crabby-images/e83d0/e83d08cbcf12c81a35d2d4ca3d239f31bc69f1b7" alt=""
Note: a robot cannot be created if the group contains external (third-party) members.
data:image/s3,"s3://crabby-images/ebb26/ebb267cbb0cc85171f3e02b602b27d250b1c491a" alt=""
data:image/s3,"s3://crabby-images/94279/9427952d0f5544cdceabae866857134268d1a4e7" alt=""
data:image/s3,"s3://crabby-images/d781a/d781a9c56db978530ed0e166292ef5b695047ba9" alt=""
Once the robot has been created, record its Webhook URL.
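Before wiring the robot into Grafana, it can help to confirm that the Webhook URL works. The following is a minimal sketch, assuming the standard WeCom group robot message format (`msgtype` of `markdown` with a `content` field); the helper names `build_markdown_payload` and `send_to_robot` are hypothetical, and the URL in the usage note is a placeholder for the address recorded above.

```python
import json
import urllib.request


def build_markdown_payload(content: str) -> dict:
    """Build the JSON body for a WeCom robot markdown message."""
    return {"msgtype": "markdown", "markdown": {"content": content}}


def send_to_robot(webhook_url: str, content: str) -> dict:
    """POST a markdown message to the robot and return the parsed JSON reply.

    webhook_url is the address recorded when the robot was created.
    """
    body = json.dumps(build_markdown_payload(content)).encode("utf-8")
    request = urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call would look like `send_to_robot("https://qyapi.weixin.qq.com/cgi-bin/webhook/send?key=YOUR_KEY", "test message")`, where `YOUR_KEY` stands for the key in your own Webhook URL. If the test message shows up in the group, the URL is ready to use in Grafana.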
Configure the alert message template in Grafana
data:image/s3,"s3://crabby-images/ecc88/ecc8866c39b11bfe4e0145efef52dbf14b0127ca" alt=""
data:image/s3,"s3://crabby-images/19551/19551a916b6d995eba490cdc114161ce6b323393" alt=""
bash
{{ define "KDDI" }}
{{ range $i,$alert :=.Alerts }}
===KDDI Resource Usage Alert===
Alert status: {{ if eq .Status "firing" }} <font color="#FF0000">Firing</font> {{ else if eq .Status "resolved" }} <font color="#00FF00">Resolved</font>{{ end }}
Severity: {{ .Labels.level }}
Alert type: {{ .Labels.alertname }}
Affected host: {{ if eq .Labels.instance "123.56.222.147:9100" }} <font color="#0000FF" style="font-weight:bold;">123.56.222.147</font> {{ else if eq .Labels.instance "47.94.168.75:9100" }} <font color="#0000FF" style="font-weight:bold;">47.94.168.75</font>{{ end }}
Summary: {{ .Annotations.summary }}
Details: {{ .Annotations.description }}
Triggered at: {{ .StartsAt.Local.Format "2006-01-02 15:04:05" }}
==========end==========
{{ end }}
{{ end }}
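To preview roughly what one rendered alert looks like, the template's fields can be mimicked outside Grafana. This is only an approximation in Python, not Grafana's template engine: the `render_alert` helper and the sample alert values are hypothetical, the field labels are written in English for illustration, and the host is derived by stripping the exporter port instead of matching the two hard-coded instances. Note that Go's reference time layout `2006-01-02 15:04:05` corresponds to `%Y-%m-%d %H:%M:%S` in `strftime`.

```python
from datetime import datetime


def render_alert(alert: dict) -> str:
    """Approximate the text the "KDDI" template emits for a single alert."""
    status_text = {"firing": "Firing", "resolved": "Resolved"}.get(alert["status"], "")
    labels = alert["labels"]
    annotations = alert["annotations"]
    # Simplification: drop the exporter port, e.g. "47.94.168.75:9100" -> "47.94.168.75".
    host = labels["instance"].rsplit(":", 1)[0]
    # Equivalent of the Go layout "2006-01-02 15:04:05".
    started = alert["starts_at"].strftime("%Y-%m-%d %H:%M:%S")
    return "\n".join([
        "===KDDI Resource Usage Alert===",
        f"Alert status: {status_text}",
        f"Severity: {labels['level']}",
        f"Alert type: {labels['alertname']}",
        f"Affected host: {host}",
        f"Summary: {annotations['summary']}",
        f"Details: {annotations['description']}",
        f"Triggered at: {started}",
        "==========end==========",
    ])


# Hypothetical sample alert, for illustration only.
sample_alert = {
    "status": "firing",
    "labels": {"level": "warning", "alertname": "CPU usage", "instance": "47.94.168.75:9100"},
    "annotations": {"summary": "CPU usage above 80", "description": "CPU usage is high"},
    "starts_at": datetime(2024, 1, 2, 15, 4, 5),
}
message = render_alert(sample_alert)
print(message)
```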
Configure the WeCom robot in Grafana
data:image/s3,"s3://crabby-images/851af/851afd647bcc5f8e1194495087d95d7e79678ec4" alt=""
data:image/s3,"s3://crabby-images/b28a6/b28a617f9b2b69dce782fc1c9ce775ffba4dba18" alt=""
data:image/s3,"s3://crabby-images/ece64/ece645a738375a5cbd30785dec14c814812784e7" alt=""
Select the alert message template, set the alert title, and save.
Configure a CPU alert rule on the Grafana panel (threshold: 80%)
data:image/s3,"s3://crabby-images/3de36/3de36eb2adf1b7c7810e7cf11691e47a3b502297" alt=""
data:image/s3,"s3://crabby-images/852b4/852b4ff816a04020cd36b7fef0ffd02da5509b87" alt=""
data:image/s3,"s3://crabby-images/ee322/ee322bc2364c1ba5477688487bbd892785a9afd9" alt=""
bash
100 - (avg by (instance,job)(irate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)
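The arithmetic behind this query can be sketched in a few lines. `irate` takes the last two samples of the idle counter for each CPU and computes a per-second rate, `avg by (instance,job)` averages those rates across CPUs, and subtracting from 100 turns the idle percentage into a usage percentage. The sample values and helper names below are made up for illustration.

```python
def idle_rate(prev, last):
    """Per-second increase between the last two (timestamp, counter) samples."""
    (t0, v0), (t1, v1) = prev, last
    return (v1 - v0) / (t1 - t0)


def cpu_usage_percent(idle_samples):
    """Mirror 100 - avg(irate(idle)) * 100 for one instance."""
    rates = [idle_rate(prev, last) for prev, last in idle_samples.values()]
    return 100 - (sum(rates) / len(rates)) * 100


# Hypothetical idle-seconds counters for two CPUs, scraped 15 s apart.
samples = {
    "cpu0": ((0.0, 1000.0), (15.0, 1007.5)),   # idle 50% of the interval
    "cpu1": ((0.0, 2000.0), (15.0, 2011.25)),  # idle 75% of the interval
}
usage = cpu_usage_percent(samples)
print(f"CPU usage: {usage:.1f}%")      # CPU usage: 37.5%
print("alert fires:", usage > 80)      # alert fires: False
```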
data:image/s3,"s3://crabby-images/32722/3272266932483f46d34479cabbae3b7afa26d5a3" alt=""
data:image/s3,"s3://crabby-images/ee6c8/ee6c82c3b7478700d672a0a14134db42b2309440" alt=""
data:image/s3,"s3://crabby-images/48461/484611eebfc3331979b38105a179eee920b9e50c" alt=""
Save and exit.
data:image/s3,"s3://crabby-images/2e0f8/2e0f889431bd86d594a7fa364e7acef147ef4caa" alt=""
Check the alert message in WeCom
data:image/s3,"s3://crabby-images/01cce/01ccee0c0fbfc39fbce735db0c648946ef28688e" alt=""
Memory and disk alerts are configured in essentially the same way as the CPU alert. The queries for each are as follows.
data:image/s3,"s3://crabby-images/3da72/3da722f2014ae19ff59bd3a5ba5bc63ac91c1845" alt=""
bash
# CPU
100 - (avg by (instance,job)(irate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)
# Memory
(1 - node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes) * 100
# Disk
100 - (node_filesystem_free_bytes{fstype=~"ext4|xfs"} / node_filesystem_size_bytes{fstype=~"ext4|xfs"} * 100)
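As a quick sanity check of the memory and disk expressions, the same arithmetic can be worked through with made-up numbers. The function names and byte values below are assumptions for illustration, and the threshold of 80 is the one used for the CPU rule.

```python
GIB = 1024 ** 3
THRESHOLD = 80  # same threshold as the CPU rule


def memory_usage_percent(mem_available_bytes, mem_total_bytes):
    """Mirror (1 - MemAvailable / MemTotal) * 100."""
    return (1 - mem_available_bytes / mem_total_bytes) * 100


def disk_usage_percent(free_bytes, size_bytes):
    """Mirror 100 - (free / size * 100)."""
    return 100 - (free_bytes / size_bytes * 100)


# Hypothetical host: 4 GiB available of 16 GiB RAM, 25 GiB free of a 200 GiB disk.
mem = memory_usage_percent(4 * GIB, 16 * GIB)
disk = disk_usage_percent(25 * GIB, 200 * GIB)
print(f"memory: {mem:.1f}% alert={mem > THRESHOLD}")  # memory: 75.0% alert=False
print(f"disk: {disk:.1f}% alert={disk > THRESHOLD}")  # disk: 87.5% alert=True
```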