监控系统和zabbix基本知识
为了保证集群的健康,我们需要检测服务器的状态,就有了监控系统,专门来监控服务器的状态。
服务器状态出现问题,可以用告警的形式来提示运维人员。
服务器上部署的关键的业务组件,nginx、mysql等等,如果出现故障,他也可以提示给用户,可以
让我们尽快的处理问题。监控系统只能发现,不能解决问题。
zabbix是简介
zabbix是基于web页面的(可视化系统),分布式的系统监控和网络监控的企业级的开源监控方案。
zabbix的功能
1、监控服务器关键硬件的状态
2、可以监控服务器的关键指标
3、可以监控服务器上关键应用的状态
4、可以监控网络设备(路由器、交换机等等)
5、关于监控容器(zabbix5开始已经兼容容器的监控,但是效果不是很好,目前监控容器的主流还是普罗米修斯)
zabbix的组件
C/S模式:client server,服务端和客户端,即zabbix server和zabbix-agent,C/S模式是一个数据采集的模式,客户端把数据发送给服务端。
B/S模式:browser/server,浏览器和服务端,浏览器访问服务端的内容。B/S模式提高的是数据的展示和相关的监控项的配置。
zabbix server:服务端
zabbix-agent:客户端
zabbix proxy:服务端的代理服务器,可以代替服务端收集客户端的数据,然后转发给服务端。可以降低服务端直接收集客户端数据带来的系统压力。
zabbix get:zabbix的命令行工具
zabbix安装
这里我们使用三台服务器完成zabbix实操,架构如下:
zw4:192.168.254.14,zabbix-server
zw5:192.168.254.15,zabbix-agent2、zabbix-proxy
zw6:192.168.254.31,zabbix-agent2
163网易邮箱,做smtp服务器
配置主服务器zw4
1、下载安装deb包
wget https://repo.zabbix.com/zabbix/6.4/ubuntu/pool/main/z/zabbix-release/zabbix-release_6.4-1+ubuntu22.04_all.deb #下载deb包
dpkg -i zabbix-release_6.4-1+ubuntu22.04_all.deb #编译安装
data:image/s3,"s3://crabby-images/b0da1/b0da18449307941b9741835144d08908e7e9ae15" alt=""
apt-get update #更新源
2、安装zabbix
apt install zabbix-server-mysql zabbix-frontend-php zabbix-apache-conf zabbix-sql-scripts zabbix-agent -y #安装zabbix及依赖环境
apt install mariadb-server -y #安装数据库
mysql_secure_installation #先初始化数据库
登录数据库并创建用户和赋权
data:image/s3,"s3://crabby-images/d0e96/d0e96fc6a8942f8adb93240cc12301320e5cf5b4" alt=""
3、导入mysql架构到zabbix库
data:image/s3,"s3://crabby-images/d4699/d4699021170004552bdcd5d6f9acf3441fd09c90" alt=""
4、修改zabbix服务端的配置文件并重启
data:image/s3,"s3://crabby-images/7082c/7082c653ef70e6d648adeea8ee36f6bd4b679704" alt=""
data:image/s3,"s3://crabby-images/667b2/667b236b595f97564c6722ac125ba289c7e300aa" alt=""
data:image/s3,"s3://crabby-images/efc6e/efc6e193f2213101ef0d74282df3f5e286caa10c" alt=""
5、登录服务端界面
data:image/s3,"s3://crabby-images/4c81b/4c81b112306f712e310fd23aa0269fecd96f6036" alt=""
data:image/s3,"s3://crabby-images/faa77/faa775e433110a262c362108d3c768c0b63154c9" alt=""
data:image/s3,"s3://crabby-images/e260f/e260ffc2e637480134535c63ccc826f2f6263e4c" alt=""
data:image/s3,"s3://crabby-images/4037d/4037d575f45faee9892e7f914a076a455a961b1c" alt=""
这时候我们发现有很多乱码
6、解决界面乱码
准备好simkai.ttf文件,这个文件在windows的c盘
data:image/s3,"s3://crabby-images/19938/19938c60d8acd944a5c96062315e457e57442ce7" alt=""
修改php配置文件
data:image/s3,"s3://crabby-images/e0c2b/e0c2b00b89f298bad6081c0a801b85467ab5d584" alt=""
data:image/s3,"s3://crabby-images/093cf/093cfb4ce1697442788214e0e2ee3a40521efcf8" alt=""
data:image/s3,"s3://crabby-images/3dc8b/3dc8b11d1e98c05dd54d897683c8610009f51c36" alt=""
data:image/s3,"s3://crabby-images/edbb0/edbb032eda0bc649013dd34e31b3aeb5cf11ea35" alt=""
可以发现乱码问题已经解决
7、修改zabbix自己的客户端配置文件和主机映射
data:image/s3,"s3://crabby-images/a34be/a34bef77b957ee87289b9a26a0d4f8dadab64eb3" alt=""
data:image/s3,"s3://crabby-images/db32c/db32cc3b0b594754adfe372d658ef204c31c27a8" alt=""
data:image/s3,"s3://crabby-images/c65ff/c65ffe057d3022605fa6ddeb4c4d05f635ee75fb" alt=""
data:image/s3,"s3://crabby-images/4bbcf/4bbcf6829093488958d470b9f3c4838bc18297bb" alt=""
8、这时候我们登录界面显示主服务端配置成功
zabbix的可用性
data:image/s3,"s3://crabby-images/c4c6d/c4c6d350a1e68c46bc127f3b9bf8e1f58f065f88" alt=""
- zbx:默认的,监控服务器设备的网络,以及服务器监控状态,以及告警。
- snmp:监控网络设备
- JMX:开发人员用来监控java应用程序
- ipmi:通过网络管理服务器的电源,温度,风扇速度等等
配置客户端操作zw5
1、下载安装deb包
dpkg -i zabbix-release_6.4-1+ubuntu22.04_all.deb
apt-get update
2、安装zabbix并检查
apt -y install zabbix-agent2
data:image/s3,"s3://crabby-images/55b13/55b130761946cddf5c112a772158f5b9590d5096" alt=""
data:image/s3,"s3://crabby-images/0a931/0a931422a9771fb593b31a2f424e811750a48f37" alt=""
3、配置zabbix客户端配置文件
vim /etc/zabbix/zabbix_agent2.conf
data:image/s3,"s3://crabby-images/326c7/326c7828c06b89e94da671c1c3920ad59800e488" alt=""
data:image/s3,"s3://crabby-images/f34ec/f34ec183721bb4b9e6dffe3ac10afc98795d82d2" alt=""
做主机映射并重启后检查端口
data:image/s3,"s3://crabby-images/e94de/e94dea5d87f0a955b8afbc75c7f12fd5dac5aeaf" alt=""
1、返回主服务器zw4,在主服务器上安装一个命令行工具
data:image/s3,"s3://crabby-images/e2941/e2941f8705b0b38ee8048848ebbfc1602eb6ce26" alt=""
测试服务端与客户端之间是否能通
data:image/s3,"s3://crabby-images/854af/854af751683cd3bfc1c31bfb1f733522be98e690" alt=""
有数值代表成功,0代表失败
2、在界面添加监控主机zw5
data:image/s3,"s3://crabby-images/f745a/f745aa7e3c848357f38f3a0e350bbf805d1545f8" alt=""
data:image/s3,"s3://crabby-images/25254/25254329803519ecedb152b012a6dd40dba27c5b" alt=""
显示监控成功
创建对登录主机数量的监控
在上面安装的基础上,完成监控的创建,如果登录192.168.254.15的主机数超过三个就报警
zabbix的监控的原理
模板------监控项------触发器------实时监控(脚本的实现)
1、查看监控项的配置文件和写在脚本激活方式
vim /etc/zabbix/zabbix_agent2.conf
data:image/s3,"s3://crabby-images/78a35/78a35faa87fb1e262de54eb0749dd23194812ec8" alt=""
data:image/s3,"s3://crabby-images/70e4c/70e4c087042a38be24036385f95343399ecb0e06" alt=""
2、创建自定义监控项文件user.conf
data:image/s3,"s3://crabby-images/861ad/861ade8ee2ae6e824a52dcb33c589b165d952ed7" alt=""
data:image/s3,"s3://crabby-images/3597c/3597c0a760a89e281aa7be7bfd5052268e9aea76" alt=""
这时候我们回到主服务器上测试下是否配置成功
data:image/s3,"s3://crabby-images/4fc0e/4fc0e3c248b86dee060ba1532f8aa94d57a46a25" alt=""
3、页面操作
创建构建并保存
data:image/s3,"s3://crabby-images/6d8cb/6d8cb7ed4c4ac20cbe206be8819b3cb58072e529" alt=""
创建模板
data:image/s3,"s3://crabby-images/96b6d/96b6daa7aa8b8f78b67dcbf681ddb743fc22a575" alt=""
创建监控项并测试
data:image/s3,"s3://crabby-images/44ecc/44eccad95c6775eed783b1680ff26409b898b489" alt=""
data:image/s3,"s3://crabby-images/254c3/254c3258de8cc27f36d7f5f8b89e77144f43a50b" alt=""
创建触发器
data:image/s3,"s3://crabby-images/857d7/857d7009ea40792b586ab977f7642a26447fa410" alt=""
创建图形
data:image/s3,"s3://crabby-images/d6376/d63763eec2c3c750b24d5e79d831d3c5b2fd4bbe" alt=""
添加模板Template Login User
data:image/s3,"s3://crabby-images/2f429/2f4290d0b617ad60ba244ed66cfe101144afada3" alt=""
这时候图形显示我们添加的模板正常
data:image/s3,"s3://crabby-images/be493/be4931cf20047a51eb87c1f1b0d0be787efbd4dc" alt=""
这时候我们使用3台远程连接主机登录192.168.254.15,显示了告警
data:image/s3,"s3://crabby-images/1e04c/1e04c5ce12ae72e1e9f9dfaf1020beb19186b894" alt=""
创建监控nginx服务状态
1、配置nginx主配置文件,开启nginx状态统计
data:image/s3,"s3://crabby-images/e2394/e239449f0dd4f62f059a76d3abed1d2993137a9f" alt=""
data:image/s3,"s3://crabby-images/dda5f/dda5f48018ddb055cdb93655d2fcd35a3db0f8c7" alt=""
2、创建一个自定义监控项文件nginx_test.conf
data:image/s3,"s3://crabby-images/1097e/1097e30d90568b4af79728de6749396547354b58" alt=""
data:image/s3,"s3://crabby-images/90cd7/90cd79506e392730fa5ec6de06e2576cea67b737" alt=""
3、创建nginx监控脚本文件,监控nginx状态
data:image/s3,"s3://crabby-images/f5ca8/f5ca8db8dc59e629ed24577e811f89340be3ff35" alt=""
重启zabbix服务后,在主服务器zw4上测试是否成功监控
data:image/s3,"s3://crabby-images/4c599/4c59946d0089a65d8900601dfce24a3acda9252d" alt=""
data:image/s3,"s3://crabby-images/5d59b/5d59be92445e86216c1372ff2ef37ed3852d3afb" alt=""
这时候我们zw5上nginx只有在开启的状态下,才有数值,关闭则是0
4、配置监控项和触发器
创建模板
data:image/s3,"s3://crabby-images/53961/539618ae0a030af5d230c6b064c3b94ac6d5a288" alt=""
创建监控项并测试
data:image/s3,"s3://crabby-images/bf276/bf276d4c587270f1dd8916dda9ff13ce8ed4e559" alt=""
data:image/s3,"s3://crabby-images/da6b9/da6b90533204057bb6ddd7045b95308c0c39d10e" alt=""
创建触发器
data:image/s3,"s3://crabby-images/17b35/17b35f2fe996f84dddd3e18b05b752ba2732d277" alt=""
回到主机更新下模板即完成监控
我们关闭nginx,会发现有告警生成
data:image/s3,"s3://crabby-images/13697/13697bcaceef8195c109abf7987fe7b7dae5b12e" alt=""
5、设置邮件告警
创建媒介类型及测试
data:image/s3,"s3://crabby-images/d2ba8/d2ba8f60152fa937cd1bbb1ea74c2de6e1cac397" alt=""
data:image/s3,"s3://crabby-images/ef9da/ef9da1ec3f26ce86e58be096b4173573aca298e6" alt=""
添加触发器动作
data:image/s3,"s3://crabby-images/4d9b8/4d9b8dfcf569725f81ce54348370aaf57e2107d1" alt=""
data:image/s3,"s3://crabby-images/11e18/11e1833dc763bf5961d3f143ac6aa3903ad87e20" alt=""
data:image/s3,"s3://crabby-images/c080b/c080b193b0648efe572f7fbb1cb98bc9323e5113" alt=""
主题:Problem: {EVENT.NAME}
消息:<b>Problem started</b> at {EVENT.TIME} on {EVENT.DATE}<br><b>Problem name:</b> {EVENT.NAME}<br><b>Host:</b> {HOST.NAME}<br><b>Severity:</b> {EVENT.SEVERITY}<br><b>Operational data:</b> {EVENT.OPDATA}<br><b>Original problem ID:</b> {EVENT.ID}<br>{TRIGGER.URL}
这时候状态会变成已启用
data:image/s3,"s3://crabby-images/e65c3/e65c33909787d844bd176cd79999e22fc9e079c7" alt=""
然后回到用户Admin,点开添加告警媒介
data:image/s3,"s3://crabby-images/21fa0/21fa07b91d243ec714a4d0a42e47ad08f69c7a3e" alt=""
data:image/s3,"s3://crabby-images/97d80/97d80ef79d3e70b5d5a1169aa2b096d9baf1296a" alt=""
这时候当我们关闭nginx服务后,会触发报警并自动发送给我们邮箱
data:image/s3,"s3://crabby-images/11a3b/11a3b5ae52e6ab8329b7015e73ce63240747b831" alt=""
监控mysql的状态
如果zw6的mysql服务关闭,发生邮件告警。
1、首先客户端zw6安装zabbix,将zw6加入到主zabbix,方法和上面配置客户端zw5一样
data:image/s3,"s3://crabby-images/05da9/05da938a2ff1b227eb1cc89fdc8cf765608d6954" alt=""
2、创建自定义监控项文件mysql_test1.conf
data:image/s3,"s3://crabby-images/6e034/6e034191dab6823843a5128e183627777decd1ef" alt=""
这时候我们回到主服务器上测试下是否配置成功
data:image/s3,"s3://crabby-images/a799e/a799efcd86613b597cf917e1f553f9a403b3b6fb" alt=""
3、配置监控项和触发器
添加模板
data:image/s3,"s3://crabby-images/42536/42536c4bdd76d77fde85389294ae059a3dd11b0d" alt=""
添加监控项并测试
data:image/s3,"s3://crabby-images/a095f/a095f42142969ea52d3990ab68c2ea40d327072f" alt=""
添加触发器
data:image/s3,"s3://crabby-images/25412/254127bbdbd521b2b40229fbd0a4a52dec0473f5" alt=""
回到主机更新下模板即完成监控
我们关闭mysql,会发现有告警生成
data:image/s3,"s3://crabby-images/b2917/b2917a592d37be48f2e6f81058105b924c3c4362" alt=""
4、设置邮件告警
创建媒介类型及测试
data:image/s3,"s3://crabby-images/95d47/95d470b2779fe449ab162b258f578af7d9554160" alt=""
添加触发器动作
data:image/s3,"s3://crabby-images/c7744/c7744ed725b4332e3e8e9b7a0a706adced360040" alt=""
动作2我们之前做nginx的时候配置过了,直接点更新即可。
这时候状态会变成已启用
data:image/s3,"s3://crabby-images/ced64/ced6417fc52cd58685ae0b1cf3877956a3ebaa57" alt=""
然后回到用户Admin,告警媒介也不需要再添加了。
这时候当我们关闭mysql服务后,会触发报警并自动发送给我们邮箱
data:image/s3,"s3://crabby-images/6c6a4/6c6a4373cae531b9961c62c9e95a5ca8875c30f5" alt=""