Diagnosing disk anomalies on Ceph OSD nodes

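1 Run ceph orch device ls --refresh

--refresh triggers an asynchronous rescan. Wait about five minutes and run the command again: any disk whose REFRESHED value has not been reset is abnormal. In the output below, the two disks on ceph01 still show "44h ago" on the second run, while the disks on ceph02 and ceph03 drop to under a minute.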

[ceph: root@ceph03 /]# ceph orch device ls --refresh
HOST    PATH      TYPE  DEVICE ID              SIZE  AVAILABLE  REFRESHED  REJECT REASONS                                                           
ceph01  /dev/vdb  hdd   b2587983-634c-478e-8   100G  No         44h ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph01  /dev/vdc  hdd   ace8da2b-a227-4e05-b  20.0G  No         44h ago    Has a FileSystem, LVM detected                                           
ceph02  /dev/vdb  hdd   7c2a7ebe-c6b3-400f-9   100G  No         14m ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph02  /dev/vdc  hdd   f4a5ce60-7eb5-4f77-b  20.0G  No         14m ago    Has a FileSystem, LVM detected                                           
ceph03  /dev/vdb  hdd   fa38bf8a-d8bc-4aca-b   100G  No         14m ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph03  /dev/vdc  hdd   492ab205-58af-436c-8  20.0G  No         14m ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
[ceph: root@ceph03 /]# ceph orch device ls --refresh
HOST    PATH      TYPE  DEVICE ID              SIZE  AVAILABLE  REFRESHED  REJECT REASONS                                                           
ceph01  /dev/vdb  hdd   b2587983-634c-478e-8   100G  No         44h ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph01  /dev/vdc  hdd   ace8da2b-a227-4e05-b  20.0G  No         44h ago    Has a FileSystem, LVM detected                                           
ceph02  /dev/vdb  hdd   7c2a7ebe-c6b3-400f-9   100G  No         32s ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph02  /dev/vdc  hdd   f4a5ce60-7eb5-4f77-b  20.0G  No         32s ago    Has a FileSystem, LVM detected                                           
ceph03  /dev/vdb  hdd   fa38bf8a-d8bc-4aca-b   100G  No         37s ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
ceph03  /dev/vdc  hdd   492ab205-58af-436c-8  20.0G  No         37s ago    Has a FileSystem, Insufficient space (<10 extents) on vgs, LVM detected  
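The comparison above can also be done mechanically. The following is a minimal sketch, not part of the original procedure: it scans the text output of `ceph orch device ls` for the REFRESHED column and reports rows older than a threshold. The function name `stale_devices`, the 300-second default, and the handling of the `d` and `w` suffixes are assumptions for illustration; only `s`, `m`, and `h` appear in the output shown here.

```python
import re

# Seconds per unit suffix. Only s/m/h appear in the sample output;
# d and w are assumed to follow the same "<number><unit> ago" pattern.
UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400, "w": 604800}
AGE_RE = re.compile(r"\b(\d+)([smhdw]) ago\b")


def stale_devices(output, max_age_seconds=300):
    """Return (host, path, age_seconds) for rows older than max_age_seconds."""
    stale = []
    for line in output.splitlines():
        match = AGE_RE.search(line)
        if match is None:
            continue  # header, shell prompt, or blank line
        fields = line.split()
        host, path = fields[0], fields[1]
        age = int(match.group(1)) * UNIT_SECONDS[match.group(2)]
        if age > max_age_seconds:
            stale.append((host, path, age))
    return stale


# Two rows copied from the second run of the command.
SAMPLE = """\
HOST    PATH      TYPE  DEVICE ID              SIZE  AVAILABLE  REFRESHED  REJECT REASONS
ceph01  /dev/vdc  hdd   ace8da2b-a227-4e05-b  20.0G  No         44h ago    Has a FileSystem, LVM detected
ceph02  /dev/vdc  hdd   f4a5ce60-7eb5-4f77-b  20.0G  No         32s ago    Has a FileSystem, LVM detected
"""

for host, path, age in stale_devices(SAMPLE):
    print(f"{host} {path} last refreshed {age}s ago")
```

Run this against output captured a few minutes after the `--refresh` call, so that healthy hosts have had time to report back.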

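2 Run ceph-volume inventory

ceph-volume inventory lists the disks on the current node, and it reports a disk on that node that is in an abnormal state. In the run below it aborts with a RuntimeError naming a device-mapper path that can no longer be found.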

[ceph: root@ceph03 /]# ceph-volume inventory 
-->  RuntimeError: /dev/mapper/ceph--405d56c7--ae1e--4caf--b6c4--a08e3d5baf5d-osd--block--9e1d9631--4d64--4dd7--9394--a725481513bc not found.
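  1. Check the /var/log/ceph/ceph-fid/ceph-volume.log log
    ceph-volume.log records disk-level events. (ceph-fid in the path appears to stand for the directory named after the cluster fsid.)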

    [2025-12-25 06:44:48,592][ceph_volume][ERROR ] exception caught by decorator
    Traceback (most recent call last):
    File "/usr/lib/python3.9/site-packages/ceph_volume/decorators.py", line 59, in newfunc
    return f(*a, **kw)
    File "/usr/lib/python3.9/site-packages/ceph_volume/main.py", line 166, in main
    terminal.dispatch(self.mapper, subcommand_args)
    File "/usr/lib/python3.9/site-packages/ceph_volume/terminal.py", line 194, in dispatch
    instance.main()
    File "/usr/lib/python3.9/site-packages/ceph_volume/inventory/main.py", line 58, in main
    self.format_report(Devices(filter_for_batch=self.args.filter_for_batch,
    File "/usr/lib/python3.9/site-packages/ceph_volume/util/device.py", line 43, in __init__
    sys_info.devices = disk.get_devices()
    File "/usr/lib/python3.9/site-packages/ceph_volume/util/disk.py", line 831, in get_devices
    block[1] = UdevData(block[1]).slashed_path
    File "/usr/lib/python3.9/site-packages/ceph_volume/util/disk.py", line 1372, in __init__
    raise RuntimeError(f'{path} not found.')
    RuntimeError: /dev/mapper/ceph--405d56c7--ae1e--4caf--b6c4--a08e3d5baf5d-osd--block--9e1d9631--4d64--4dd7--9394--a725481513bc not found.
    [2025-12-25 06:44:58,928][ceph_volume.main][INFO ] Running command: ceph-volume inventory
    [2025-12-25 06:44:58,934][ceph_volume.util.system][WARNING] Executable lvs not found on the host, will return lvs as-is
    [2025-12-25 06:44:58,934][ceph_volume.process][INFO ] Running command: lvs --noheadings --readonly --separator=";" -a --units=b --nosuffix -S -o lv_tags,lv_path,lv_name,vg_name,lv_uuid,lv_size
    [2025-12-25 06:44:58,969][ceph_volume.process][INFO ] stdout ceph.block_device=/dev/ceph-405d56c7-ae1e-4caf-b6c4-a08e3d5baf5d/osd-block-9e1d9631-4d64-4dd7-9394-a725481513bc,ceph.block_uuid=Do7Qhz-JMrD-XZZa-buzH-LSXl-Yhiw-lkTx0T,ceph.cephx_lockbox_secret=,ceph.cluster_fsid=c3dea8e6-cb42-11f0-a192-fa163e634a3a,ceph.cluster_name=ceph,ceph.crush_device_class=,ceph.encrypted=0,ceph.osd_fsid=9e1d9631-4d64-4dd7-9394-a725481513bc,ceph.osd_id=1,ceph.osdspec_affinity=None,ceph.type=block,ceph.vdo=0,ceph.with_tpm=0";"/dev/ceph-405d56c7-ae1e-4caf-b6c4-a08e3d5baf5d/osd-block-9e1d9631-4d64-4dd7-9394-a725481513bc";"osd-block-9e1d9631-4d64-4dd7-9394-a725481513bc";"ceph-405d56c7-ae1e-4caf-b6c4-a08e3d5baf5d";"Do7Qhz-JMrD-XZZa-buzH-LSXl-Yhiw-lkTx0T";"107369988096
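
To tie the missing /dev/mapper path in the traceback to the logical volume in the lvs output, it helps to undo the device-mapper naming: a single hyphen separates the volume group from the logical volume, and hyphens inside either name are doubled. The sketch below is illustrative only. The helper names `missing_devices` and `split_mapper_name` are hypothetical, and it assumes the log contains lines of the form shown above.

```python
import re

# Matches the RuntimeError lines that ceph-volume writes for a missing device.
MISSING_RE = re.compile(r"RuntimeError: (/dev/mapper/\S+) not found\.")


def missing_devices(log_text):
    """Return the distinct /dev/mapper paths reported as not found, in order."""
    seen = []
    for match in MISSING_RE.finditer(log_text):
        path = match.group(1)
        if path not in seen:
            seen.append(path)
    return seen


def split_mapper_name(mapper_path):
    """Split /dev/mapper/<vg>-<lv> into (vg, lv), undoing the doubled hyphens."""
    name = mapper_path.rsplit("/", 1)[-1]
    # The VG/LV separator is the only hyphen with no hyphen on either side.
    vg, lv = re.split(r"(?<!-)-(?!-)", name, maxsplit=1)
    return vg.replace("--", "-"), lv.replace("--", "-")


# Error line copied from the log excerpt.
LOG = (
    "RuntimeError: /dev/mapper/ceph--405d56c7--ae1e--4caf--b6c4--a08e3d5baf5d"
    "-osd--block--9e1d9631--4d64--4dd7--9394--a725481513bc not found.\n"
)

for path in missing_devices(LOG):
    vg, lv = split_mapper_name(path)
    print(f"VG: {vg}")
    print(f"LV: {lv}")
```

For the error line in this log, the decoded names are ceph-405d56c7-ae1e-4caf-b6c4-a08e3d5baf5d and osd-block-9e1d9631-4d64-4dd7-9394-a725481513bc. These are the vg_name and lv_name that the lvs output reports for ceph.osd_id=1, so the failing device is the block volume backing that OSD.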
