ORACLE ODA一体机存储节点电源故障的分析处理

近期,某用户的ORACLE ODA一体机在例行机房巡检时出现亮黄灯告警;用户反馈次问题后我们立刻通过远程方式,登陆ODA的控制台进行查看;

对于ODA一体机(2个计算节点+1个存储节点),计算节点可以通过ilom管理界面登陆进行详细的硬件信息查看和管理,当然通过命令行也可以。

对于存储节点,是没有图形界面可以看,可以通过ODA管理台(7093/mgmt/index.html)或者命令查看;

本次问题查看为存储节点的1个电源故障,由于双电源配置,系统仍然可以正常工作;并且电源的更好工作是可以在线进行的。

如下为排查分析过程:

1、故障灯及系统中查看故障原因

root@TEST2 \~\]# odaadmcli show enclosure NAME SUBSYSTEM STATUS METRIC _FAN0 Cooling OK 4910 rpm _FAN1 Cooling OK 4540 rpm _FAN2 Cooling OK 4920 rpm _FAN3 Cooling OK 4530 rpm _IOM0 Encl_Electronics OK - _IOM1 Encl_Electronics OK - _PSU0 Power_Supply Critical - ===\>\>\>显示故障 _PSU1 Power_Supply OK - _TEMP0 Amb_Temp OK 23 C _TEMP1 Midplane_Temp OK 22 C _TEMP2 PCM0_Inlet_Temp OK 30 C _TEMP3 PCM0_Hotspot_Temp OK 24 C _TEMP4 PCM1_Inlet_Temp OK 42 C _TEMP5 PCM1_Hotspot_Temp OK 39 C _TEMP6 IOM0_Temp OK 22 C _TEMP7 IOM1_Temp OK 22 C 4 、更换电源(可以先尝试插拔电源线,电源线松动是可能的,插拔后也可能就恢复了) 更换的步骤MOS文档(How to confirm power supply status about storage shelf on ODA X7-2 (Doc ID 2419846.1),How To Replace an ODA (Oracle Database Appliance) X6-2HA, X7-2HA, X8-2HA, X9-2HA DE3-24C Power Supply/Cooling Unit \[VCAP\] (Doc ID 2960220.1))有视频和步骤,没有特殊的难度,参考如下: WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE?: 1. Locate the PSU by amber LED The following LEDs are lit when a power supply fault is detected: \* Front and rear Service Required LEDs \* Rear PS Failure LED on the bezel of the server \* Failure LED on the faulty power supply 2. Verify the PSU part number in the System Handbook and re-confirm. 3. Removing the PSU as follows. 3.1 Clear access to the PSU of any cables harnesses or assemblies. 3.2 Ensure the PSU On/Off switch is in the 'Off' Position. 3.3 Disconnect the power cord tie strap from the power cord, and unplug the power cord from the PSU. 3.4 Remove installed PSU by, Grasping the PSU handle, push the release button and slide out PSU. 4. Installing the Power Supply as follows or use the "online" Help Guide. 4.1 On the replacement PSU verify that the Release button is open . 4.2 Align PSU with empty bay in chassis and slide in . 4.3 Push the lever fully closed until you hear or feel a click. 4.4 Connect AC power cord to new PSU. Use the power cord retaining clips. 4.4 If required , place cable harness or assemblies back into normal position. 4.5 Turn the On/OFF switch to the On position . 5. Verify the replacement by checking for Green LED IMPORTANT NOTE : PSUs have a 3 minute Service time limit . When you remove a PSU the fans on the remaining PSU go to 100 % duty cycle . Testing has shown that HDD temperatures can exceed their operating temperature when a PSU has been removed for 3 minutes. 5、检查最终状态(注意次命令的输出,ODA 的2个计算节点的的输出是不一致的,简单说是检测到恢复正常是有时间差的,如节点1显示OK,节点2可能过几分钟才显示OK) \[root@TEST2 \~\]# odaadmcli show enclosure NAME SUBSYSTEM STATUS METRIC _FAN0 Cooling OK 4910 rpm _FAN1 Cooling OK 4540 rpm _FAN2 Cooling OK 4910 rpm _FAN3 Cooling OK 4540 rpm _IOM0 Encl_Electronics OK - _IOM1 Encl_Electronics OK - _PSU0 Power_Supply OK - _PSU1 Power_Supply OK - _TEMP0 Amb_Temp OK 23 C _TEMP1 Midplane_Temp OK 22 C _TEMP2 PCM0_Inlet_Temp OK 29 C _TEMP3 PCM0_Hotspot_Temp OK 24 C _TEMP4 PCM1_Inlet_Temp OK 41 C _TEMP5 PCM1_Hotspot_Temp OK 39 C _TEMP6 IOM0_Temp OK 22 C _TEMP7 IOM1_Temp OK 28 C

相关推荐
亿坊电商2 小时前
PHP后端项目中多环境配置管理:开发、测试、生产的优雅解决方案!
服务器·数据库·php
韩立学长2 小时前
基于Springboot的影视评论网站的设计与实现58py6238(程序、源码、数据库、调试部署方案及开发环境)系统界面展示及获取方式置于文档末尾,可供参考。
数据库·spring boot·后端
未来之窗软件服务3 小时前
未来之窗昭和仙君(四十七)开发商品进销存——东方仙盟筑基期
数据库·进销存·仙盟创梦ide·东方仙盟·昭和仙君·东方仙盟架构
IDOlaoluo4 小时前
TinyRDM 1.2.3 Windows版安装教程(附Redis客户端下载及详细步骤)
数据库·redis·缓存
小光学长4 小时前
基于微信小程序的背单词系统x1o5sz72(程序+源码+数据库+调试部署+开发环境)带论文文档1万字以上,文末可获取,系统界面在最后面。
数据库·微信小程序·小程序
我命由我123456 小时前
Derby - Derby 服务器(Derby 概述、Derby 服务器下载与启动、Derby 连接数据库与创建数据表、Derby 数据库操作)
java·运维·服务器·数据库·后端·java-ee·后端框架
RestCloud7 小时前
达梦数据库到Greenplum:用ETL工具实现数据仓库迁移
数据库·数据仓库·etl·达梦数据库·数据传输·greenplum
Boilermaker19928 小时前
【Redis】集群与分布式缓存
java·数据库·redis·1024程序员节
武子康8 小时前
Java-163 MongoDB 生产安全加固实战:10 分钟完成认证、最小权限、角色详解
java·数据库·分布式·mongodb·性能优化·系统架构·nosql
zhangyifang_0098 小时前
PostgreSQL 的表继承与分区
数据库·postgresql