I am hoping someone can help me, even if only with basic troubleshooting steps, since I am starting to get baffled.
I have a set of about five T5-2 servers. Of those, three currently have the fault light illuminated on the front of the server. When I check any of them, though, I cannot see what is actually turning on the light (i.e. nothing appears to be showing a "fault still detected"). Taking just one of the three as an example (all three show the same behavior outlined below):
- Starting with the ILOM interface, "Open Problems" shows nothing:
-> show /System/Open_Problems
Open Problems (0)
Date/Time Subsystems Component
------------------------ ------------------ ------------
- Looking at faultmgmt from the ILOM side shows:
-> start /sp/faultmgmt/shell
Are you sure you want to start /SP/faultmgmt/shell (y/n)? y
faultmgmtsp> fmadm faulty
No faults found
- And finally, fmadm from inside Solaris 11 shows (everything is "solved"):
# fmadm faulty | grep -i status
Problem Status : solved
Status : faulted but still in service
Problem Status : solved
Status : faulted but still in service
Problem Status : solved
Status : faulted but still in service
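To make that grep output easier to read, here is a minimal sketch that pairs each "Problem Status" line with the "Status" lines that follow it. The helper name pair_fault_status is hypothetical (it is not a Solaris or ILOM command), and it assumes the "label : value" layout shown above. On Solaris the legacy /usr/bin/awk may need to be swapped for nawk or /usr/xpg4/bin/awk.

```shell
# pair_fault_status: hypothetical helper, not part of Solaris or ILOM.
# Reads `fmadm faulty` text on stdin and prints one line per "Status"
# entry, prefixed with the most recent "Problem Status" value.
# Assumes each line has the form "label : value" as in the output above.
pair_fault_status() {
    awk -F' *: *' '
        # Remember the case-level status (e.g. "solved").
        /^[ \t]*Problem Status/ { problem = $2; next }
        # Print each resource-level status next to it.
        /^[ \t]*Status/ { print "problem=" problem " | status=" $2 }
    '
}
```

Usage would be `fmadm faulty | pair_fault_status`. Every pair above would come out as `problem=solved | status=faulted but still in service`, which shows the mismatch directly: the case is closed, but the resource is still flagged as faulted.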
So is there somewhere else I can look to find out which fault may still be present? I have also tried to simply "clear the fault" by several different methods, up to and including "set /SYS/ clear_fault_action=true" (and no /SYS/XXX_FAULT target shows a fault either).
So I am really scratching my head. Updating firmware/OS is mostly out of the question given the locations of most of these servers, but the versions of each are:
-> version
SP firmware 3.2.1.9.b
SP firmware build number: 90525
SP firmware date: Fri Jul 11 14:18:00 PDT 2014
SP filesystem version: 0.2.7
and
# pkg list entire
NAME (PUBLISHER) VERSION IFO
entire (solaris) 0.5.11-0.175.2.7.0.4.0 i--
Is there a way to reset the light, or to have it re-poll status and report the actual error? Any help would be appreciated.
Thanks.