[OpenIndiana-discuss] LSI 3GB HBA SAS Errors (and other misc)
wrwehler at gmail.com
Fri Dec 2 01:28:32 UTC 2011
While diagnosing my SAN failure last week, we thought we had found a backplane failure because of the high error counts reported by 'lsiutil'. However, even with a new backplane, and after ruling out failed cards (MPXIO or single card) and bad cables, I'm still seeing the error count in lsiutil increment. I have no disks attached to the array right now, so I've ruled those out as well.
Even with nothing connected but the HBA to the backplane expander, simply rebooting the SAN into an OpenIndiana LiveCD or another distribution (NexentaStor) increments the counter.
I've been as careful as I can to clear the counter after each parts change, to try to eliminate a potentially bad cable, card, etc. Phys 8-15 throw errors regardless of whether I use MPXIO or a single-card config, and regardless of which expander port I use on the backplane.
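For anyone who wants to do the same before/after comparison without eyeballing it, here is a minimal sketch of the bookkeeping. It assumes you have already copied the per-phy counters by hand into dictionaries; it does not parse lsiutil output, and the counter name `invalid_dword` and the numbers in the usage note are placeholders, not values from my system.

```python
from typing import Dict

# Hypothetical layout: phy number -> {counter name -> count}.
PhyCounters = Dict[int, Dict[str, int]]


def incremented_phys(before: PhyCounters, after: PhyCounters) -> PhyCounters:
    """Return only the phys and counters whose values increased.

    A counter missing from the 'before' snapshot is treated as 0,
    which matches a snapshot taken right after clearing the counters.
    """
    changed: PhyCounters = {}
    for phy, counters in after.items():
        baseline = before.get(phy, {})
        delta = {}
        for name, value in counters.items():
            diff = value - baseline.get(name, 0)
            if diff > 0:
                delta[name] = diff
        if delta:
            changed[phy] = delta
    return changed
```

Usage would look like `incremented_phys({8: {"invalid_dword": 0}}, {8: {"invalid_dword": 3}})`, which reports phy 8 with an increase of 3. Taking one snapshot right after clearing and one after each reboot or parts swap makes it easier to see whether the same phys are affected every time.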
According to my VAR, something in the mptsas code changed "recently" (I'm not sure how recently that means), and they do not see these problems with 6GB backplanes and adapters.
Attached is a log I took through NexentaStor 3.1.1 with my disks still attached. The disks themselves don't seem to be throwing errors, so that's good.
Has anyone seen anything like this? I have not tried to boot into an older version of Solaris or NexentaStor yet, but booting into Scientific Linux 6.1 yields about the same results with lsiutil.
Nothing in fmadm, /var/adm/messages, or anywhere else indicates these data errors; they show up only in lsiutil.