[OpenIndiana-discuss] Mapping devices in OI to physical devices
real-men-dont-click at gmx.net
real-men-dont-click at gmx.net
Tue Jan 22 18:04:28 UTC 2013
-----Original message-----
From: Len Zaifman <leonardz at sickkids.ca>
Sent: Tue 22-01-2013 18:29
Subject: [OpenIndiana-discuss] Mapping devices in OI to physical devices
To: openindiana-discuss at openindiana.org;
> We have just had a major system meltdown and it took several days to fix.
>
> What we would have liked is 2 things we had on thumpers (Old SUN ZFS systems)
>
> 1) A tool to show the mapping of a solaris device name to a physical location
> 2) A tool to turn on the light on a disk via its solaris device name.
>
> The process below is too painful, and we have other devices whose disks may go
> bad. Does either 1 or 2 above exist in openindiana? I could not find it, if it
> does.
>
> Thanks.
>
> The issue was:
>
> OI (OpenIndiana Development oi_151a X86) reported:
>
>
> Jan 22 10:57:43 archivea scsi: [ID 107833 kern.warning] WARNING:
> /pci at 7a,0/pci8086,3408 at 1/pci1000,3040 at 0 (mpt_sas10):
> Jan 22 10:57:43 archivea Disconnected command timeout for Target 18
> Jan 22 10:57:43 archivea scsi: [ID 365881 kern.info]
> /pci at 7a,0/pci8086,3408 at 1/pci1000,3040 at 0 (mpt_sas10):
> Jan 22 10:57:43 archivea Log info 0x31140000 received for target 18.
> Jan 22 10:57:43 archivea scsi_status=0x0, ioc_status=0x8048,
> scsi_state=0xc
>
> zfs performance went through the floor and was intolerable(< 1 mb/sec where we
> had hundreds of MB/sec for resilver/scrubs and 100 MB/sec through the
> filesystem).
>
> The defective disk was one of 45 disks in a Supermicro Jbod system
> (SC847E26-RJBOD1)
>
> We finally found which disk it was by comparing serial numbers reported by
> iostat, disks that reported errors and the actual disk serial number (we pulled
> all 45 disks out to do this mapping). we do not want to repeat this process for
> our other devices.
>
>
>
Hi Len,
we had similar problems in the past, so I did some (google) research for already available tools addressing the issue. What I came across was the following:
dskinfo (quite powerfull and maybe a good bet for you): http://sparcv9.blogspot.de/2011/06/solaris-dskinfo-utility.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed:+UnixSystem+(It's+a+UNIX+system!)
vhci_stat (interesting if you are using mpxio): http://southbrain.com/south/software/vhci-stat.html
What we always use to identify a disk (flash the led) is something like:
dd if=/dev/dsk/$DISK of=/dev/null
in S11express fmdump shows the affected enclosure slot with SuperMicro JBODs as well (didn't have a broken disk in an OI installation so far, so I don't know if OI works the same way):
Oct 15 16:16:24.1188 0bf2d7d3-4fcf-ee05-d3ed-dc8ffc0ec433 FMD-8000-4M Repaired
100% fault.io.disk.predictive-failure Repair Attempted
Problem in: hc://:product-id=LSI-CORP-SAS2X36:server-id=:chassis-id=5003048000992b7f:serial=3SL0RFGF00009103VC79:part=SEAGATE-ST3600057SS:revision=0006/ses-enclosure=0/bay=3/disk=0
Affects: dev:///:devid=id1,sd@n5000c50023f6af03//scsi_vhci/disk@g5000c50023f6af03
FRU: hc://:product-id=LSI-CORP-SAS2X36:server-id=:chassis-id=5003048000992b7f:serial=3SL0RFGF00009103VC79:part=SEAGATE-ST3600057SS:revision=0006/ses-enclosure=0/bay=3/disk=0
Location: Slot 04
What I'm missing , too is a completely integrated tool for that....
cu
Carsten
More information about the OpenIndiana-discuss
mailing list