[OpenIndiana-discuss] Hardware fault, drive or controller?

Matt Connolly matt.connolly.au at gmail.com
Fri Jul 15 21:18:31 UTC 2011


Hi all,

following my crash and boot failure during the week, I left my oi
machine off for a day or so to think about what it had done wrong.
After turning it back on and booting off the live oi-148 cd, I was
able to import my 2 pools, and did a scrub of the boot pool. There
were a few checksum errors, but everything recovered correctly.

I then rebooted back into the installation on the HDs and it was
looking good. I let it run doing a scrub of my data pool after
reimporting it, which was running fine for many hours.

However, it locked up again. The system was still running, (I could
use firefox and make network connections) but anything that interacted
with the data pool hung. `zpool status` hung, for example, and could
not be terminated with Ctrl+C.

I shut the machine down and saw lots of errors in /var/adm/messages like:

Jul 14 07:10:01 vault genunix: [ID 859416 kern.info]
ghd_timer_newstate: HBA reset failed hba 0x
ffffff01ca571e40 gcmdp 0xffffff01db863a00 gtgtp 0xffffff01cc10f6c0
Jul 14 07:10:01 vault scsi: [ID 107833 kern.warning] WARNING:
/pci at 0,0/pci8086,2448 at 1e/pci-ide at 0
/ide at 1 (ata1):
Jul 14 07:10:01 vault   timeout: HBA reset, target=1 lun=0
Jul 14 07:11:01 vault genunix: [ID 859416 kern.info]
ghd_timer_newstate: HBA reset failed hba 0x
ffffff01ca571e40 gcmdp 0xffffff01db863a00 gtgtp 0xffffff01cc10f6c0
Jul 14 07:11:01 vault scsi: [ID 107833 kern.warning] WARNING:
/pci at 0,0/pci8086,2448 at 1e/pci-ide at 0
/ide at 1 (ata1):
Jul 14 07:11:01 vault   timeout: HBA reset, target=1 lun=0


I've since completed a scrub on both pools with no errors, so I'm
fairly sure that it's not a dead hard drive.

Does this sound like a motherboard/controller failure?? Any other thoughts?

Thanks,
Matt



More information about the OpenIndiana-discuss mailing list