[OpenIndiana-discuss] Diagnosis help needed

michelle michelle at msknight.com
Mon Jun 25 06:30:55 UTC 2012


When I came back to it this morning there had been errors, and the 
system has now crashed.

It was still responding at the console, but when I asked for a zpool 
status, the terminal froze.

I had another terminal that was already logged on and issued an init 6, 
but the server didn't respond. It's just sitting there.

---

The end result was that I had to hit the power button again.

After rebooting and logging on, zpool status gave this result...

mich@jaguar:~# zpool status
   pool: data
  state: ONLINE
   scan: scrub in progress since Sun Jun 24 18:00:28 2012
     1.38T scanned out of 5.03T at 177K/s, (scan is slow, no estimated time)
     3.22M repaired, 27.53% done
config:

         NAME        STATE     READ WRITE CKSUM
         data        ONLINE       0     0     0
           raidz1-0  ONLINE       0     0     0
             c2t2d0  ONLINE       0     0     0
             c2t3d0  ONLINE       0     0     0  (repairing)
             c2t4d0  ONLINE       0     0     0

errors: No known data errors

   pool: rpool
  state: ONLINE
   scan: scrub repaired 0 in 0h1m with 0 errors on Sat Jun  9 00:01:26 2012
config:

         NAME          STATE     READ WRITE CKSUM
         rpool         ONLINE       0     0     0
           mirror-0    ONLINE       0     0     0
             c2t0d0s0  ONLINE       0     0     0
             c2t1d0s0  ONLINE       0     0     0

errors: No known data errors

... and these are the messages from around the time of the crash...

Jun 24 21:20:55 jaguar ahci: [ID 657156 kern.warning] WARNING: ahci0: error recovery for port 3 succeed
Jun 24 21:20:58 jaguar ahci: [ID 296163 kern.warning] WARNING: ahci0: ahci port 3 has task file error
Jun 24 21:20:58 jaguar ahci: [ID 687168 kern.warning] WARNING: ahci0: ahci port 3 is trying to do error recovery
Jun 24 21:20:58 jaguar ahci: [ID 693748 kern.warning] WARNING: ahci0: ahci port 3 task_file_status = 0x4041
Jun 24 21:21:00 jaguar ahci: [ID 657156 kern.warning] WARNING: ahci0: error recovery for port 3 succeed
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info] /pci@0,0/pci1458,b005@1f,2:
Jun 24 21:21:27 jaguar  SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info] /pci@0,0/pci1458,b005@1f,2:
Jun 24 21:21:27 jaguar  SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info] /pci@0,0/pci1458,b005@1f,2:
Jun 24 21:21:27 jaguar  SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info] /pci@0,0/pci1458,b005@1f,2:
Jun 24 21:21:27 jaguar  SATA port 3 error
Jun 24 21:21:42 jaguar fmd: [ID 377184 daemon.error] SUNW-MSG-ID: ZFS-8000-FD, TYPE: Fault, VER: 1, SEVERITY: Major
Jun 24 21:21:42 jaguar EVENT-TIME: Sun Jun 24 21:21:42 BST 2012
Jun 24 21:21:42 jaguar PLATFORM: H55M-UD2H, CSN: -, HOSTNAME: jaguar
Jun 24 21:21:42 jaguar SOURCE: zfs-diagnosis, REV: 1.0
Jun 24 21:21:42 jaguar EVENT-ID: fbae1f99-ef10-ca46-c308-958e66bd0ddb
Jun 24 21:21:42 jaguar DESC: The number of I/O errors associated with a ZFS device exceeded
Jun 24 21:21:42 jaguar       acceptable levels.  Refer to http://illumos.org/msg/ZFS-8000-FD for more information.
Jun 24 21:21:42 jaguar AUTO-RESPONSE: The device has been offlined and marked as faulted.  An attempt
Jun 24 21:21:42 jaguar       will be made to activate a hot spare if available.
Jun 24 21:21:42 jaguar IMPACT: Fault tolerance of the pool may be compromised.
Jun 24 21:21:42 jaguar REC-ACTION: Run 'zpool status -x' and replace the bad device.
Jun 24 21:21:55 jaguar ahci: [ID 517647 kern.warning] WARNING: ahci0: watchdog port 3 satapkt 0xffffff020f879888 timed out
Jun 25 07:24:04 jaguar genunix: [ID 108120 kern.notice] ^MOpenIndiana Build oi_151a4 64-bit (illumos 13676:98ca40df9171)
Jun 25 07:24:04 jaguar genunix: [ID 107366 kern.notice] SunOS Release 5.11 - Copyright 1983-2010 Oracle and/or its affiliates.
Jun 25 07:24:04 jaguar genunix: [ID 864463 kern.notice] All rights reserved. Use is subject to license terms.
Jun 25 07:24:04 jaguar unix: [ID 223955 kern.info] x86_feature: lgpg
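
To see whether the complaints all come from the same port, the ahci and 
SATA lines in a log like the one above could be tallied per port. This is 
only a rough sketch: the function name count_port_errors is made up for 
illustration, and the default path /var/adm/messages is an assumption 
about where these messages were copied from.

```shell
#!/bin/sh
# Sketch only: count ahci/SATA log lines per port number.
# count_port_errors is a hypothetical helper name, and the default
# log path is an assumption; pass another file as the first argument.
count_port_errors() {
    logfile="${1:-/var/adm/messages}"
    # Keep only lines from the ahci0 driver or the "SATA port N error"
    # lines, reduce each to "port N", then count identical results.
    sed -n \
        -e '/ahci0:/s/.*port \([0-9][0-9]*\).*/port \1/p' \
        -e '/SATA port/s/.*port \([0-9][0-9]*\).*/port \1/p' \
        "$logfile" | sort | uniq -c
}
```

Calling count_port_errors with no argument reads the default file and 
prints one line per port with a count in front. If every line names the 
same port, that at least narrows down which cable and drive to look at 
first.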

On the face of it, a drive failure has taken the system down.

I'm going to have to check the connections.



