[OpenIndiana-discuss] Diagnosis help needed
michelle
michelle at msknight.com
Mon Jun 25 06:30:55 UTC 2012
Came back to it this morning, there had been errors. However, now the
system has crashed.
It was responding to console, but when I asked for a zpool status, the
terminal froze.
I had another terminal that was already logged on, and gave an init 6,
but the server didn't respond. It's just sitting there.
---
The end result was that I had to hit the power again.
Reboot and log on gave zpool result of...
mich at jaguar:~# zpool status
pool: data
state: ONLINE
scan: scrub in progress since Sun Jun 24 18:00:28 2012
1.38T scanned out of 5.03T at 177K/s, (scan is slow, no estimated time)
3.22M repaired, 27.53% done
config:
NAME STATE READ WRITE CKSUM
data ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
c2t2d0 ONLINE 0 0 0
c2t3d0 ONLINE 0 0 0 (repairing)
c2t4d0 ONLINE 0 0 0
errors: No known data errors
pool: rpool
state: ONLINE
scan: scrub repaired 0 in 0h1m with 0 errors on Sat Jun 9 00:01:26 2012
config:
NAME STATE READ WRITE CKSUM
rpool ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
c2t0d0s0 ONLINE 0 0 0
c2t1d0s0 ONLINE 0 0 0
errors: No known data errors
.. and this is the messages at about the time of the crash...
Jun 24 21:20:55 jaguar ahci: [ID 657156 kern.warning] WARNING: ahci0:
error recovery for port 3 succeed
Jun 24 21:20:58 jaguar ahci: [ID 296163 kern.warning] WARNING: ahci0:
ahci port 3 has task file error
Jun 24 21:20:58 jaguar ahci: [ID 687168 kern.warning] WARNING: ahci0:
ahci port 3 is trying to do error recovery
Jun 24 21:20:58 jaguar ahci: [ID 693748 kern.warning] WARNING: ahci0:
ahci port 3 task_file_status = 0x4041
Jun 24 21:21:00 jaguar ahci: [ID 657156 kern.warning] WARNING: ahci0:
error recovery for port 3 succeed
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info]
/pci at 0,0/pci1458,b005 at 1f,2:
Jun 24 21:21:27 jaguar SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info]
/pci at 0,0/pci1458,b005 at 1f,2:
Jun 24 21:21:27 jaguar SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info]
/pci at 0,0/pci1458,b005 at 1f,2:
Jun 24 21:21:27 jaguar SATA port 3 error
Jun 24 21:21:27 jaguar sata: [ID 801845 kern.info]
/pci at 0,0/pci1458,b005 at 1f,2:
Jun 24 21:21:27 jaguar SATA port 3 error
Jun 24 21:21:42 jaguar fmd: [ID 377184 daemon.error] SUNW-MSG-ID:
ZFS-8000-FD, TYPE: Fault, VER: 1, SEVERITY: Major
Jun 24 21:21:42 jaguar EVENT-TIME: Sun Jun 24 21:21:42 BST 2012
Jun 24 21:21:42 jaguar PLATFORM: H55M-UD2H, CSN: -, HOSTNAME: jaguar
Jun 24 21:21:42 jaguar SOURCE: zfs-diagnosis, REV: 1.0
Jun 24 21:21:42 jaguar EVENT-ID: fbae1f99-ef10-ca46-c308-958e66bd0ddb
Jun 24 21:21:42 jaguar DESC: The number of I/O errors associated with a
ZFS device exceeded
Jun 24 21:21:42 jaguar acceptable levels. Refer to
http://illumos.org/msg/ZFS-8000-FD for more information.
Jun 24 21:21:42 jaguar AUTO-RESPONSE: The device has been offlined and
marked as faulted. An attempt
Jun 24 21:21:42 jaguar will be made to activate a hot spare if
available.
Jun 24 21:21:42 jaguar IMPACT: Fault tolerance of the pool may be
compromised.
Jun 24 21:21:42 jaguar REC-ACTION: Run 'zpool status -x' and replace the
bad device.
Jun 24 21:21:55 jaguar ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 3 satapkt 0xffffff020f879888 timed out
Jun 25 07:24:04 jaguar genunix: [ID 108120 kern.notice] ^MOpenIndiana
Build oi_151a4 64-bit (illumos 13676:98ca40df9171)
Jun 25 07:24:04 jaguar genunix: [ID 107366 kern.notice] SunOS Release
5.11 - Copyright 1983-2010 Oracle and/or its affiliates.
Jun 25 07:24:04 jaguar genunix: [ID 864463 kern.notice] All rights
reserved. Use is subject to license terms.
Jun 25 07:24:04 jaguar unix: [ID 223955 kern.info] x86_feature: lgpg
On the face of this, a drive failure has taken the system down.
I'm going to have to check the connections,
More information about the OpenIndiana-discuss
mailing list