[OpenIndiana-discuss] disconnected drives, how to avoid in the future?

Maurilio Longo maurilio.longo at libero.it
Tue Jan 10 15:13:04 UTC 2012


Geoff,

I've hit this problem several times in the past, with OpenSolaris and then
with OpenIndiana.

There are, to my knowledge, no available solutions, it is so by design!

If a disk stops responding the pool waits until after it responds again
(sometimes pulling it out of its slot and then reinserting the disk causes a
reset of the link and it starts working again).

I was not able to assess what happens if I set failmode to continue.

I think it could be no better since you still cannot write to the pool.

This is IMHO the biggest problem of ZFS, in that I cannot instruct it to stop
using a failed device if it has some level of redundancy still available.

Wait is OK only if an entire vdev stops responding, not if a disk in a vdev
with redundancy has problems either fatal or transitory.

Best regards.

Maurilio.


PS. Using server grade disks (those with TLER) makes it possibile to overcome
this problem for transitory errors.


Geoff Nordli wrote:

> Part of my concern is why one disk would have completely brought down
> the system.  I have seen this come up on the list before, but I don't
> remember any resolutions to fixing it.
> 
> Anyone have any clues to try to prevent this from happening in the future?
> 
> thanks,
> 
> Geoff
> 

-- 
 __________
|  |  | |__| Maurilio Longo
|_|_|_|____| farmaconsult s.r.l.





More information about the OpenIndiana-discuss mailing list