[OpenIndiana-discuss] Hanging zfs commands
Maurilio Longo
maurilio.longo at libero.it
Fri Aug 3 08:12:33 UTC 2012
Hi,
I'm using oi_151a4.
This system is in production right now, so I cannot recreate the problem.
What I was doing was running a script via cron to take a recursive snapshot
and then removing old ones when there were enough of them.
In my case it was hanging daily or every other day.
Maurilio.
Rich wrote:
> I'd suggest trying to get a trace of where in the kernel it's blocking
> so that the deadlock can be found and fixed.
>
> What version of OI is this?
>
> - Rich
>
> On Thu, Aug 2, 2012 at 5:19 AM, Maurilio Longo <maurilio.longo at libero.it> wrote:
>> Hans,
>>
>> I've seen the same problem when using zfs list with -d n, so that it just goes
>> down some depth (I was using 1 as well).
>>
>> I've solved it doing a zfs list without -d and sorting and grepping the result
>> for what I need.
>>
>> Slower, but it did not hang anymore.
>>
>> You need to reboot, though, I've not found any other way to kill the hanging
>> process.
>>
>> Maurilio.
>>
>> Hans Joergensen wrote:
>>> Hey,
>>>
>>> Somehow I've hit somekind of lock on one of my NAS-boxes....
>>>
>>> Output from ps;
>>> root 26707 1 0 09:10:16 ? 0:00 /usr/sbin/zfs destroy datastore1/vmware-nfs/zfsnas4-clientstore at snap-hourly-1-201
>>> root 26705 1 0 09:10:16 ? 0:00 /usr/sbin/zfs destroy datastore1/vmware-nfs/zfsnas4-clientstore at snap-hourly-1-201
>>> root 2583 1 0 11:40:52 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 12079 1 0 15:40:35 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 26706 1 0 09:10:16 ? 0:00 /usr/sbin/zfs destroy datastore1/vmware-nfs/zfsnas4-clientstore at snap-hourly-1-201
>>> root 22359 1 0 22:21:17 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 26708 1 0 09:10:17 ? 0:00 /usr/sbin/zfs destroy datastore1/vmware-nfs/zfsnas4-clientstore at snap-hourly-1-201
>>> root 22374 1 0 22:21:24 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 16677 1 0 18:06:38 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 22335 1 0 22:21:03 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 27981 1 0 09:40:57 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 22386 1 0 22:21:28 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 7165 1 0 13:40:48 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 29390 1 0 10:18:03 ? 0:00 /usr/sbin/zfs list -t snapshot -r datastore1/vmware-nfs
>>> root 3637 1 0 12:06:27 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 15999 1 0 17:40:43 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 20089 1 0 20:40:52 ? 0:00 /usr/sbin/zfs list -t snapshot -H -o name -d 1 -r datastore1/vmware-nfs/zfsnas4
>>> root 17500 1 0 18:42:03 ? 0:00 /usr/sbin/zfs list -t snapshot -r datastore1/vmware-nfs
>>>
>>>
>>> Any chance I can get around this without rebooting the machine? it's
>>> a production system with lots of VM's on it.. So that would be very
>>> annoying..
>>>
>>> I've tried solving the problem by killing the processes that spawned
>>> the zfs list and destoy commands, thats why they have 1 as parant
>>> process...
>>>
>>> Could the lock have happened because of PID 26707 and 26705 running
>>> at the same time?
>>>
>>> // Hans
>>>
>>> _______________________________________________
>>> OpenIndiana-discuss mailing list
>>> OpenIndiana-discuss at openindiana.org
>>> http://openindiana.org/mailman/listinfo/openindiana-discuss
>>>
>> --
>> __________
>> | | | |__| Maurilio Longo
>> |_|_|_|____| farmaconsult s.r.l.
>>
>>
>>
>> _______________________________________________
>> OpenIndiana-discuss mailing list
>> OpenIndiana-discuss at openindiana.org
>> http://openindiana.org/mailman/listinfo/openindiana-discuss
>
> _______________________________________________
> OpenIndiana-discuss mailing list
> OpenIndiana-discuss at openindiana.org
> http://openindiana.org/mailman/listinfo/openindiana-discuss
>
--
__________
| | | |__| Maurilio Longo
|_|_|_|____| farmaconsult s.r.l.
More information about the OpenIndiana-discuss
mailing list