Re: [vserver] [resend] vs2.2.0.x and scheduler getting stuck

From: Daniel Hokka Zakrisson <daniel_at_hozac.com>
Date: Sat 17 May 2008 - 16:20:47 BST
Message-ID: <56269.192.168.102.6.1211037647.squirrel@intranet>

Grzegorz Nosek wrote:
> Hi all,
>
> (sorry if you receive this message twice)
>
> I've been experiencing some weird hangs (boom and it's dead, no panic,
> not even a softlockup or lockdep warning). It happens about once a month
> (of course, in production only), usually under some load (I suspected
> I/O but it might be simple CPU usage too). It happened on several very
> different machines (2-way pentium3, 4-way opteron 270).
>
> After enabling the nmi watchdog I could trace it back to schedule(), or
> at least the nmi watchdog felt the need to kill the machine right in the
> middle of a schedule() call.

So I guess it can happen on vanilla kernels as well... This was noticed
and fixed on PlanetLab, and the patches have already been sent upstream (see
http://vserver.13thfloor.at/Experimental/delta-sched-fix0{4,5,6}.diff) for
inclusion in the next release.

-- 
Daniel Hokka Zakrisson
Received on Sat May 17 16:21:05 2008