Re: [vserver] Kernel 3.18.7 hangs when inserting netconsle module on DELL M620 VRTX Blade

From: Corey Wright <undefined_at_pobox.com>
Date: Thu 09 Apr 2015 - 17:35:24 BST
Message-Id: <20150409113524.fa4dd8e1bc2bbc454b146b1c@pobox.com>

On Thu, 09 Apr 2015 14:46:30 +0200
Urban Loesch <bind@enas.net> wrote:

> Hi,
>
> for the records:
>
> Compiling the kernel without CONFIG_NET_RX_BUSY_POLL solves the issue
> and it's not vserver related.

based on your lkml thread [1] i wouldn't say "solves", but "works
around". ;)

[1] https://www.mail-archive.com/linux-kernel@vger.kernel.org/msg863876.html

sorry we couldn't solve your problem, but glad we could help by
process of elimination (ie not linux-vserver).

thanks for following up!

corey

--
undefined@pobox.com
> Thanks
> Urban
> 
> 
> Am 08.04.2015 um 09:33 schrieb Urban Loesch:
> > Hi,
> > 
> > sorry for the delay, but compiling 2 kernel takes a while :-)
> > 
> >>
> >> https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/?h=linux-3.18.y&id=e3b175f60e9c79f559d14f76590a701b62c583c4
> >>
> >> that commit is highly suspect (as solving your problem) because it was
> >> committed to 3.18.8, while you are running 3.18.7, and it involves polling
> >> in the bnx2x driver, while bnx2x_poll is in your stack trace.  that and
> >> netconsole works fine for me here on 3.18.11-vs2.3.7.4 (in my linux-vserver
> >> test virtualbox vm; i don't use netconsole).
> >>
> >> if you are running 3.18.7 because that's the latest 3.18 version a
> >> linux-vserver patch was specifically released for, then i can assure you,
> >> though you might have some patch hunks apply with offsets (and maybe even a
> >> little fuzzing; can't remember), it works just fine applied to later 3.18
> >> releases (for others and myself; if not, then i usually email a fix-up patch
> >> to the mailing list).
> >>
> >> for future reference, please try to replicate the problem on a
> >> non-vserver-patched kernel (or if you did that already, then please state
> >> it), to help isolate it to linux-vserver.
> > 
> > I tried now two new compiled kernels.
> > * 3.18.7 without vserver patch:
> > SAME PROBLEM!
> > 
> > Here comes the trace:
> > 
> > Apr  8 09:11:57 server2 kernel: [   42.462568] ------------[ cut here ]------------
> > Apr  8 09:11:57 server2 kernel: [   42.462573] WARNING: CPU: 9 PID: 2955 at kernel/softirq.c:146 __local_bh_enable_ip+0x72/0xa0()
> > Apr  8 09:11:57 server2 kernel: [   42.462583] Modules linked in: netconsole(+) configfs sb_edac edac_core shpchp dcdbas iTCO_wdt ipmi_si
> > iTCO_vendor_support pcspkr ipmi_msghandler tpm_tis lpc_ich tpm joydev acpi_pad evbug hed hid_generic usbmouse usbkbd usbhid hid ahci libahci
> > megaraid_sas bnx2x ptp pps_core mdio
> > Apr  8 09:11:57 server2 kernel: [   42.462585] CPU: 9 PID: 2955 Comm: modprobe Not tainted 3.18.7-em64t-efigpt #1
> > Apr  8 09:11:57 server2 kernel: [   42.462585] Hardware name: Dell Inc. PowerEdge M620/0NJVT7, BIOS 2.4.3 07/02/2014
> > Apr  8 09:11:57 server2 kernel: [   42.462587]  0000000000000009 ffff881fcc42f9e8 ffffffff8172194c 0000000019c919c9
> > Apr  8 09:11:57 server2 kernel: [   42.462588]  0000000000000000 ffff881fcc42fa28 ffffffff81051b2c ffffffff81f45e40
> > Apr  8 09:11:57 server2 kernel: [   42.462589]  0000000000000200 ffff881fcf808dd4 ffff881fcf808d58 0000000000000000
> > Apr  8 09:11:57 server2 kernel: [   42.462590] Call Trace:
> > Apr  8 09:11:57 server2 kernel: [   42.462596]  [<ffffffff8172194c>] dump_stack+0x46/0x58
> > Apr  8 09:11:57 server2 kernel: [   42.462598]  [<ffffffff81051b2c>] warn_slowpath_common+0x8c/0xc0
> > Apr  8 09:11:57 server2 kernel: [   42.462599]  [<ffffffff81051b7a>] warn_slowpath_null+0x1a/0x20
> > Apr  8 09:11:57 server2 kernel: [   42.462601]  [<ffffffff810557a2>] __local_bh_enable_ip+0x72/0xa0
> > Apr  8 09:11:57 server2 kernel: [   42.462604]  [<ffffffff817279bb>] _raw_spin_unlock_bh+0x1b/0x20
> > Apr  8 09:11:57 server2 kernel: [   42.462616]  [<ffffffffa0147f33>] bnx2x_poll+0x83/0x3c0 [bnx2x]
> > Apr  8 09:11:57 server2 kernel: [   42.462620]  [<ffffffff8164bb10>] netpoll_poll_dev+0x110/0x1b0
> > Apr  8 09:11:57 server2 kernel: [   42.462622]  [<ffffffff8164bd17>] netpoll_send_skb_on_dev+0x167/0x240
> > Apr  8 09:11:57 server2 kernel: [   42.462624]  [<ffffffff8164c0c2>] netpoll_send_udp+0x2d2/0x400
> > Apr  8 09:11:57 server2 kernel: [   42.462626]  [<ffffffffa008a85f>] write_msg+0xcf/0x110 [netconsole]
> > Apr  8 09:11:57 server2 kernel: [   42.462630]  [<ffffffff8109c6ab>] call_console_drivers.constprop.26+0x9b/0x100
> > Apr  8 09:11:57 server2 kernel: [   42.462632]  [<ffffffff8109d70a>] console_unlock+0x3ba/0x440
> > Apr  8 09:11:57 server2 kernel: [   42.462634]  [<ffffffff8109eaba>] register_console+0x29a/0x360
> > Apr  8 09:11:57 server2 kernel: [   42.462635]  [<ffffffffa0099000>] ? 0xffffffffa0099000
> > Apr  8 09:11:57 server2 kernel: [   42.462637]  [<ffffffffa00991c5>] init_netconsole+0x1c5/0x1000 [netconsole]
> > Apr  8 09:11:57 server2 kernel: [   42.462640]  [<ffffffff810002dc>] do_one_initcall+0x8c/0x1c0
> > Apr  8 09:11:57 server2 kernel: [   42.462643]  [<ffffffff81170202>] ? __vunmap+0xc2/0x110
> > Apr  8 09:11:57 server2 kernel: [   42.462645]  [<ffffffff810c798d>] load_module+0x1dbd/0x25b0
> > Apr  8 09:11:57 server2 kernel: [   42.462647]  [<ffffffff810c4170>] ? show_initstate+0x60/0x60
> > Apr  8 09:11:57 server2 kernel: [   42.462649]  [<ffffffff81729a9f>] ? page_fault+0x1f/0x30
> > Apr  8 09:11:57 server2 kernel: [   42.462651]  [<ffffffff810c821a>] SyS_init_module+0x9a/0xc0
> > Apr  8 09:11:57 server2 kernel: [   42.462652]  [<ffffffff81728192>] system_call_fastpath+0x12/0x17
> > Apr  8 09:11:57 server2 kernel: [   42.462653] ---[ end trace 4abd740e2bd6628a ]---
> > 
> > So I think this is kernel related, not linux-vserver.
> > 
> > * 3.18.11 with vserver patch vs2.3.7.4:
> > I got some Hunks during patching and I got the SAME error.
> > 
> > Apr  8 09:22:27 server2 kernel: [  216.788670] ------------[ cut here ]------------
> > Apr  8 09:22:27 server2 kernel: [  216.788676] WARNING: CPU: 11 PID: 2929 at kernel/softirq.c:147 __local_bh_enable_ip+0x72/0xa0()
> > Apr  8 09:22:27 server2 kernel: [  216.788687] CPU: 11 PID: 2929 Comm: modprobe Not tainted 3.18.11-vs2.3.7.4-rol-em64t-efigpt #1
> > Apr  8 09:22:27 server2 kernel: [  216.788688] Hardware name: Dell Inc. PowerEdge M620/0NJVT7, BIOS 2.4.3 07/02/2014
> > Apr  8 09:22:27 server2 kernel: [  216.788690]  0000000000000009 ffff881fcfaa39e8 ffffffff8174434a 0000000019af19af
> > Apr  8 09:22:27 server2 kernel: [  216.788690]  0000000000000000 ffff881fcfaa3a28 ffffffff81051fac ffffffff81f4a080
> > Apr  8 09:22:27 server2 kernel: [  216.788691]  0000000000000200 ffff881fcf624dd4 ffff881fcf624d58 0000000000000000
> > Apr  8 09:22:27 server2 kernel: [  216.788692] Call Trace:
> > Apr  8 09:22:27 server2 kernel: [  216.788696]  [<ffffffff8174434a>] dump_stack+0x46/0x58
> > Apr  8 09:22:27 server2 kernel: [  216.788698]  [<ffffffff81051fac>] warn_slowpath_common+0x8c/0xc0
> > Apr  8 09:22:27 server2 kernel: [  216.788699]  [<ffffffff81051ffa>] warn_slowpath_null+0x1a/0x20
> > Apr  8 09:22:27 server2 kernel: [  216.788701]  [<ffffffff81055fc2>] __local_bh_enable_ip+0x72/0xa0
> > Apr  8 09:22:27 server2 kernel: [  216.788704]  [<ffffffff8174a3cb>] _raw_spin_unlock_bh+0x1b/0x20
> > Apr  8 09:22:27 server2 kernel: [  216.788716]  [<ffffffffa00b8f43>] bnx2x_poll+0x83/0x3e0 [bnx2x]
> > Apr  8 09:22:27 server2 kernel: [  216.788720]  [<ffffffff81667de0>] netpoll_poll_dev+0x110/0x1b0
> > Apr  8 09:22:27 server2 kernel: [  216.788721]  [<ffffffff81667fe7>] netpoll_send_skb_on_dev+0x167/0x240
> > Apr  8 09:22:27 server2 kernel: [  216.788722]  [<ffffffff81668392>] netpoll_send_udp+0x2d2/0x400
> > Apr  8 09:22:27 server2 kernel: [  216.788724]  [<ffffffffa018685f>] write_msg+0xcf/0x110 [netconsole]
> > Apr  8 09:22:27 server2 kernel: [  216.788728]  [<ffffffff8109e32b>] call_console_drivers.constprop.27+0x9b/0x100
> > Apr  8 09:22:27 server2 kernel: [  216.788730]  [<ffffffff8109f39a>] console_unlock+0x3ca/0x450
> > Apr  8 09:22:27 server2 kernel: [  216.788731]  [<ffffffff810a073a>] register_console+0x29a/0x360
> > Apr  8 09:22:27 server2 kernel: [  216.788733]  [<ffffffffa0191000>] ? 0xffffffffa0191000
> > Apr  8 09:22:27 server2 kernel: [  216.788735]  [<ffffffffa01911c5>] init_netconsole+0x1c5/0x1000 [netconsole]
> > Apr  8 09:22:27 server2 kernel: [  216.788737]  [<ffffffff810002dc>] do_one_initcall+0x8c/0x1c0
> > Apr  8 09:22:27 server2 kernel: [  216.788740]  [<ffffffff81181042>] ? __vunmap+0xc2/0x110
> > Apr  8 09:22:27 server2 kernel: [  216.788743]  [<ffffffff810d7f8d>] load_module+0x1dbd/0x25b0
> > Apr  8 09:22:27 server2 kernel: [  216.788744]  [<ffffffff810d4770>] ? show_initstate+0x60/0x60
> > Apr  8 09:22:27 server2 kernel: [  216.788746]  [<ffffffff8174c49f>] ? page_fault+0x1f/0x30
> > Apr  8 09:22:27 server2 kernel: [  216.788747]  [<ffffffff810d881a>] SyS_init_module+0x9a/0xc0
> > Apr  8 09:22:27 server2 kernel: [  216.788749]  [<ffffffff8174ab72>] system_call_fastpath+0x12/0x17
> > Apr  8 09:22:27 server2 kernel: [  216.788750] ---[ end trace 224709e18793096d ]---
> > 
> > I could try 3.18.11 without linux-vserver patch. But I think that doesn't solve my problem.
> > 
> > Thanks
> > Urban
> > 
> >>
> >> corey
> >> --
Received on Thu Apr 9 17:35:42 2015
[Next/Previous Months] [Main vserver Project Homepage] [Howto Subscribe/Unsubscribe] [Paul Sladen's vserver stuff]
Generated on Thu 09 Apr 2015 - 17:35:42 BST by hypermail 2.1.8