Re: [vserver] I can't kill a processus in a vserver, I've dmesg error

From: Timothee Besset <ttimo_at_idsoftware.com>
Date: Wed 23 Dec 2009 - 14:22:27 GMT
Message-ID: <4B3227A3.7030906@idsoftware.com>

FWIW, I've run into the same kind of situation with NFS partitions. Once
the vserver host had oopsed, the only solution was to reboot the blade.
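
To see which tasks are stuck this way, a minimal sketch (assuming a Linux
/proc filesystem; the function names are hypothetical, not part of any
vserver tool) can list every process in uninterruptible sleep, shown as
state D. Such tasks do not act on signals, SIGKILL included, until the
kernel call they are blocked in returns. On a vserver host, a plain /proc
scan may only show processes visible from the current context.

```python
import os


def parse_stat_state(stat_line):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    open_paren = stat_line.index("(")
    close_paren = stat_line.rindex(")")
    pid = int(stat_line[:open_paren].strip())
    comm = stat_line[open_paren + 1:close_paren]
    state = stat_line[close_paren + 1:].split()[0]
    return pid, comm, state


def find_uninterruptible(proc_root="/proc"):
    """Return a sorted list of (pid, comm) for tasks whose state is 'D'."""
    stuck = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a per-process directory
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                pid, comm, state = parse_stat_state(f.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited meanwhile, or the file is unreadable
        if state == "D":
            stuck.append((pid, comm))
    return sorted(stuck)


if __name__ == "__main__":
    for pid, comm in find_uninterruptible():
        print(pid, comm)
```

If the same PID keeps showing up in state D long after an oops, it is most
likely blocked inside the kernel path that crashed.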

TTimo

KLEIN Stéphane wrote:
> Hi,
>
> In a vserver, I have a process in D state:
>
> $ vps aux | grep "my_vs1"
>
> stephane 572 40004 my_vs1 0.0 0.0 4704 1420 ? D
> 11:41 0:01 scp -r root@example.com:/home/foobar/www/ ./
> stephane 573 40004 my_vs1 0.2 0.0 0 0 ? H
> 11:41 0:15 [ssh]
>
> I can't kill this process :(
>
> In dmesg, I see this:
>
> [2220677.412013] BUG: unable to handle kernel paging request at 00020004
> [2220677.412048] IP: [<c1087710>] find_get_page+0x40/0x81
> [2220677.412080] *pde = 00000000
> [2220677.412102] Oops: 0000 [#9] SMP
> [2220677.412126] last sysfs file: /sys/devices/virtual/block/md1/md/mismatch_cnt
> [2220677.412155] Modules linked in: iptable_raw xt_comment xt_recent
> xt_policy ipt_ULOG ipt_REJECT ipt_REDIRECT ipt_NETMAP ipt_MASQUERADE
> ipt_LOG ipt_ECN ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp
> nf_nat_snmp_basic nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc
> nf_nat_h323 nf_nat_ftp nf_nat_amanda ts_kmp nf_conntrack_amanda
> nf_conntrack_tftp nf_conntrack_sip nf_conntrack_proto_sctp
> nf_conntrack_pptp nf_conntrack_proto_gre nf_conntrack_netlink
> nf_conntrack_netbios_ns nf_conntrack_irc nf_conntrack_h323
> nf_conntrack_ftp xt_tcpmss xt_pkttype xt_physdev xt_owner xt_NFQUEUE
> xt_NFLOG nfnetlink_log xt_multiport xt_MARK xt_mark xt_mac xt_limit
> xt_length xt_iprange xt_helper xt_hashlimit xt_DSCP xt_dscp xt_dccp
> xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY xt_tcpudp xt_state
> iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack
> iptable_mangle nfnetlink iptable_filter ip_tables x_tables dummy
> hwmon_vid i2c_viapro loop evdev snd_pcm snd_timer psmouse serio_raw
> snd soundcore snd_page_alloc i2c_i801 i2c_core pcspkr processor button
> intel_agp agpgart rng_core ext3 jbd mbcache raid1 md_mod sd_mod
> ide_pci_generic piix ide_core crc_t10dif ata_piix ata_generic ahci
> libata scsi_mod r8169 mii thermal fan thermal_sys [last unloaded:
> scsi_wait_scan]
> [2220677.412710]
> [2220677.412731] Pid: 558, comm: scp Tainted: G D
> (2.6.31.6-vs2.3.0.36.24-beng #1) 986LCD-M/mITX
> [2220677.412775] EIP: 0060:[<c1087710>] EFLAGS: 00210203 CPU: 0
> [2220677.412801] EIP is at find_get_page+0x40/0x81
> [2220677.412824] EAX: 0001ffff EBX: f6bc46c0 ECX: 00020000 EDX: 00000000
> [2220677.412850] ESI: f3df42f0 EDI: 00020004 EBP: f6bc4778 ESP: dfcefd24
> [2220677.412876] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
> [2220677.412902] Process scp (pid: 558, ti=dfcee000 task=f64f8df0
> task.ti=dfcee000)
> [2220677.412942] Stack:
> [2220677.412961] 00020000 00200296 f65ab400 00000444 f6bc477c
> f6bc46c0 f6bc4778 00000444
> [2220677.412998] <0> f6bc4778 c10879ea f6bc46c0 00000000 ffffffff
> c1087c4b 000000d0 00000444
> [2220677.413053] <0> 00000000 f6bc46c0 00000000 00444000 00000000
> f7e843f1 00445000 00000000
> [2220677.413118] Call Trace:
> [2220677.413141] [<c10879ea>] ? find_lock_page+0x10/0x43
> [2220677.413166] [<c1087c4b>] ? grab_cache_page_write_begin+0x30/0x88
> [2220677.413219] [<f7e843f1>] ? ext3_write_begin+0x79/0x1d2 [ext3]
> [2220677.413247] [<c10884b3>] ? generic_file_buffered_write+0xe8/0x27f
> [2220677.413287] [<c1088a8e>] ? __generic_file_aio_write_nolock+0x26c/0x294
> [2220677.413318] [<c10329d5>] ? current_fs_time+0x13/0x15
> [2220677.413344] [<c10892f9>] ? generic_file_aio_write+0x60/0xb7
> [2220677.413388] [<f7e800c8>] ? ext3_file_write+0x19/0x85 [ext3]
> [2220677.413421] [<c10ae891>] ? do_sync_write+0xc0/0x107
> [2220677.413447] [<c112dd35>] ? rb_insert_color+0x51/0xb5
> [2220677.413475] [<c1040ad2>] ? autoremove_wake_function+0x0/0x2d
> [2220677.413502] [<c10ae57c>] ? fsnotify_access+0x5a/0x61
> [2220677.413529] [<c10fd638>] ? security_file_permission+0xc/0xd
> [2220677.413555] [<c10ae7d1>] ? do_sync_write+0x0/0x107
> [2220677.413580] [<c10af1c5>] ? vfs_write+0x7e/0xd6
> [2220677.413604] [<c10af2b5>] ? sys_write+0x3c/0x63
> [2220677.413630] [<c10030e9>] ? syscall_call+0x7/0xb
> [2220677.413653] Code: 0c 8b 44 24 10 e8 cd 59 0a 00 85 c0 89 c6 74 44
> 8b 08 b8 ff ff ff ff f6 c1 01 0f 45 c8 8d 41 ff 83 f8 fd 89 0c 24 77
> d5 8d 79 04 <8b> 59 04 89 7c 24 04 85 db 74 c7 8d 6b 01 89 d8 8b 7c 24
> 04 f0
> [2220677.413799] EIP: [<c1087710>] find_get_page+0x40/0x81 SS:ESP 0068:dfcefd24
> [2220677.413831] CR2: 0000000000020004
> [2220677.414173] ---[ end trace 90eddfda8e996248 ]---
>
> I don't know if this information is useful for you.
>
> Question: since I can't kill this process, do I need to reboot the server completely?
>
> Thanks for your help,
> Stephane
>
Received on Wed Dec 23 14:23:09 2009
