[00:00] np [00:01] frz (~frz@213.235.213.90) joined #vserver. [00:03] hi frz! [00:10] ok 2 maschines, one with 1.22 and one with 1.23 runned sync [00:10] the same [00:10] 1.22 running, 1.23 lower then 1min and total crash [00:11] okay ... that is something we can work on ;) both UP? [00:12] jep [00:14] okay, we do that in a binary tree way ... [00:14] I cut out about half the changes made in 1.23 (make sure it works) and we test that one, okay? [00:14] stupidawy (foo@198.77.239.131) joined #vserver. [00:15] hi stupidawy! [00:15] Nick change: stupidawy -> stupidnic [00:16] atlanta, you.wish.you.were.pimp.olicio.us? [00:16] ? [00:16] come on... don't pick on my hostmask - it is 1337 :) [00:16] besides it is run off a vserver... that should count for something [00:17] ok bertl [00:17] v3ry l33t ... [00:17] can u send me the new diff? [00:17] sec [00:18] JonB (~NoSuchUse@129.142.112.33.ip.tele2adsl.dk) left irc: Remote host closed the connection [00:18] okay stupidnic, how is your vserver today? ;) [00:18] Good actually... just upgraded to 2.4.24-1.3.4 fix01 [00:19] hmm, better than 1.3.5? [00:19] have one minor issue... and I mean it is rather minor [00:19] Bertl: well considering that 1.3.5 was released the day after I finished my upgrade to 1.3.4 I decided I would let things settle down first :) [00:19] ah okay ... ic [00:19] what is your issue? [00:20] Sh[a]de: status (mine): currently testing the patch ... should be done in a few ... [00:20] when I start a vserver... I am getting this: /usr/local/sbin/vserver: ulimit: cannot modify max user processes limit: Invalid argument [00:20] bertl np, we stress only kernel's and not bertl's ;) [00:21] but only if ULIMIT is defined (obviously) [00:21] stupidnic: change the -H to -HS or check if you use -n with values above 1024 [00:21] Okay... I had added the S, but not the above 1024 [00:21] currently set to 1000 [00:22] (which is what it was set to ogirinally when I started runing ctx17 [00:23] let me have a look at your ULIMIT= line in the config please ... [00:24] sure... [00:24] ULIMIT="-HS -u 1000" [00:24] and give http://vserver.13thfloor.at/Stuff/testme.sh a try just to see everything is working fine ... [00:25] do I run that inside a vserver? or just the root server? [00:25] on the host as root ... [00:26] bertl... [00:26] thats interessting [00:26] -s [00:26] sry i have it loaded too ;) [00:26] all clear but one failed [00:26] [201]# failed. [00:27] on stable 201 failing is natural ... [00:27] ok *relief* [00:28] Bertl: on the root server all tests pass with succeeded [00:28] Bertl, i have to go short outside [00:29] paste me the link in querry [00:29] i do the tests [00:29] im back in aprox 30min [00:29] http://vserver.13thfloor.at/Experimental/delta-vs1.23-rev01.diff [00:30] what do the first 5 lines show? [00:31] Doener (~doener@pD9588138.dip.t-dialin.net) joined #vserver. [00:31] stupidnic: what do the first 5 lines show? [00:31] hi Doener! [00:31] Linux-VServer Test [V0.06] (C) 2003-2004 H.Poetzl [00:31] chcontext is working. [00:31] chbind is working. [00:31] Linux 2.4.24-vs1.3.4 i686/chcontext 0.27/chbind 0.27 [E] [00:31] --- [00:32] okay, shutting down and restarting that vserver gives you what messages? [00:32] come to think of it... I get it on enter too: /usr/local/sbin/vserver: ulimit: cannot modify max user processes limit: Invalid argument [00:33] granted... I haven't made the modifications you suggested about putting the limit number over 1024 [00:33] well I didn't suggest that ... [00:33] do you use nproc as flags? [00:33] affirmative [00:34] is this unmodified 1.3.4? on vanilla 2.4.24 or with some other patches? [00:35] totally vanilla 2.4.24 and the only patch to 1.3.4 was the fix01 (to fix the context numbering problem [00:36] hmm, which patch was that? [00:36] hio i've a problem with lsof in 2.4.23/vs 1.23 [00:36] It has to do with a bug in 1.3.4 where the context number won't increment the next time it was invoked. [00:36] hi [00:36] lsof brings up some warnings which contain the filestructure of the host system [00:36] for example: [00:37] stupidnic: hmm, interesting could you make that one available? [00:37] Sure... hold one [00:37] test-1:/var/www# lsof |grep mysql [00:37] lsof: WARNING: can't stat() ext3 file system /vservers/tooltime [00:37] Output information may be incomplete. [00:37] test-1 is a context below /vservers/test-1 [00:37] rmoriz: okay what is in your /porc/mounts on that server? [00:37] host or ctx? [00:38] s/porc/proc/ [00:38] vserver context [00:38] -> /dev/vserver_vol/tooltime /vservers/tooltime ext3 rw 0 0 [00:38] [00:39] want to have ssh access? :) [00:39] not required, I assume that is the reason for lsof complaining ... [00:39] is there a way to hide that mounts to vserver contexts? [00:39] as you are using vs1.23, you could use the vproc tool to disable the /proc/mounts entirely ... [00:40] oh. didn't try that ;) [00:40] vproc -d /proc/mounts on the host should do the trick ... [00:42] thanks! :) [00:42] Bertl: http://n0rp.floridachicks.com/patch-vs1.3.4-fix01.diff [00:42] don't mind the domain name :) [00:47] adding the S to the ULIMIT seems to sort out the problem [00:50] ah okay that is the nproc patch ... [00:53] Sorry... I didn't bother to read it... (I just did) [00:57] which package contains vproc? [00:57] i don't seem to have it with util-server-0.27 [00:58] Nick change: ccooke -> cc-away [01:09] Nick change: stupidnic -> stupidawy [01:23] stupidawy: it's an addon, available at the download site [01:36] any progress yet? [01:38] hmm, regarding 1.23, I assume? [01:39] yes [01:39] and/or other security/stability related issues [01:40] it would be interesting to test with 1.22 as it seems, that Sh[a]de was able to reproduce this with 1.23 on UP ... [01:40] i saw a patch vs 1.3.4 [01:40] ok [01:40] and it would also be very interesting to test with 1.3.5 [01:40] so i schedule tests with 1.22 and 1.3.5 for tomorrow? [01:40] yeah, preferable 1.3.5 first ... [01:42] ok [01:42] Last message repeated 1 time(s). [01:42] ill do one kernel right now [01:43] re [01:44] oki, patch is available ... [01:44] http://vserver.13thfloor.at/Experimental/delta-vs1.23-rev01.diff [01:44] so bertl [01:44] im ready for test ;) [01:45] this rips out the part, which actually was intended to fix the race issues once and for all ... [01:45] (so it's a good candidate for looking at) [01:49] bertl [01:49] sounds not good [01:50] see here [01:50] wwipvmdev-01:/install/1.23rev01-kernel/linux-2.4.24# patch -p1 < delta-vs1.23-rev01.diff [01:50] patching file Makefile [01:50] Hunk #1 FAILED at 1. [01:50] 1 out of 1 hunk FAILED -- saving rejects to file Makefile.rej [01:50] patching file fs/proc/array.c [01:50] Reversed (or previously applied) patch detected! Assume -R? [n] [01:50] you are patching against what? [01:50] a fresh kernel [01:50] 2.4.24 [01:51] with ur patch [01:51] it's agains vs1.23 ... so you have to apply that first ;) sorry [01:51] oh ok [01:51] np [02:08] patch 1.22, 1.23 to kernels 2.4.23 and 2.4.24 is giving socket.c1111 when attempting dig or nslookup on the host server [02:09] please provide strace of dig/nslookup in such case ... [02:09] I ust compiled 2.4.24 without the patch and dig works [02:09] Hi Bertl: [02:10] I did few days back remember [02:10] building the kernel [02:10] okay, that was the modloader, right? [02:10] 2.4.24-vs1.3.5-295 [02:11] serving: I have an idea why it works/doesn't work ... [02:11] ? [02:11] but I need another strace of that command ... [02:11] ok . [02:12] udp.c: In function `udp6_get_info': [02:12] udp.c:985: structure has no member named `vx_id' [02:12] okay, change the vx_id to xid [02:13] atd [02:13] you have ipv6 enabled, by the way? [02:13] wrong window [02:13] Bertl: you need a new strace out put or you wana look at the old ones ? [02:13] both should do ... [02:14] bertl [02:14] i have run the killer-03 on my remote maschine [02:14] right now . I am running th box with 2.4.24 clean. I will reboot to run 2.4.24-vs1.23 [02:14] it run without problems but only one thing is interesting [02:14] fs/fs.o(.text+0x2d544): In function `proc_virtual_readdir': [02:14] /home/raoul/test/src-vs.1.3.5-2.95/linux-2.4.24/fs/proc/virtual.c:415: undefined reference to `__cmpdi2' [02:14] mhm [02:15] ? [02:15] killer-03 hang for few seconds and running again [02:15] sry, not wanted to interrupt you! [02:16] Bertl: 213.186.190.24/yahoo4 [02:17] Bertl: sorry . http://213.186.190.24/vserver/yahoo4 [02:18] this is within the vserver, right? [02:21] Bertl: Killer-03 is running [02:21] bertl: sorry that everybode wants something from you [02:22] fs/fs.o(.text+0x2d544): In function `proc_virtual_readdir': [02:22] undefined reference to `__cmpdi2' [02:22] Bertl: here is the new starce http://213.186.190.24/vserver/yahoo5 [02:22] maharaja: hmm that is interesting ... [02:22] Bertl: no they are done in the host server [02:22] i already am sorry for him. [02:23] serving: hmm interesting too ... [02:23] the last lines of dmesg are? [02:24] hmm i think thats was a "bugfix" for the stressing problem, killer-03 still running without probs [02:25] Bertl: request_module[net-pf-10]: fork failed, errno 1 [02:25] shade: which patch? [02:25] shade: if its for 1.23, i would like to try [02:25] http://vserver.13thfloor.at/Experimental/delta-vs1.23-rev01.diff [02:25] yes 1.23 [02:26] with the patch from bertl [02:26] thnx [02:26] right, this one [02:26] without this patch my system was only < 1min stressed and then crashed [02:27] serving: hmm does 1.22 give the same issues? [02:27] yes [02:27] but on 2.4.24 vanilla, no problem, no dmesg, right? [02:27] shade: does a "make bzImage" suffice? or do i need a make clean too [02:28] I did not try dmesg but nslookup works fine [02:28] no make clean, but a make dep [02:28] in 2.4.24 vailla [02:28] maharaja: i always make "fresh"! kernels [02:28] right bertl [02:28] ok [02:28] and im happy, very happy :) [02:28] killer running and running [02:29] .oO(i should be studing *g*) [02:29] Sh[a]de: hmm, well, we just _removed_ the race protections in that version ;) [02:29] u guys, specialy bertl, do great things !! [02:29] k [02:30] so that should prove kind of unhealthy, or am i wrong [02:30] maharaja: you mean studying right ? [02:30] good question [02:30] serving: http://www.ussg.iu.edu/hypermail/linux/kernel/0206.1/0009.html [02:30] is 1.23 now on 1.22 level, no or? [02:30] i m there [02:31] please touch /var/log/ksymoops [02:32] rebooting [02:32] serving: try again (on vs1.23) and look what's written there ... [02:33] serving: you have ipv6 enabled? or compiled in? [02:33] Bertl: I read this few days back a dcould not figure out a solution for it. How come th eproblem only manifest itseld when running patched kernles ? [02:33] shade: you run killer-03 1/0/? [02:33] bertl: is killer-03 still ident? [02:33] maharaja: killer-03 [02:34] hmm, it was updated twice, so reload ... [02:34] ok [02:34] DaemonDazz (~spam@WYPP-p-144-134-3-182.prem.tmns.net.au) joined #vserver. [02:34] started "./killer-03" [02:34] hi DaemonDazz! [02:34] Bertl: ipv6 is compiled as modules I think . I am using config file from ftp.soluccorp.. [02:34] allo all [02:34] Thumb squeeze for maharaja ;) [02:35] hi Daemon [02:35] *reading some qm stuff [02:35] Bertl: I just replied to your email requesting an strace of host (Subject Socket.c Invalid Arguement) [02:35] ah, good we are working on that right now ... [02:36] ./killer-03 is still running [02:36] bb in 2mins or so [02:36] me too maharaja [02:37] DaemonDazz, serving: could you please try to insert the ipv6 module manually .. via modprobe [02:37] and then try again with the dig/host/nslookup/whatever ... [02:37] not sure if i built it, hang on i'll check [02:38] well, I'm sure ;) [02:38] Bertl: line in modprobe ipv6 ? [02:38] yep, i did ;) [02:39] don't know how it's called, what is in /lib/modules//kernel/driver/net ... [02:40] Bertl: I did modprobe ipv6 the nslookup yahoo.com and I got socket.c:1111: internal_send: 192.168.0.35#53: Invalid argument [02:40] is the ipv6 modules loaded? [02:40] or do you still get the request_module[net-pf-10]: fork failed, errno 1 message? [02:40] hmm killer now running > 20 min [02:40] Bertl: lsmod > ipv6 175220 -1 [02:41] seems to be ok [02:41] -1 okay? [02:41] the ipv6 loadded ok, the strace is at http://fuzzitech.net/host.strace.ipv6 and there is no more messages in dmesg after loading the module [02:42] [12:42am] hmm killer now running > 20 min <- seems to be ok [02:42] if u think this was my comment to Bertl: lsmod > ipv6 175220 -1 [02:42] ,) [02:43] Bertl: no more error request_module[net-pf-10]: fork failed, errno 1 [02:43] but the dig still fails? [02:43] yep [02:43] yep [02:43] stereo [02:43] same error [02:43] okay, I would ask you to compile the kernel without ipv6 support (just for a test) and try again ... [02:43] bertl: "./killer-03" is happily up and running [02:44] bertl: ill try "./killer-03 1" [02:44] with me also [02:44] Bertl: ok. i am off to recompile [02:44] maharaja: no, not necessary/useful ... [02:44] ok [02:45] what is usefull? [02:45] maharaja: you could start some loops, doing a cat /proc/*/status >/dev/null; endlessly on the host, while running killer-03 [02:45] ok [02:45] something like 'while true; do cat /proc/*/status >/dev/null; done ... [02:45] ill do that via ssh, to keep the console for debugging [02:45] yup [02:47] ok, running happily [02:48] i get a lot of: cat: /proc/20652/status: No such file or directory [02:48] but i guess thast from the fork [02:48] the second instance of that while loop produces no output of missing files/dirs [02:49] when i remove the /dev/null, i get the status as expected [02:49] yeah, you could try '(cat /proc/*/status >/dev/null &) inside the while loop, to push a little harder [02:53] ok running 2x cat loop and one killer-03 [02:53] without probs [02:53] same [02:53] CPU states: 26.6% user, 73.4% system, 0.0% nice, 0.0% idle [02:53] *g [02:53] no fork yet [02:54] 02:55:29 up 20 min, 3 users, load average: 173.37, 147.43, 89.87 [02:54] yeah, on UP this should not be a problem, on SMP this should hang after a while ... [02:54] ill do that & stuff [02:54] and run another killer-03 [02:54] maharaja: 2x killer & 2 cat loops? [02:55] setting it up [02:55] more than one killer doesn't make too much sense, more bashing on the procfs makes sense ... [02:55] ok [02:55] 2x with &, 1x the simple loop [02:56] serving: there is no line like the following one, in dmesg? [02:56] "kmod: failed to exec %s -s -k %s, errno = %d\n", [02:57] 4x bashing the procfs [02:57] 97,3% load and all running fine [02:57] 02:58:51 up 24 min, 5 users, load average: 68.73, 116.99, 89.99 [02:57] 2 loops, 1 loop & loop and 1 killer [02:57] how do you get that cpu load stuff? [02:57] top? [02:57] jep [02:57] Cpu(s): 14.2% user, 85.6% system, 0.0% nice, 0.2% idle [02:58] i'm suprised, that the system is very reactive [02:58] jey i can open more and more ssh windows without probs [02:58] login and other :) very nice [02:58] cause i thought that after a certain load, it gets really hard to interact [02:59] did you implement some schedule patches? [02:59] ok [02:59] i get debug output [02:59] <1>Unable to handle kernel NULL pointer dereference at virtual address 00000000 [03:00] interested in more? [03:00] printing eip: [03:00] c01686ce [03:00] *pde = 00000000 [03:00] Oops: 0000 [03:00] CPU: 1 [03:00] and going on with the register/stack/.. [03:00] yeah, that looks more like it ... [03:00] save it somewhere ... [03:00] we'll need to put it through ksymoops ... [03:01] Bertl: no there isnt such aline [03:01] okay, just checking ... [03:01] damn putty! [03:02] i increased the history size [03:02] dang p3 takes a long time to compile a 2.4 kernel [03:02] and my screan went black :) [03:02] so i'm waiting for more of that stuff? [03:02] :) [03:02] or is there any way the geht the output of minicom back? [03:02] well, probably not ... [03:03] 1x winscp data transfer (2gigs), a dig loop to web.de, 2 double cat, 2 normal cat and one killer-03 -> still stable [03:03] Sh[a]de: vs1.22 will not break on UP ;) [03:03] what I don't understand, why does vs1.23 break ... [03:04] cat_stress.sh: line 3: 11202 Killed sh cat_stress.sh [03:04] mhm [03:04] damn [03:04] will the oops reappear? :) [03:04] uhhhm [03:04] ok my system hangs [03:04] on start apache [03:04] too [03:05] sure? [03:05] ctx: 65258 [03:05] 2631 [03:05] thts the output from killer [03:05] +a [03:05] and he count [03:06] ctx: 65258 [03:06] 2683 [03:06] thats the only thing... [03:06] now i have killed all cat stress tests [03:06] but no change on killer [03:07] well, maybe killer was killed, that might be possible ... [03:07] but the system is responsive? [03:07] no [03:07] now killer and all terminals stand still [03:07] one sec [03:07] ah [03:07] the stuff is saved in dmesg [03:07] :) [03:08] i have last top cpu at 100% on system [03:08] killer killed himself [03:08] yes i have access for now [03:09] all works [03:09] well, it was probably killed by the OOM killer ... [03:09] no crash [03:09] killed all other tests and system is stable [03:09] okay, I'll modify the patch and we test again, okay? [03:09] sure [03:09] np [03:10] you want my oops? [03:10] i'm going to be building a new host server in a week or so, what kernel version and patch version would you recommend? [03:11] hmm im not sure [03:11] 1.22 works fine for our server [03:11] DaemonDazz: atm, 2.4.23+vs1.22 ... [03:11] :) [03:11] bertl... why not 2.4.24? [03:11] with 1.22 [03:12] how much difference is there between 1.21 and 1.22 ? [03:13] i like the kernel ppl: Hey! Please login now. You have one minute left [03:13] :) [03:13] or the login ppl [03:13] höä? [03:14] ;) [03:14] i simply like the tiny gimmicks they add [03:14] *g [03:14] DaemonDazz: in a week or so, I probably recommend 2.4.25 + vs1.24 ... [03:14] is that going to fix what we're looking at now? ;) [03:14] yeah, for sure ... [03:15] if I can track it down ... [03:15] it's now recompiling modules [03:16] ipv6 isn't supported yet, but it never gave problems before, maybe the tools changed a little, and now that missing support breaks them ... [03:17] ahh. one other thing i just remembered. i've added fakeinit to a RH9 vserver so i can use the inittab and now i get the following message whenever i start/stop/enter [03:17] /usr/lib/vserver/printconf.sh: G: command not found [03:17] any ideas? [03:18] [root@hornet root]# rpm -qa | grep vserver [03:18] vserver-0.29-1 [03:19] probably the 'broken' tools, switching to util-vserver-0.27 from enrico should solve that ... [03:19] Sh[a]de: http://vserver.13thfloor.at/Experimental/delta-vs1.23-rev01-rev02.diff [03:19] ontop of your current patched rev01 ... [03:21] k [03:24] Bertl: same error even without ipv6 support [03:25] okay, but no dmesg line this time, I hope? [03:25] no [03:25] does that mean i don't need to remotely rebooting my box ? ;) [03:25] bertl: i guess youl will be happy to know that. [03:25] i run 1x killer-03 [03:26] and 4x that cat procfs loop [03:26] still no hang [03:26] and one oops per cpu [03:26] DaemonDazz: :) [03:27] maharaja: well, those oopses should not happen ... [03:27] ok [03:27] hehe [03:27] doh [03:27] :) [03:27] vs1.23 was supposed to fix that ... and other possible reaces ... [03:27] but at least, no crash [03:27] http://www.ipax.tk/~raoul/vserver/dmesg_oops_040116_0109.txt [03:28] hmm no Oops by me [03:28] its on dual systems only, i guess [03:28] yup [03:28] k [03:28] Bertl: I wonder howcome I get socke.c1111 error on some boxes and not the others ? [03:28] maybe bertl means races, which occur between 2 or more cpus [03:28] trying to accomplish the same thing [03:28] serving: hardware ? [03:29] maharaja: you have to put it through ksymoops -v -m -K [03:29] DaemonDazz: same hardwareon all ofthem [03:29] serving: yeah, but what hardware? my box is a p3-1hgz using an sis chipset [03:29] killer running [03:29] ghz even :) [03:29] serving: could you provide another strace, with strace -fF -s 10000 on the no ipv6 kernel? [03:30] Bertl: sure [03:30] DaemonDazz: 1 sec [03:30] DaemonDazz: doesn't look like it's hardware dependant ... looks more like either config or state of system dependant ... [03:31] 2 cat loops and 1 killer running fine [03:31] this is with the rev2, right? [03:31] sure :) [03:31] as your instrucion [03:31] i just tried runnign 'host domain ns' on another machine which is using the same kernel as the box with the problem and it works fine [03:32] i built the kernel on the one with the problem and transplanted it to the one that is working [03:32] 4 cat loops and 1 killer running fine [03:32] Sh[a]de: well, you can skip the cat loops on UP [03:32] k [03:33] the only thing its different to rev01 is the cpu load [03:33] with only one cat loop and one killer i have baout 80% [03:34] hmm, interesting, what is the difference? [03:34] Bertl: http://213.186.190.24/vserver/yahoo6 [03:34] much more cpu load [03:34] means % or really load? [03:34] DaemonDazz: 2 dual p3 733 1 GB ram . 18 scsi HD [03:34] 80% [03:34] Bertl: fyi if i change the /etc/resolv.conf file to use a remote nameserver it also has a problem [03:34] from top output [03:35] so it depends on an entry in /etc/resolv.conf? [03:36] if i put a nameserver running on the host machine in /etc/resolv.conf i can do dns queries, if i specify a remote server on the command line to host or put a remote one in /etc/resolv.conf i can't [03:36] wonder if it's udp related? [03:36] u put the dns servers in host and v-childs? [03:37] yep [03:37] hmm, hmm, but a correct default route is set, right? [03:37] hmm is set any "search" on top of resolv.conf? [03:37] yep, i am remote the box, so i wouldn't be able to log into it otherwise [03:37] bertl: cat dmesg_oops_040116_0109.txt|ksymoops -v 2.4.24-vs1.23-fix1-2.95 -m /boot/System.map-2.4.24-vs1.23-fix1-2.95 -K [03:37] yes, my domain, fuzzitech.net [03:37] is that correct? [03:37] ok for test delete these entry Daemon [03:38] you might need the vmlinuz in the kernel dir, not the bzImage ... [03:38] i have resolv problems to with a search string in resolv.conf [03:38] without i haven't [03:38] bertl: i do not understand :) [03:38] put the same dns servers as the host had it in all vchilds [03:39] but without the "search" at the beginning [03:39] bertl: take a look at http://www.ipax.tk/~raoul/vserver/ksymoops-dmesg_oops_040116_0109.txt [03:39] DaemonDazz: good observation. same here regarding the local running dns [03:40] DaemonDazz: You know , we should have time the kernel compiles we did :) [03:40] s/time/timed [03:41] maharaja: looks good ... but seems unrelated ... [03:42] bertl: killer and cat stressing the kernel but the kernel works and works fine :) [03:42] serving, DaemonDazz: well, if it depends on the resolv.conf, it's even stranger ... [03:42] have a look at http://fuzzitech.net/resolv.txt [03:42] i did the same test on the host (hornet) and a virtual server (vs1) [03:43] i don't think it depends on resolv.conf, i think it's the remote nameserver that is getting it [03:43] Deamon how it is without the search string in the childs? [03:43] berl: ok, i kept it running [03:43] got another ooops [03:43] only on the host machine tho, not in the vservers [03:43] [root@vserver:vs1 /]cat /etc/resolv.conf [03:43] nameserver 202.191.96.192 [03:43] nameserver 202.191.96.193 [03:43] nameserver 202.191.97.33 [03:43] [root@vserver:vs1 /]host mirror.aarnet.edu.au [03:43] mirror.aarnet.edu.au has address 192.42.62.2 [03:44] hmm [03:44] Bertl: could it be dependent on the interface ? I did not change the resolv.conf. I did nalookup yahoo.com localhost [03:44] would it be possible for the host to have problems with udp but the virtual servers are ok? [03:45] could be ... [03:45] my nameserver is bound to eth0, as are the vservers [03:45] guess we'll have to look at tcpdumps of those requests ... [03:45] ok, coming up [03:45] have your childs a dns entry? [03:46] what i mean is, had any vchild a tld connected? [03:46] Bertl: how can I generate the needed tcpdump ? [03:46] bertl: just another ext3 related oops: http://www.ipax.tk/~raoul/vserver/ksymoops-dmesg_oops_040116_0146.txt [03:46] Bertl: yeah, what flags to you want to tcpdump [03:46] tcpdump -vvnei host [03:46] Sh[a]de: all my hosts are in dns, with reverse set up [03:46] (would be the easiest way I guess) [03:47] k [03:47] hmm, add -s 10000 to it ... [03:47] what say a dig @YourDNSServer -axfr your ip or yopur domain [03:47] Bertl: that "1 warning and 1 error issued. Results may not be reliable." is normal? [03:48] Bertl: i get nothing in tcpdump [03:48] maharaja: well, not normal, but acceptable ... [03:48] Bertl: Stressing works [03:48] no problems to report [03:49] starting, stoping apache ftpd and other thins work great [03:49] okay, we'll start adding the locks abck ... [03:49] Bertl: yep . nothing here too. [03:49] s/abck/back/ [03:49] serving, DaemonDazz: hmm, that means the server isn't even contacted? [03:49] it would look that way [03:50] Bertl: hence the error [03:50] and looking at the error message it would make sense [03:50] do you get the same results on a 'working' machine [03:50] I mean no tcpdump output .... [03:50] hang on [03:51] Bertl: I cant test . I only have the DNS server and the problem server now [03:51] Bertl: Sorry i would really help you but this night i need at least some hours sleep... my fuel is out :/ [03:52] tomorrow we can test again :) [03:52] 11:23:16.133166 vsm.34055 > ns1.e-access.com.au.domain: 46346+ A? mirror.aarnet.edu.au. (38) (DF) [03:52] 11:23:16.133873 ns1.e-access.com.au.domain > vsm.34055: 46346 1/2/0 A mirror.aarnet.edu.au (100) (DF) [03:52] that's what i get from a working amchine [03:52] vsm is your local ip? [03:53] yep [03:53] that is the machine that trevor from syc posted about the other week, he was having problems sshing down ipip tunnels [03:53] that was fixed by not using v_sshd [03:53] i wish everyone in here a well night and i say see ya tomorrow [03:53] nite [03:53] and thanks herbert [03:53] night Sh[a]de! [03:54] thank you for testing ... [03:55] hey np again and again with pleasure! [03:55] MORNING! [03:55] actually, it's nearly afternoon ;) [03:55] DaemonDazz: hmm, 2am in the afternoon, I like that ... [03:56] DaemonDazz: What gcc version are you compiling with ? [03:56] mine is gcc version 3.3.2 20031022 (Red Hat Linux 3.3.2-1) [03:56] no, it's 11:28am :) [03:57] [root@hornet root]# gcc -v [03:57] Reading specs from /usr/lib/gcc-lib/i386-redhat-linux/2.96/specs [03:57] gcc version 2.96 20000731 (Red Hat Linux 7.3 2.96-113) [03:57] eep that's old lol [03:57] but often better than 3.x compilers ... [03:58] i was looking at the datestamp .. :) [03:58] 1726 read(7, "nameserver 192.168.0.35\n", 4096) = 24 [03:59] hmm, that is a local nameserver, right? [03:59] anyways, different hardware (one UP and one SMP), different compilers, different name servers [03:59] who's was that from? [03:59] http://213.186.190.24/vserver/yahoo6 [03:59] no . that is seperate box [03:59] is it on the same LAN ? [03:59] yes [04:00] lets try some things ... [04:00] shoot [04:00] 192.168.0.35 is my DNS and internet gateway . The problem box in 192.168.0.71 [04:00] first: 'dig 13thfloor.at' on both machines, working and not [04:01] serving: you try on the problem box only ... [04:01] I just want to know, works, or doesn't work ... [04:01] dosnot work [04:02] bad-host: doesn't work - vserver-on-bad-host: works - other-machine: works [04:02] ok, second: 'dig 13thfloor.at @212.16.62.60' [04:03] does not work [04:03] bad-host: doesn't work - vserver-on-bad-host: works - other-machine: works [04:03] btw, i have the resolv.conf on bad-host set to the remote server still [04:03] if i change it to the local one the first test works [04:04] okay ... now: 'dig +tcp 13thfloor.at @212.16.62.60' [04:04] works [04:05] i never knew you could do that :) [04:05] serving? [04:05] works too [04:05] udp then? [04:05] okay, seems to be an udp issue ... [04:06] this isn't a remnant from the udp.c issue with 17c? [04:06] let's see if something looks obviously broken ... [04:06] FIREWALL [04:06] not in my case [04:08] :( not here either. I flushed ipatbles but no that is not it [04:08] I forgot that I can nslookup fine from inside a vserver [04:08] :) [04:10] well what does 'grep ipv4root /proc/self/status' output on a failing and on a working machine? [04:10] ipv4root: 0100007f/00ffffff 4700a8c0/00ffffff [04:10] ipv4root_bcast: ff00a8c0 [04:10] ipv4root_refcnt: 13 [04:11] Daemon? [04:13] http://fuzzitech.net/self.status [04:13] first one is failing, second is vserver and third is working [04:13] thought so ... [04:13] you know the same problem occures with vmlinuz-2.4.22ctxsmp-17c that I got of the net few month back. Not with rh7 only iwth this fedora on tghis box [04:13] you are not on the 'pure' host, you are inside a strangley configured chbind ... [04:14] who me ? [04:15] both, on the failing hosts ... [04:15] ipv4root: 0100007f/00ffffff 2161bfca/f8ffffff [04:15] ipv4root: 0100007f/00ffffff 4700a8c0/00ffffff [04:15] both mean that chbind --ip 127.0.0.1 --ip was issued ... [04:17] is it issued in the v_services ? [04:17] ok, even when i forcefully do a v_sshd stop and sshd start it is still not 0 [04:19] ok, something weird is going on [04:20] sshd won't stop running connections ... [04:21] it's not being called via chbind, but it thinks it is? [04:21] yes, i know that :) [04:21] i change it and then reconnect [04:21] serving: yes, v_sshd will use that, but it should not specify 127.0.0.1 (local) [04:21] I just turned off v_sshd v_xinetd v_portmap and rebooted [04:22] sshd is only bound to eth0 like i set the ListenAddress to in the sshd_config [04:23] what tools do you use? [04:23] at least a netstat -l only shows it bound to eth0 [04:23] GOOD NEWS [04:23] vserver 0.29? [04:23] it is working fine [04:23] yes, 0.29 [04:24] well, i can try a reboot [04:24] one of v_sshd v_xinetd v_portmap is the culprit [04:24] damn, there goes my uptime though [04:24] serving you can get the faulty behaviour, if you do chbind --ip 127.0.0.1 --ip 192.168.0.71 dig ... [04:24] yes [04:24] which one is reposible ? [04:25] you are using 0.29 vserver too? [04:25] yes I am . [04:25] should i reboot? [04:25] okay, folks 0.29 is definitely broken, in several aspects ... [04:25] DaemonDazz: did you disblae v_services ? [04:25] what is the best version then? [04:25] probably enricos util-vserver 0.27 atm [04:25] i'm only using v_sshd [04:26] he says in his announcement htere was a problem with rh7.3 iirc [04:26] let's do some tests with 0.29, I guess some things might be fixed ... [04:26] serving: rebooting is not a problem for you? [04:26] no [04:27] okay, do you have a /etc/vservices directory? [04:27] i have /etc/vservers [04:27] but no /etc/vservices right? [04:27] nop [04:28] mkdir /etc/vservices [04:28] no, i have /etc/vservers and /etc/vservers.conf [04:28] ko [04:28] ok [04:28] done [04:28] then add a file sshd.conf in that directory, which contains one line: [04:29] IP="" [04:29] done [04:29] you could also use IP="eth0" if you do not up/down that service after boot ... [04:29] with quotes ? [04:30] with quotes ... [04:30] done again ;) [04:30] now reenable the v_sshd and disable the sshd service ... [04:30] [12:02][darryl@hornet ~][0]$ cat /proc/self/status | grep ipv4 [04:30] ipv4root: 2161bfca/00ffffff [04:30] ipv4root_bcast: ffffffff [04:30] ipv4root_refcnt: 7 [04:30] [12:02][darryl@hornet ~][0]$ host mirror.aarnet.edu.au ns1.e-access.com.au [04:30] Using domain server: [04:30] Name: ns1.e-access.com.au [04:30] Address: 202.191.96.194#53 [04:30] Aliases: [04:30] mirror.aarnet.edu.au has address 192.42.62.2 [04:30] works now [04:30] IP="192.168.0.71" [04:30] yup [04:31] problem with multiple IPs? [04:31] well, not a problem, but a know (mis)feature [04:31] s/know/known/ [04:32] but it only affects udp [04:32] if you have more than one ips assigned to a vserver/chbind, then the first one is used for unspecified outgoing connections ... [04:32] uhuh [04:32] in your case, this is the localhost ;) [04:32] so if you did --ip extip --ip 127.0.0.1 it would work? [04:32] yep! [04:32] maha|fh (~b3dd4@chello080109078221.4.15.vie.surfer.at) left irc: Quit: leaving [04:34] oki, so not really a problem if you are aware of it [04:34] the localhost isn't added to the v_* services on tools != 0.29 ... [04:35] Bertl: should I reboot with v_sshd now ? [04:35] it seems, jack had the bright idea to add this per default, and it broke some assumptions ... [04:35] dont need to point to the newly created file ? [04:35] serving: yup should work this way ... [04:36] don't know maybe jack removed that too, but I doubt it ... [04:36] rebooting [04:38] bertl: i guess you had no time to add some locks back to the code [04:38] now something else is going on [04:38] wui, got another oops [04:38] analysing [04:38] i did a /etc/init.d/vservers stop and then start and now the host server is showing all the services bound running in one of the vs's [04:39] maharaja: a new patch will be ready any moment ... [04:40] DaemonDazz: hmm, please elaborate ... [04:40] bertl: ext3 again [04:40] bertl: shall i post these somewhere for the ext3 ppl to notice? [04:42] Bertl: it is workin now. [04:43] Bertl: looking into it [04:44] Bertl: how can I add other IPs ? would it make a differece if I bind IPs to /etc/ssh/sshd_config ? [04:44] s/to/in [04:44] maharaja: you could post them on fs-devel, but I doubt, that they'll investigate vserver kernels ... [04:45] so you think its vserver related? [04:45] serving: no, the current setup with the 'IP=""' in /etc/services limits sshd to that single ip ... [04:46] if you specify ips to bind to in /etc/ssh/sshd_config you do not need the v_sshd anyway ,,, [04:47] Bertl: how to do remove the ipv6 module ? i'm getting a resource in use error [04:47] maharaja: http://vserver.13thfloor.at/Experimental/delta-vs1.23-rev02-rev03.diff ontop of rev02 ... [04:48] DaemonDazz: well, guess some 'services' are using that module, or something locked to it ... [04:48] [root@hornet root]# lsmod [04:48] Module Size Used by Not tainted [04:48] ipv6 170688 -1 [04:49] -1 is a bug ... [04:49] well, that's nice :P [04:49] but a kernel ipv6 bug ;) [04:49] reboot ? [04:49] would be the 'best' solution, to get rid of it ... [04:50] mhm [04:50] i haven't got the rev02 [04:50] oki, will do, then i have to go drop the wife off [04:50] ok [04:50] got it [04:53] what does oops mean? [04:54] kernel/kernel.o(.text+0x181): In function `proc_caches_init': [04:54] /home/raoul/test/src-2.95/linux-2.4.24-rb/kernel/fork.c:877: undefined reference to `ctx_ref_lock' [04:55] machine is back up without ipv6 [04:55] and the problem with the vserver services binding the 0.0.0.0 has gone too [04:55] anyways, bbl [04:55] ill get a fresh copy [04:55] and apply the patches [04:55] maharaja: 'oops' is onomatopoetic (lautmalerisch) for something you didn't want to happen, but what happened ... [04:56] like in 'oops, I did it again!' [04:56] oh [04:56] and i thought it was something like gpl, gnu and stuff [04:57] http://www.m-w.com/cgi-bin/dictionary?va=oops [05:00] The 2 boxes without the socked.c problem had multiple ListenAddress in sshd_config . If I was observant I would have saved us the aggrivation . Sorry guys [05:00] no problem at all ... [05:01] I'm glad that we could narrow it down, and solve it ... [05:01] thanx Bertl. that was spectacular :) [05:02] guess I'd found it earlier, but the statement 'on the host' distracted me from the real issue ... [05:03] bertl: [05:03] kernel/kernel.o(.text+0x757): In function `schedule': [05:03] : undefined reference to `ctx_ref_lock' [05:03] now that I am running 2.4.24 and 1.23 , do you wana see my /proc output ? [05:04] well, did you use vproc to adapt it? [05:04] maharaja: hmhm ... sec [05:04] i applied: [05:04] cat ~/patch-2.4.24-vs1.23.diff |patch -p1 [05:04] cat ~/delta-vs1.23-rev01.diff |patch -p1 [05:04] cat ~/delta-vs1.23-rev01-rev02.diff |patch -p1 [05:04] cat ~/delta-vs1.23-rev02-rev03.diff |patch -p1 [05:05] no . not yet . just vserver-tools [05:05] well, then it'll probably show the same as before ... [05:05] but /proc on the vserver "looks" dfferent than the hosts' [05:06] well, other (the numeric dirs) but except for that, it will be the same, I guess ... [05:06] I ll get vproc then [05:07] lithium (~lithium@CPE005004cf14b9-CM013439901047.cpe.net.cable.rogers.com) joined #vserver. [05:08] hi lithium! [05:08] Hello. [05:09] maharaja: I don't get those error, I'll make a new diff against 2.4.24 for you ... [05:10] is vproc documented somewher ? how do I use it ? [05:11] ok [05:11] there is an example on the mailing list ;) [05:11] maharaja: http://vserver.13thfloor.at/Experimental/patch-2.4.24-vs1.23-rev03.diff [05:11] serving: and the tool has a -h (help) [05:12] what brings you here lithium? [05:12] it must be late :) [05:13] I just wanted to see what's going on. I've been using vserver for a while, so I figured maybe I could help someone. [05:13] rebuilding [05:14] I seem to be about the only person using vservers under Slackware for some reason. Oh well... [05:14] well, honestly, i know very few ppl who use slackware [05:15] and even less use vserver at all [05:15] so :) [05:15] lithium: great ... [05:15] hehe [05:15] /home/raoul/test/src-2.95/linux-2.4.24/kernel/fork.c:877: undefined reference to `ctx_ref_lock' [05:16] kernel/kernel.o(.text+0x181): In function `proc_caches_init': [05:16] I did get that feeling when I had to patch util-vserver for Slackware's init method. [05:16] maharaja: hmm, strange .. investigating ... brb [05:17] maharaja: well that seems to be a different kernel ... [05:18] maybe your sources got mixed up?! [05:19] mhm [05:19] dont know [05:19] ill get the md5sum of linux-2.4.24.tar.bz2 [05:20] the kernel builds here, and line 877 of fork doesn't contain any ctx_ref_lock [05:20] i know, it does not [05:20] thats why i wonder [05:20] i post the full output [05:20] (query) [05:21] or rather, ill upload it ;) [05:21] http://www.ipax.tk/~raoul/vserver/make_bzImage.txt [05:22] can you do an md5sum of your kernel? [05:22] tared and bzipped [05:22] 1e055c42921b2396a559d84df4c3d9aa /home/raoul/linux-2.4.24.tar.bz2 [05:22] hmm, okay that is in the linker part ... [05:23] do a make clean dep , and rebuild the kernel ... [05:23] ok [05:23] maybe it's beeen because of "make dep && make clean && make bzImage" [05:23] ? [05:26] did you add the "-g" to the cflags [05:26] as you tried to compile the kernel? [05:26] CFLAGS_KERNEL = -g [05:27] no [05:27] what does -g do? [05:28] it adds debug information to the kernel ... [05:28] so there can be no problem with that [05:28] no, not really ... [05:28] ok, im again rebuilding the kernel, after receiving that linker error [05:29] again [05:29] error [05:29] the same one [05:29] hmm, okay let me check ... again ... [05:29] I have an idea ... [05:29] great! :D [05:30] i go to study another page [05:41] okay maharaja, found it ... [05:42] tell me [05:43] kernel/vcontext.c [05:43] 155:static spinlock_t ctx_ref_lock = SPIN_LOCK_UNLOCKED; [05:43] the static has to be removed ... [05:43] spinlock_t ctx_ref_lock = SPIN_LOCK_UNLOCKED; [05:43] didn't show up in my kernel compile, because SMP was off :( [05:43] i c [05:44] do i need a make dep/clean/whatever? [05:44] make bzImage should be sufficient [05:44] ok [05:48] Last message repeated 1 time(s). [05:48] Linux version 2.4.24-vs1.23-rev3-2.95 (raoul@red.ipax.tk) (gcc version 2.95.4 20011002 (Debian prerelease)) #8 SMP Fri Jan 16 05:45:19 CET 2004 [05:48] starting killer-03 without any parameters [05:48] okay [05:49] ok, ill go reading another page for my exam [05:49] lets see what happens if i return [05:49] ;) [05:49] oki [06:16] loger joined #vserver. [06:21] ok Bertl, i've added a wiki at http://fuzzitech.net/wiki/ - now to work out how to replicate yours [06:22] re [06:47] anyways B, I'll have a bit of a look into it [06:47] i have to get going now [06:47] okay, cu around ... [06:47] thanks again for your help [06:48] ciao [06:48] your welcome ... [06:48] DaemonDazz (~spam@WYPP-p-144-134-3-182.prem.tmns.net.au) left #vserver. [08:11] ping Doener [08:11] pong Bertl [08:12] hmm ;) [08:14] hm.. about that wiki in conjunction with mirroring... i guess some 'slave-mode' for tavi would be of use? [08:14] sure, definitely .. [08:15] something givin read access locally, and forwarding on write requests to the main site ... or actually doing that write request on behalf ... [08:16] then i'll have a look at the tavi sources right after setting up some mysql-replication and having breakfast ;) [08:16] also some feature enhancements for tavi are still on my list ... [08:17] (like not storing empty changes ;) [08:17] i guess it would be best to keep all write request at the master so there is no need for a mysql user that has write access over the net [08:18] I guess the simplest, but most efiicient way, would be to handle the write pages locally, but instead of going into the database, just send another http request to the master server, and resync the db then ... [08:19] (which would be done automagically with replication) [08:21] so that the user never leaves the server he logged onto... yeah, that's important regarding the cookie saving the preferences... [08:22] well, we should change the cookies to use a 'linux-vserver' domain as visibility ... so that the prefs work on each of the mirrors too ... [08:31] oh boy... 10 minutes to figure out, that the mirrors can all get different sub-domains on the same domain and that that solves the cookie problem... [08:32] hmm, well I thought about using linux-vserver.net for the mirrors ... [08:33] something like www.de.linux-vserver.net for example ... [08:34] i just was kinda blind and thought of the mirrors to be on different domains, which eliminates cookie data access across the mirrors... but of course i was wrong... [08:35] well there is still the linux-vserver.net vs linux-vserver.org if we choose that path ;) [08:35] there is a .net domain? [08:35] Action: Doener didn't even know that ;) [08:36] yup, we have .com, .org, .net ;) [08:39] ok, back to the wiki itself, you'd like to have possibility to put the wiki into slave mode what simply forwards editing requests to a master server by doing a second http request, right? [08:39] well, that was what crossed my mind, smarter solutions are always welcome ... [08:42] ok, then i'll take that idea as a starting point... but now it's time for breakfast... [08:42] i guess you'll be asleep when i return? [08:42] right, I'll have breakfast, and go to bed then ... [08:43] ok, have a good night then [08:43] yeah, and thanks for helping with the wiki ... [08:43] I'll send you the modifications I did to tavi 0.25 tomorrow ... [08:44] hey, i'm happy that there finally is something i think i can really handle to help with :=) [08:44] cu tomorrow [08:44] Nick change: Doener -> doener_bf [08:57] night everyone ... [08:57] Nick change: Bertl -> Bertl_zZ [09:00] lithium (~lithium@CPE005004cf14b9-CM013439901047.cpe.net.cable.rogers.com) left irc: Quit: Leaving [09:30] Nick change: doener_bf -> doener_aw [12:46] loger joined #vserver. [13:25] mcp (~hightower@wolk-project.de) left irc: Ping timeout: 492 seconds [13:32] frz (~frz@213.235.213.90) left irc: Remote host closed the connection [13:36] mcp (~hightower@wolk-project.de) joined #vserver. [14:28] johnny (~johnny@ip68-10-185-29.hr.hr.cox.net) left irc: Remote host closed the connection [15:24] MedivhWrk (ck@netops.multimedia-centrum.de) left irc: Quit: simon says: rehashing [15:27] AGoe (~agoeres@Gd4fc.g.pppool.de) joined #vserver. [15:27] AGoe (~agoeres@Gd4fc.g.pppool.de) left irc: Client Quit [16:37] Doener_zZz (~doener@pD9E12A70.dip.t-dialin.net) joined #vserver. [16:45] doener_aw (~doener@pD9588138.dip.t-dialin.net) left irc: Ping timeout: 501 seconds [17:00] Sh[a]de (shade@cpe109.bb101.cablesurf.de) left irc: Quit: Excursion (On IRC.BONGSTER.DE [#wwip, #german-elite and #lov]) [17:16] |Shade (shade@cpe109.bb101.cablesurf.de) joined #vserver. [18:16] |Shade (shade@cpe109.bb101.cablesurf.de) left irc: Killed (services.oftc.net (Too many invalid passwords)) [18:18] Shade (shade@cpe109.bb101.cablesurf.de) joined #vserver. [18:18] Nick change: Shade -> |Shade [18:48] |Shade (shade@cpe109.bb101.cablesurf.de) left irc: Killed (services.oftc.net (Too many invalid passwords)) [18:48] Shade (shade@cpe109.bb101.cablesurf.de) joined #vserver. [18:48] Nick change: Shade -> Sh[a]de [19:23] serving (~serving@213.186.190.24) left irc: Read error: Connection reset by peer [20:12] Nick change: Bertl_zZ -> Bertl [20:13] hi everyone! [20:18] Cmaj (DeCKeR@216.113.62.236) joined #vserver. [20:18] hi Cmaj! [20:18] hi Bertl [20:20] hi Bertl [20:20] hi Sh[a]de! [20:22] the radiostation FM4 is the greatest stream in net ;) [20:22] greets to austria [20:22] hehe! [20:22] im very happy of my set up Linux debian 2.4.24-vs1.23 #3 Tue Jan 13 17:03:55 EST 2004 i686 GenuineIntel unknown GNU/Linux :) thx i even run a sarge vserver [20:22] hmm [20:22] 1.23 running stable? [20:23] or is this the patched one? [20:23] well no, actually 1.23 is stable in many setups .. especially without killer ;) [20:23] hehe ok [20:23] are you eager to continue testing? [20:24] i take the one on patch-2.4.24-vs1.23.diff.gz [20:24] yes sure [20:24] yell i optimixed my gcc specs and a -O3 [20:25] okay then lets continue with the following: http://vserver.13thfloor.at/Experimental/patch-2.4.24-vs1.23-rev03.diff [20:25] Cmaj: you could even specify -O9, did you know that? [20:25] yeas i seen a ipv6 fix is it for vs.123 [20:25] k patch this over the 1.23rev01 one? [20:25] lol [20:25] Sh[a]de: nope against vanilla ... [20:26] and you have to fix a line IIRC ... [20:26] in my way off view -O9 is the same [20:26] ok the fresh one ;) [20:26] well not that experiemented users [20:26] what fix for ipv6 do you mean? [20:27] mistake in /net/ ... [20:27] actually your post [20:27] hmm, msg id or url? [20:28] !vs_check(sk->xid, VX_MATCH|VX_IDENT)) [20:28] ah okay, that is 1.3.5 stuff ... [20:29] oky cause i use a little ipv6 [20:29] actually met setup is good my problem come from util-vserver [20:30] lots of confusing msg at compile time [20:30] maybe debian got outdated dev lib [20:31] well bug the debian maintainer to do a decent util-vserver package ... [20:31] Cmaj: does https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=112448 help you? [20:31] I'm sure enrico is willing to help where possible ... [20:31] hi enrico! [20:32] Bertl: hi, had no time yet to answer your mail; I have to finish vunify first... [20:34] but I am not sure if this will be ever compilable with gcc29x... [20:35] hmm, well ... many distros still use a 'working' compiler ;) [20:36] ;) [20:38] okay yeas i did that used the source for e2fsprogs from source make install-libs [20:38] now it compile but i still got lots warning [20:39] ensc: vlimit seems broken ... details later ... [20:42] Cmaj (DeCKeR@216.113.62.236) left irc: Quit: keep up the good work |me its skool time [20:43] mjumm.... [20:43] hi click! [20:43] i've stuck my head into too many projects these days [20:43] heya :) [20:44] but it seems, you are back to the one-and-only real project! ;) [20:45] ofcourse [20:46] one thing that should have been a define in the kernels menuconfig after being patched is the ability to change the 16-ip limit on the patch [20:46] i'm too lzay to change it usign the defines :) [20:46] urgh my fingers are still numb [20:47] Bertl [20:47] ich have a error in make bzImage [20:48] yeah, I said you have to modify a line by hand ... [20:48] oups [20:48] kernel/vcontext.c [20:48] 155:static spinlock_t ctx_ref_lock = SPIN_LOCK_UNLOCKED; [20:48] remove the static [20:48] spinlock_t ctx_ref_lock = SPIN_LOCK_UNLOCKED; [20:49] hehe "remove the static" [20:49] sounds like some readionoise :) [20:49] radio even [20:50] ok done [20:50] sry [20:50] make bzImage started [20:51] nothing to be sorry about, I could have fixed it ... [20:51] serving (~serving@213.186.190.24) joined #vserver. [20:51] hmm therefore because I haven't seen it ,) [20:52] hi serving! [20:55] ok Bertl [20:55] should i now running killer? [20:56] okay, that is again on that UP test system, right? [20:56] right [20:56] as yesterday [20:56] and all tests [20:56] HI Bertl. How r you ? [20:56] Sh[a]de: well then go ahead, kill it! [20:56] serving: fine thank you! how are you? [20:57] ok as you say ;) [20:57] ctx: 50547 [20:57] ctx: 50548 [20:57] killed [20:57] *g [20:57] okay, that was expected .... [20:57] we now 'manually' remove some locks, okay [20:57] np let me short restart the maschine [20:59] wow. [21:00] hey all [21:00] i see you're making progress, thats good to know :) [21:00] gotta go though [21:00] cu tomorrow [21:00] bertl: we found a slightly elegant method to implement quotas on a vdisk btw... [21:01] the diskspace is allocated on the root-server using a loopdevice [21:02] maharaja: cu [21:02] click: well loopback isn't that elegant ... [21:03] in the vserver, editing /etc/mtab, you force /dev/hdv1 to be a loop-device instead of the normal mount, and create the respective dev-node for the loop-device in the vs'es /dev/-system [21:03] thusm one have access to quotas, without the quotapatch [21:03] and it would be easier to create the loop inode as /dev/hdv1 [21:04] nothing which hasn't been done yet ... but as I said it has major drawbacks ... [21:04] hm... [21:04] well, thats actually which is done [21:05] as the root-server admin creates the respective dev-node [21:05] ok Bertl [21:06] maschine up [21:06] okay in kernel/sched.c [21:06] lines 634 and 644 ... [21:06] spin_lock(&ctx_ref_lock); [21:06] spin_unlock(&ctx_ref_lock); [21:06] comment them out using // [21:07] k done [21:07] // spin_lock(&ctx_ref_lock); [21:07] okay, [21:07] that's it ... another run ... [21:07] k [21:12] killer runns [21:12] and crash [21:12] ctx: 53038 [21:12] ctx: 53039 [21:12] okay .. next locks ... let me know, when you are ready ... [21:13] k short restart [21:14] same file [21:14] 170 spin_lock(&ctx_ref_lock); [21:15] 177 spin_unlock(&ctx_ref_lock); [21:15] spinlock-problems again? [21:16] Sh[a]de: comment them out using // [21:16] kk [21:16] click: well, it's a spinlock issue on UP ... [21:18] hm [21:18] on the devel-edition? [21:19] nope, stable vs1.23 ... [21:19] 1.20 is not affected? [21:28] well 1.20 is affected, but in another way ... [21:28] basically we have to deal with races which are the result of ctx-2 and later ... so verry old ... [21:29] vs1.00 doesn't fix any of those 'inherited' races [21:29] vs1.20 and up to vs1.22 fix many of those races, but not all [21:33] vs1.23 was intented to fix the last known races, but although I do not understand it yet, seem to create spinlock issues on UP systems ;) [21:33] killer running [21:33] and it seems to be ok [21:35] still running [21:35] okay, if this stays stable, please comment in the lines 634 and 644 again ... [21:36] k [21:36] give it a few minutes to run ... [21:40] ok still running [21:41] okay, let's add the locks in 634/644 back and try again ... [21:41] k [21:46] killer is hunting [21:46] I'm really curious if we can find out why your system crashes, and mine for example doesn't ... [21:47] well...me too [21:47] :/ [21:55] still running [21:56] okay, so now the only 'removed' locks are the ones in lines 170 and 177, right? [21:56] mom [21:56] right [21:57] could you build a vs1.23 (without any other patches) only with that two lines commented out? [21:57] sure [21:57] they look a little different in the original (read_lock/unlock) [21:58] give me some minutes [21:58] but the location should be the same ... [21:58] take your time, no need to hurry ... [21:59] make: *** No rule to make target `/install/linux-2.4.24/include/asm/param.h', needed by `/install/linux-2.4.24/include/linux/sched.h'. Stop. [22:00] what did you do? [22:00] oh mom [22:00] i have a kernel with patched 1.23 [22:00] and comment 170 & 177 out [22:00] then make bzImage [22:00] maybe you need a make dep clean first ... [22:01] k mom [22:03] ok with make dep and clean it seems to work [22:03] good ... [22:03] you probably move/renamed that directory ... [22:03] can be.. [22:03] ,) [22:04] so much kernels... [22:10] Bertl, one question outsinde vserver [22:10] what does this mean: [22:10] #warning the author of this code needs to read up on list_entry [22:11] well this means, that the author of this code needs to read up on list_entry ... ;) [22:11] or, that nobody cared about the code you are compiling ... [22:11] well i think thats the right answer *g [22:11] ok [22:11] it's old/outdated and probably broken ... [22:12] such warnings are generated by macros once used/now depreciated ... [22:12] ok thanks [22:22] Bertl, i have to go short do some support [22:22] i*ll be back in aprox 2h [22:27] np [22:54] ccooke_ (~ccooke@spc1-walt1-4-0-cust238.lond.broadband.ntl.com) joined #vserver. [22:54] ccooke (~ccooke@spc1-walt1-4-0-cust238.lond.broadband.ntl.com) left irc: Read error: Connection reset by peer [23:14] hehe [00:00] --- Sat Jan 17 2004