[00:12] nathan_ (~nathan@207.44.202.162) joined #vserver. [00:12] hi nathan! [00:12] hey herbert [00:12] can you link me to 1.3.2 again? [00:12] i tested and 1.3.1 and it crashed the same but i noticed it didnt have the same locking as .2 [00:13] when do you plan to test? [00:13] right now unless you advise not to [00:13] okay, I'll upload the latest patches ... [00:14] get Con's patches for 2.4.23 too, if possible .. [00:15] ok ill apply ck1 [00:15] and there is a fix ... available from Con [00:15] against ck1? [00:16] yeah, he has it in the fix dir .. [00:16] fix1, i see it [00:16] k patching it up [00:16] http://vserver.13thfloor.at/Experimental/patch-2.4.23-ck1-vs1.3.2.diff [00:17] http://vserver.13thfloor.at/Experimental/patch-2.4.23-vs1.3.2.diff [00:18] and if possible, enable magic sysreq, in the case that it locks up, I would be glad to get some dumps ... [00:18] maybe compiling with -g as kernel CC options would also help later on ... [00:18] Bertl, yea i still have that problem with the remote box though :/ [00:18] i have no access to this box physically or serial [00:18] and lkcd is disabling smp [00:18] so its kinda like, yea it crashed let me go look at the code and hunt blindly right now [00:18] it is a SMP machine, so enable the nmi_watchdog ... [00:19] this will at least get you a clean reboot, if it's a locking issue ... [00:20] ok all patched clean [00:20] what do you think of me ;) [00:21] you are the awesome :) [00:24] hehe [00:24] assuming this fixes the vx_info crashing issues ill hack up some tests for the ip code too [00:24] the awesome! [00:24] it will be nice to finally see this box stay up [00:25] yeah, I totally agree, and I appreciate your testing ... [00:25] I really appreciate it, thank you .. [00:26] hi dan! how was your xmas? [00:26] it was good :) how was yours? [00:27] thanks fine ... [00:49] ok ck1 vs1.3.2 is booted, running same tests again. [00:57] hmm i think i just lost the box [00:57] my tests were running and then i threw jonathans killer.c into the mix [00:57] hmm, not good ... [00:57] lost it after about 2s of output on killer.c [00:57] gonna give it a boot and do it just with killer.c [00:59] hmm i wish i could get lkcd working without killing the smp [01:00] what about network console? [01:00] do you have a second machine in the same network? [01:00] network console requires same subnet right? [01:00] IIRC yes [01:00] hmm i dont, i wonder if someone i know does though [01:01] it almost seems like lkcd is no longer maintained [01:02] lkcd isn't/wasn't a very good approach ... [01:03] has anything else come along for dump analysis akin to sun? [01:05] http://iven.home.cern.ch/iven/linux_debug/linux_crash/tools.html [01:06] hm checking out mclinux [01:06] never tried, but might be an option ;) [01:07] my google skills suck i guess, i couldnt find any reference to mclinux [01:09] You don't have permission to access /downloads/patches/ on this server. [01:09] Action: nathan_ scratches head [01:09] their approach maybe not so hot either? :) [01:11] http://oss.mclx.com/projects/mcore/ [01:11] http://oss.mclx.com/projects/crash/ [01:11] yep and then download->kernel patches->forbidden [01:12] maybe they turned off indexing by accident [01:12] user tools are still downloadable [01:13] based on what im reading on their mlist archives it looks like mcore and lkcd may have become one [01:13] in that case, the result is a regression ;) [01:15] i wonder if my friend will give me back my dual i donated to him [01:22] dead in 2s with killer.c [01:22] just tried it from a fresh boot [01:23] hmm, sounds good, in that case, I 'just' missed something important ;) [01:24] box is still alive but nothing is running in user space [01:24] and the nmi watchdog hasnt panic'd the box yet [01:25] what do you mean with still alive? [01:25] network is responding [01:25] ping coming back, tcp negotiating [01:25] okay, let me suggest, to go back to 2.4.23-vs1.3.2 to limit the possibilities ... [01:26] ok [01:26] have you tried killer.c on your smp? [01:26] nope, not yet ... [01:27] Bertl, the fact that it is responding on the network would lead me to believe that it is a semaphore that is being deadlocked on and not spinlocking? [01:28] not that it makes a difference or anything, just something im wondering about [01:28] could be, but vserver doesn't use semaphores, or am I wrong? [01:28] netrose (~john877@CC3-24.171.21.47.charter-stl.com) left irc: Ping timeout: 480 seconds [01:28] Action: Bertl is searching the code ... [01:29] how is task_lock and such implemented? [01:29] Action: nathan_ looks as well [01:29] yeah, task_lock is semaphore, right ... [01:30] it is still running something because the nmi watchdog should have it panic'd and rebooted by now [01:31] well, it is a fork bomb ... so maybe it is working as expected ... [01:31] hmm [01:31] Action: nathan_ looks at that code a little more closely [01:31] ill punch myself if ive just wasted your time on a trivial case [01:32] hey, no problem, if so I'm glad .. if not we find the issue and fix it ... [01:32] sigh i missed an extra fork() in there, you are definitely right. [01:33] maybe a recursion/fork limit would be a good idea ;) [01:33] yes id say so :) [01:34] lets pretend this never happened and ill go back to my original tests that were crashing 1.3.1 and 1.2.22 [01:34] okay, but try with 2.4.23-vs1.3.2 first ... [01:34] so just vanilla 2.4.23 with 1.3.2? [01:34] that code is based on the 'solid' scheduler [01:35] the ck1 has the O(1) scheduler ... [01:35] yea i saw. ok will do it vanilla + the 1.3.2 [01:36] http://vserver.13thfloor.at/Experimental/patch-2.4.23-vs1.3.2.diff [01:36] k [01:45] http://vserver.13thfloor.at/Experimental/killer-fix.diff [01:45] (untested, but might work ;) [03:15] morning [03:15] morning! [03:20] netrose (~john877@CC3-24.171.21.47.charter-stl.com) joined #vserver. [03:20] Bertl: where are the already existing chctx/lsctx tools? [03:21] those are patches to the ext2 tools .. sec I'll get you the url .. [03:21] http://vserver.13thfloor.at/Experimental/patch-e2fsprogs-1.34-cti0.01.diff.bz2 [03:22] the name of the IOCTL has changed since ... have a look at the kernel patches ... [03:35] Bertl, got a null pointer oops with /proc/pid/status [03:35] hmm .. ksymoops? [03:35] yep running now [03:36] >>EIP; c017c53d <===== [03:36] are there any 'suspicious' error messages in the logfile? [03:37] Bertl, nope nothing abnormal [03:37] hmm, I thought one of the vx_verify checks could have hit ... [03:37] Bertl, was .1 more verbose? [03:38] might have been extraneous i just remember some debugging [03:38] hmm, yes, you can enable debugging with VX_DEBUG ... [03:38] but the vx_verify hits will be reported regardless of that .. [03:39] ah [03:39] let me look how the error message shiould look like .. [03:40] 'vx bad assumption' ... [03:40] negative [03:40] well, I take that as a good sign, so my assumptions where correct ;) [03:51] nathan: did you compile ther kernel with debug info, as suggested? [03:58] I am trying to minimize the size of a vserver. Do I need /lib/modules/* inside a vserve ? [03:58] nope [03:58] Hi Bertl. then they are gone :) [03:59] Also modutils package. Destroy it if you still didn't. :) [04:00] where are they ? [04:01] What packagin system is that? RPM? [04:01] yes [04:02] rpm -e modutils :) [04:03] i am deleting dirs from outside the vserver. DO I need /usr/share/locale/* ? can I keep just en? [04:03] just to give you an idea of the lower bound ... [04:03] # du -skx /vservers/NAME/ [04:03] 81428 /vservers/NAME [04:03] (this is mandrake 8.2 based) [04:04] He-he. I had one around 40mb or so. [04:04] rpm? cool! [04:04] Simon (~sgarner@apollo.quattro.net.nz) joined #vserver. [04:04] No-no. With debian for sure. [04:05] ah, yes with debian you can go lower ... [04:05] a standard install or did you remove dirs by hand ? [04:05] Not dirs but packages. [04:05] By removing modutils, console-tools, manpages etc. [04:06] i c [04:06] ah yes, this is without any manual removal, just package selection ... [04:06] This package selection ended with absent /etc/protocols. That was fun. :) [04:07] nathan_: still around? [04:15] kestrel (~athomas@dialup51.optus.net.au) left irc: Ping timeout: 480 seconds [04:25] Bertl, yea [04:25] did you compile with -g? [04:26] doing it now [04:26] forgot to do a make modules [04:26] okay, you didn't change the config did you? [04:26] nope [04:27] good, use addr2line on the new built kernel to get me the file/line for the oops above, please ... [04:27] yep [04:46] hmm well VX_DEBUG wasnt the hottest idea [04:46] that's why it is off by default ;) [04:47] rebuilding :-/ [04:52] Tamama (~Tamama@a62-216-20-152.adsl.cistron.nl) joined #vserver. [05:14] netrose (~john877@CC3-24.171.21.47.charter-stl.com) left irc: Ping timeout: 480 seconds [05:21] Tamama (~Tamama@a62-216-20-152.adsl.cistron.nl) left irc: Quit: one little two little three little piggies OINK! OINK! OINK! [05:36] ok well killer.c with a loop to 100 around start_ctx() killed the box [05:36] appears to be panic'd this time [05:37] could you ran the 'old' oops through ksymoops and use addr2line to fix the line? [05:37] yep lemme wait till the box comes back up [05:47] hmm what am i doing wrong here, addr2line is not giving me anything useful for any address [05:47] cmdline? [05:48] what are you typing? [05:48] addr2line -e /usr/src/linux/vmlinux c0205e98 [05:48] should be an acpi function [05:48] just giving me ??:0 [05:49] i pulled out that symbol address with nm [05:49] CFLAGS_KERNEL = [05:49] CFLAGS_KERNEL = -g ? [05:49] hmm I did it with HOSTCFLAGS [05:49] needs to be at CFLAGS_KERNEL? [05:50] host is where you compile it ... [05:51] ok im rebuilding it with -g on the CFLAGS_KERNEL [05:55] much better [05:56] >>EIP; c017c53d <===== [05:57] /usr/src/linux-2.4.23/fs/proc/array.c:155 [05:57] first one was array.c:135 [05:58] much better ;) [06:01] the first is a false positive, you seem to have changed something .. the second at 155 is valid .. [06:07] kestrel (~athomas@dialup51.optus.net.au) joined #vserver. [06:16] hmm not sure what ive changed between the builds [06:16] ill try to produce it again on the live kernel [06:25] nathan? [06:25] try this 'trivial' patch ... [06:25] --- fs/proc/array.c.origSun Dec 28 01:47:15 2003 [06:25] +++ fs/proc/array.cTue Dec 30 04:26:00 2003 [06:25] @@ -152,10 +152,12 @@ static inline char * task_state(struct t [06:25] pid_t ppid; [06:25] read_lock(&tasklist_lock); [06:25] ppid = p->p_opptr->pid; [06:26] +task_lock(p); [06:26] if (ppid != 0 [06:26] && current->vx_info [06:26] && current->vx_info->vx_initpid == ppid) [06:26] ppid = 1; [06:26] +task_unlock(p); [06:26] buffer += sprintf(buffer, [06:26] "State:\t%s\n" [06:26] "Tgid:\t%d\n" [06:26] g'night folks [06:26] night .. [06:26] Nick change: doener_aw -> doener_zzz [06:30] okay nathan, I have to go to sleep now .. please let me know if this 'stabilizes' vs1.3.2 for you, if not, send me the oopses you get, (ksymoopsed and addr2lined) via email ... (herbert@13thfloor.at) .. thanks for your time and testing ... [06:30] night everyone! [06:30] Nick change: Bertl -> Bertl_zZ [07:06] netrose (~john877@CC3-24.171.21.47.charter-stl.com) joined #vserver. [08:35] Simon (~sgarner@apollo.quattro.net.nz) left irc: Read error: Connection reset by peer [08:38] Simon (~sgarner@apollo.quattro.net.nz) joined #vserver. [10:33] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) joined #vserver. [10:40] noel- (~noel@p50859BE0.dip.t-dialin.net) left irc: Ping timeout: 493 seconds [11:45] Nick change: Bertl_zZ -> Bertl [11:45] morning everyone ... [11:45] nathan_: are you here? [11:47] <_Zoiah> Off-topic question: How do I change the ordering of the loading of kernel drivers? I have an ATARAID driver that depends on a SCSI driver (libata, for sata support) but IDE is initialized before SCSI if I load both in the kernel so it can't access the RAID array then yet. If I build the ATARAID as a module and load it later, it does wor. [11:48] hmm, why is that a problem? [11:48] I mean the ordering? [12:27] hmm ... _Zoiah? [12:47] Simon (~sgarner@apollo.quattro.net.nz) left irc: Quit: so long, and thanks for all the fish [13:20] serving (~serving@213.186.190.99) left irc: Ping timeout: 480 seconds [13:36] LL0rd (~dr@pD9507ED1.dip0.t-ipconnect.de) left irc: Read error: Connection reset by peer [14:16] _MedivhWrk (ck@netops.multimedia-centrum.de) joined #vserver. [14:16] MedivhWrk (ck@netops.multimedia-centrum.de) left irc: Read error: Connection reset by peer [14:16] Nick change: _MedivhWrk -> MedivhWrk [14:29] infowolfe (~infowolfe@pcp04891550pcs.frnkmd01.md.comcast.net) joined #vserver. [14:29] Bertl: ping? [14:29] pong! (or was it echo-reply?) [14:29] lol [14:29] pong works [14:29] did you have a nice holiday? [14:30] holiday? [14:30] ahh .. you mean xmas? [14:30] yah [14:30] lol [14:30] that non-denominational wintertime thing [14:30] yeah, had a good one ... [14:30] wonderful [14:30] and you? [14:30] ... i worked :-p [14:31] well, I worked too ... on vservers and it is still fun! [14:31] oooh, anyway, why i'm here... [14:31] Action: infowolfe is lamebrained today [14:31] could you point me in the right direction for some copy on write stuff? [14:32] as you probably know, i'm running gentoo vservers... [14:32] hmm, what do you ahve in mind? [14:33] I'd like to make a single skeleton that I keep updated with most system stuff (gcc, glibc, apache, etc etc etc) and then make configs and whatnot , things that are individual per vserver be part of each individual [14:34] how does gentoo packet management work? [14:34] s/packet/package/ [14:34] so i have one big boy that is the only one i need to keep updated and as long as i don't replace gcc/glibc/apache/php/postfix on the vservers and i restart them after the update, i don't use the same diskspace 5 times (yah, i'm up to 5 now :-p) [14:35] well, the backend is python... (/var would be one of the things that is non-replicating) [14:35] hmm, the question is, are you the sole maintainer of those vservers? [14:35] yes [14:36] okay, so whatever you do/plan, it won't interfere with other administrators ... [14:36] nope [14:36] i'm the only one with root on my machine [14:37] i give redhat vservers to people i don't care about and keep gentoo to myself (debian is the next new distro and that's for my boss) [14:37] in that case, I would suggest to use unification and to do updates in all servers at once ... [14:37] umm [14:37] but that doesn't help the disk space problem [14:37] sure it does ... [14:37] because i'm keeping about 2 GB of data for each vserver [14:37] (afaik) [14:37] magic word was 'unification' [14:37] ok [14:38] ... so how does that work :-p [14:38] I don't know how the gentoo package system works, but basically all you ahve to do is: install a apckage in all vserver, then (somehow) get a list of 'installed' files (for that package) ... [14:39] easy enough [14:39] and then do the unification for the files, which are not config or log files ... [14:39] or to be more precise, the files that won't change till the next update [14:40] (and wont differ between the vservers ;) [14:40] ok [14:40] what do i do with that list? [14:40] you replace each file, in all vservers, with a hardlink to the first one ... [14:41] lol, ouch [14:41] and modify the flags, setting immutable and ili ... [14:41] that is called unification ... [14:41] what's illi? [14:41] or ili [14:41] ILI = Immutable Link(age) Invert [14:42] ah [14:42] so it's time for me to read the vuinfy script? :-p [14:42] Immutable files are not modify/deletable in vservers [14:42] but the ILI flag weakens that, so it can be deleted. [14:42] ok [14:43] how do i set these flags? [14:43] and is there a way for me to hardlink across filesystems? or no [14:43] there is a tool which does that (in the (util-)vserver package) [14:43] no hardlinks won't work across filesystems ... [14:43] ooooh [14:43] hrm... [14:44] here's a scenario [14:44] I update apache due to some flaw allowing it to be used as a bouncing board to fry cpus or something ludicrous... [14:44] inside the skeleton [14:45] yet the old version that is still vulnerable is still running in memory on the vservers... how do i update the vservers to allow me to restart apache without taking the vservers down [14:45] I have to leave in a few minutes, (for lunch), are you here in, let's say 20minutes? [14:46] yup [14:46] okay, we discuss this then, right? [14:46] ok [14:46] thanks [14:47] Nick change: Bertl -> Bertl_oO [14:51] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) got netsplit. [15:02] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) got lost in the net-split. [15:12] serving (~serving@213.186.190.138) joined #vserver. [15:22] Madkiss (madkiss@madkiss.org) left irc: Remote host closed the connection [15:24] Nick change: Bertl_oO -> Bertl [15:25] infowolfe: okay, I'm back ... [15:27] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) joined #vserver. [15:27] Topic changed on #vserver by !unununium.oftc.net: http://linux-vserver.org/ || latest stable 1.22, devel 1.3.1 "Revolutions". [15:32] yay [15:32] sorry, didn't see ya blinkin [15:32] np [15:32] ok, as we speak i'm building the skeleton [15:33] which is turning out to be very easy (i love gentoo :-p) [15:34] good ... [15:34] how do you update sshd for example? [15:34] it's pretty easy to set and forget [15:34] update sshd? :-p [15:34] just an example .. [15:34] emerge -u openssh [15:34] and what does emerge -u openssh do, in detail? [15:34] to install it'd be just plain emerge openssh and to uninstall emerge unmerge openssh [15:35] the way gentoo' [15:35] fine, what does it do? [15:36] the way gentoo's portage system works... in a nutshell, it's got a python backend that parses an ebuild (plain text) file [15:36] the text file name defines the package's version and name [15:36] okay, okay, spare me the bloody gentoo details ;) what happens with the files? configs? etc? [15:37] the ebuild file allows portage to build the package from source with on the fly dependancy checking/satisfaction, configuration options and more [15:38] so it is built where? [15:38] within the skeleton server? [15:38] portage does the builds in /var/tmp/portage/packagename-version/ installs all files there and then syncs that with the live system and then removes any files that weren't replaced [15:39] (but are not needed anymore, like /usr/share entries) [15:39] i'm going to be doing the builds in a skeleton that i'm setting up right now [15:39] the places i will NOT need to unify: /etc /home /var [15:39] okay, so basically you have a 'before' update, and an 'after' update version of the skeleton, right? [15:39] nothing that is per-vserver specific goes anywhere else [15:40] yah [15:40] i think [15:40] the skeleton is the one i'd like to unify everything else against [15:41] good .. so what I would do, in your case (if I don't have a good package system like RPM ;) ... [15:41] when the skeleton is updated, (let's say for glibc update or gcc update) there will be a ton of files changed [15:41] good? rpm? same sentence? you're insane! [15:41] Action: infowolfe laughs [15:41] I would run a comparison between 'pre' and 'post' versions ... [15:42] alright... [15:42] and try to figure out, which files are new and not server specific ... [15:42] let's say I make a skeleton for redhat 9 [15:42] sorry RPM provides all this information in each package ... [15:43] there is no need to do that for RH9 [15:43] i can get the same information via qpkg -i [15:43] except my host doesn't have rpm installed NOR will I install it [15:43] mhepp (~mhepp@r72s22p13.home.nbox.cz) joined #vserver. [15:43] what does qpkg give you? [15:43] give me a moment [15:44] i'll give you a link of apache for example [15:44] http://thor.hardcore-linux.net/apache.txt [15:45] (it'd be best to wget that and less it) [15:45] hmm, might be colorful, but what is what? [15:45] it's a listing of every file installed by the apache package by default [15:45] and full path :-D [15:45] the top 2 lines can be snipped [15:45] yeah, but no info about the file type ... means (is a config file, is a executable binary, etc) [15:45] the first line is packagename [15:46] well, if it's in /etc it's config [15:46] and if it's in /var then it's ? [15:46] if it's /var/ it's data [15:46] unifyable or not? [15:46] not [15:47] okay, then you have to setup your own 'rules' to decide that ... [15:47] the stuff in /var is like /var/lib/mysql /var/www//htdocs, etc etc [15:47] if you can do that, fine ... [15:47] do you know of any documentation that will allow me to use a find to do the unification? [15:47] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) got netsplit. [15:47] no, but it is quite simple to do the unification ... [15:48] like find {/usr,/bin,/sbin,/lib} and then unify everything that matches? [15:48] I wouldn't do that ... [15:48] so it'd be easy for me to setup via bash [15:48] why not? I won't be updating anything *inside* the vservers except configuration files and data [15:49] sure, and then you decide not to update server 3 and the next time you 'unify' bad things happen ;) [15:49] it wouldn't happen [15:49] I would advice the following procedere ... [15:49] i'm looking to script it [15:49] and run via cron once a week [15:50] not useful .. as it is best 'event' triggered .. [15:50] ok [15:50] a) you update the skeleton [15:50] b) you get a list of _all_ changed files [15:51] c) you get a list of all unifyable files (subset) [15:51] ok [15:51] so what would i use to compare changed vs unchanged? [15:51] 2 skeletons? [15:51] d) you remove one file after the other and either replace it by a copy or hardlink to the skeleton [15:51] mhepp (~mhepp@r72s22p13.home.nbox.cz) left irc: Quit: Tak ja padaaaaM [15:52] (for each vserver) [15:52] e) for all unified files after all hardlinks are done, the IMMUTABLE and ILI flags are set [15:52] f) you are done. [15:52] alright... [15:53] the question stands, how do i get a "snapshot" of what i had before the update? [15:53] make a copy ... if possible (with gentoo) a shallow one ... [15:53] shallow? [15:53] cp -la [15:54] if gentoo's build system f** up the shallow copy, make a real one ... [15:54] that would all be done outside the chroot though... only the actual update would be done inside the chroot [15:55] does a shallow copy keep track of filesizes and modified times? [15:55] you could also put the skeleton on a lvm volume, and make a snapshot, but then you have to keep it ununified ... [15:55] man cp [15:56] beautiful [15:56] so cp -la skeleton snapshot [15:56] when the skeleton is happy... [15:57] and btw, i doubt gentoo's build system would f*** up the shallow copy :-D [15:57] you have to verify that the gentoo build system doesn't modify the files ... [15:57] doesn't modify which files, the shallow copy? [15:57] any files [15:57] it can remove and replace them, but not modify! [15:57] oh... lol [15:57] i can say honestly that it doesn't [15:58] noel_ (~noel@pD9FFA7E6.dip.t-dialin.net) got lost in the net-split. [15:58] RPM does this and other package systems (tar based) too ... [15:58] (not modifying, only replacing) [15:58] like i said, it doesn't modify anything... [15:58] it'll remove if outdated [15:59] let's say i emerge -u apache and it WAS 2.0.45 and is going now to 2.0.48 [15:59] okay, fine .. if unsure, I would check with a glibc update, and a deep copy to compare with ;) [15:59] how would you verify? simple ... [15:59] it'll REPLACE every file that both packages have in common and REMOVE any files unused by the new package [15:59] a) make a shallow copy [16:00] b) make a deep copy [16:00] c) update skeleton [16:00] compare with [16:00] if there is _any_ difference, shallow copies won't work reliable ... [16:01] give me a second [16:01] doing the shallow copy now [16:02] and then i can force a re-emerge of gentoolkit and test for you right now [16:02] ok [16:02] i have a shallow copy made [16:02] b) make a deep copy [16:03] lol [16:04] hmm, reemerge will install the same stuff, right? [16:04] yup [16:04] then you have to test the folowing: [16:04] it's actually emerge and that'll force it to recompile and reinstall and should change the timestamps by replacement [16:05] to make the deep, will cp -a work? [16:05] a) are there any 'newer' timestamps in than in [16:05] yes, it will ... [16:05] creating deep [16:06] so i'll be doing vfiles shallow deep ? [16:06] what is vfiles? [16:07] part of util-vserver? [16:07] riel (~riel@riel.netop.oftc.net) got netsplit. [16:07] ccooke (~ccooke@80.1.164.238) got netsplit. [16:07] MrBawb (abob@swordfish.drown.org) got netsplit. [16:07] MrBawb (abob@swordfish.drown.org) returned to #vserver. [16:07] ccooke (~ccooke@80.1.164.238) returned to #vserver. [16:07] i'm not even sure what exactly i'm supposed to use to do the checking/unification [16:07] phil_ (~phil@20.22.204.212.sr2.DTM1.ip.versanet.de) joined #vserver. [16:07] riel (~riel@imladris.surriel.com) joined #vserver. [16:07] hiho [16:07] hi phil_ [16:07] well, checking could be com/diff [16:08] com? [16:08] but I would suggest a ls -lR and some scripted comparison ... [16:08] compare? [16:08] awk or something like that ... [16:08] so like ls -lR shallow > file1 and ls -lR deep > file2 and diff file1 file2? [16:09] i'm no good with awk :( [16:09] okay, any chance to actually 'upgrade' a large package, or downgrade? [16:10] mornning all. I am still cleanning :) Nothing in /boot is needed for a vserver operations ? [16:10] erm, i wanted to know if there are any example configs for this project. /usr/lib/vserver/sample.conf doesn't exist :/ [16:10] i've already re-emerged gentoolkit [16:10] i can re-do apache if you'd like [16:10] okay phil, hi! [16:10] hi [16:11] what did you do, and what do you want to achieve ... [16:11] infowolfe: try to up/downgrade glibc or something like that ... [16:11] doh [16:11] ... are you serious? that'll take quite a while... it is a FULL build of glibc [16:11] i found a sample.conf [16:11] phil: you downloaded what? installed what? [16:11] was just in another directory [16:12] not in the one, described in the tutorial [16:12] give us a chance to help you ;) [16:12] no, my problem is fixed, at the moment ;) [16:13] but, i guess i'll have some more questions later *g* [16:13] okay, what version? [16:13] at least let us know the version ;) [16:15] i used patch-2.4.23-vs1.22.diff to patch my kernel [16:15] so it seems to be 1.22 :) [16:15] okay, thanks ... [16:15] so can I delete all files in /boot ? Nothing in /boot is needed for a vserver operations ? [16:16] infowolfe: well, it doesn't have to be glibc, but something with config files and such stuff ... [16:16] Bertl, no problem, i'll do apache [16:17] but an upgrade please ... [16:17] serving: nope boot isn't required ... [16:17] no problem [16:19] Bertl, is dropping a version ok? like apache 2.0.47 as the new one with the old being 2.0.48? [16:20] yeah sure ... [16:24] Doener_aw (~doener@pD9E12B01.dip.t-dialin.net) joined #vserver. [16:25] bertl [16:26] yup? [16:26] how should i check for differences? [16:26] the names are skel, shallow and deep [16:27] diff -Nur shallow deep [16:28] is it good that it's telling me that there's no such file or directory? [16:28] or bad? [16:28] diff: shallow/etc/apache2/lib/ccache/bin/c++: No such file or directory [16:28] diff: deep/etc/apache2/lib/ccache/bin/c++: No such file or directory [16:30] guess you can ignore those messages, they are 'broken' links ... [16:31] lol [16:31] the only diff i've got so far is in /root/.bash_history :-p [16:32] hmm, okay but that _is_ such a case ... [16:32] so i'm getting broken link output and that's it [16:32] doener_zzz (~doener@pD9E127D7.dip.t-dialin.net) left irc: Ping timeout: 512 seconds [16:32] where a file is being modified instead of replaced? [16:32] it's in /usr/right now [16:33] it's probably safe to use shallow copies, but I can't guarantee it ... [16:33] it seems to be just fine to me... [16:35] I can email the output to you if you'd like [16:35] no, thank you ;) [16:36] didn't think you'd wanna see it [16:36] ok, so i'm convinced shallow copying works [16:36] i trust your judgement ... ;) [16:36] ok [16:37] so what's the next step? [16:37] whoa... [16:37] i just had a ... oh :-p [16:37] no wonder :-p [16:37] i left distribution files in /usr :-p [16:38] i'm convinced it works [16:38] since i don't want it to pay any attention to /var anyway... [16:39] and i'll be auto-cleaning my distfiles it's not an issue [16:49] mmkay, so Bertl, how do i set the correct bits on the hardlinks i'll be making? [16:55] noel (~noel@pD9FFA7E6.dip.t-dialin.net) joined #vserver. [16:56] hello? [16:57] Action: infowolfe listens for echo [16:57] Action: infowolfe hears the echo... [17:02] hmm, .. what do you need now? [17:02] :-\ [17:02] I got the impression, you where 'just' explaining what you do ... [17:02] i was trying to ask what tool is used to set the flags required for copy-on-write to work [17:03] ahh okay, missed that one ... [17:03] 01[08:50] mmkay, so Bertl, how do i set the correct bits on the hardlinks i'll be making? [17:03] I don't read black ... [17:03] no worries... and i do appreciate huh? [17:03] i do appreciate the help [17:03] what do you mean you don't read black? [17:04] are my colors messed up again? [17:04] not yet .. but in the copy/paste ... [17:04] ah.. sorry [17:05] setattr/showattr ... [17:05] :-\ what're those a part of? [17:07] odd [17:07] nevermind [17:07] that was a case of something being put where it wasn't expected... [17:07] Bertl, have a look at the query ;) [17:07] it's in /usr/lib/util-vserver/showattr [17:11] thanks a lot for your help Bertl, and i'm sorry to be a pain... [17:12] no you ar no pain, I'm busy and tired, that's all ... [17:12] you are welcome, and do not hesitate to ask me if you need something ... [17:12] 2.6 support! :-p [17:12] lol [17:13] http://vserver.13thfloor.at/Experimental/patch-2.6.0-test11-vs0.02.diff [17:13] Bertl, will that patch cleanly against 2.6.0? [17:13] [17:13] yup [17:13] oooh.... [17:13] Action: infowolfe runs off to go compile it [17:14] but no network and such stuff yet ... [17:14] Action: infowolfe blinks... [17:14] can you be more specific? [17:14] well, context separation is there, that's it ;) [17:14] no network would be very bad on a router :-p [17:15] what kind of testing do you need with it bertl? [17:15] no would be always bad on 'z' ;) [17:15] hmm, I don't know ... will be superceeded soon, but you asked for 2.6.x support, so you got what I have atm ;) [17:16] lol [17:16] well, if you can give me a better idea what you need tested, i can do a better job of testing it for you :-D [17:16] hmm, okay, what are you willing to test? [17:17] anything that doesn't set my box on fire or make my SNAT not work anymore :-p [17:18] ... as long as gentoo is ok as the host distro :-p [17:18] hmm, looks like a candidate for the quota stuff, then ... ;) [17:18] i've already got the util-vserver 0.26 tools on it [17:18] well, i do have some extra diskspace :-p [17:18] you just tell me what to do and how you want it broke and i'll try my hardest to give you good bug reports [17:19] there should be a preliminary howto, done by somebody I do not remember now ... describing the quota stuff (installation) [17:19] okie... which kernel? [17:19] 2.4.23 vs1.22 stable [17:20] alrighty [17:20] http://www.13thfloor.at/vserver/s_addons/quota/patch-2.4.22-vs1.20-q0.12.diff [17:20] it seems that some people managed to hang the quotaoff with this ... [17:21] I'm unable to reproduce or get any task traces ... [17:21] forgive me for not being a programmer :-p [17:21] task traces? [17:21] so testing/crashing this would be really useful ... [17:21] strace it and break it? [17:22] what you do with the kernel is the following: [17:22] a) enable magic sysreq [17:22] b) compile with CFLAGS_KERNEL = -g [17:23] c) try to get a 'ahnging' quotaoff [17:23] c) try to get a 'hanging' quotaoff [17:23] d) activate magic sysreq (howto in /usr/src/linux/Documentation) [17:23] e) produce nice task/kernel traces with SYSREQ-P [17:24] umm, did you want me to use the 2.4.22 quotapatch on a 2.4.23 kernel? [17:24] hmm, no sorry ... [17:24] http://www.13thfloor.at/vserver/s_addons/quota/patch-2.4.23-vs1.20-q0.12.diff [17:25] so vs1.20 quota on vs1.22? [17:25] f) run the output through ksymoops [17:25] g) postprocess the actual locations with addr2line [17:25] yes, 1.20 quota is the latest ... [17:26] it was clean so i assume that's the one :-p [17:26] is there an e) ? [17:27] 15:25 < Bertl> e) produce nice task/kernel traces with SYSREQ-P [17:27] sorry :-p [17:27] is there an h? [17:27] hmm, just because I stumbled over it, you know the VSkel stuff? [17:27] not really [17:28] hmm, maybe you should have a tiny peek at linux-vserver.org then ;) [17:28] lol [17:29] oooh, pretty [17:29] Action: infowolfe looks impressed :-p [17:33] would you like me to test vskel or use it as a solution to my unification problem [17:34] the latter one ... if it is useful ;) [17:34] i'll take a look at it whilst i'm breaking your vquota :-D [17:37] Bertl, should i disregard the vroot stuff or is it tied in [17:37] vroot is in vs1.22 [17:38] i know, but should i do the quota thing with or without using vroot [17:38] fyi, release candidate for util-vserver 0.27 has been published at http://www-user.tu-chemnitz.de/~ensc/util-vserver/ When no issues will be found, this will become 0.27 in 3 hours [17:38] deadguy (deadguy@bananajoe.big.du.se) joined #vserver. [17:38] infowolfe: now that you ask, both would be a good idea [17:39] is there anything special i would need to do (besides follow the documentation?) [17:40] hmm, there is a documentation ;) [17:40] hmm, there is a documentation? ;) [17:40] i'm not sure, i don't really remember :-p [17:40] again, that's me assuming you've written documentation on this amazing and alluring beast you've created [17:42] Bertl, do you know where i'd find addr2line? [17:42] (packagename would do) [17:45] binutils-2.11.92.0.12-6mdk [17:46] for some reason, i had it but bash-completion wasn't finding it :-\ [17:47] hm, that's weird. tough i've installed util-vserver-0.26.tar.bz2 there is no /usr/sbin/newserver [17:47] phil_, what did you use for prefix/exec-prefix? [17:48] at the configure-script? [17:48] yah [17:48] there i had no prefix [17:48] enrico, could you explain how to 'clone' a vserver from the host system with util-vserver? [17:48] have you done an updatedb since you installed util-vserver? [17:48] yes, of course [17:49] locate newvserver? [17:49] Bertl: in the stable branch, this works as documented. 'vserver ... build' should do that [17:49] ahh, could you give phil an example ... please? [17:50] phil_: 'vserver foo build' [17:50] ;) [17:50] Bertl, i'll be getting a buddy of mine's testkit to beat the hell out of your quotas with :-D [17:51] it's known for making just about everything but ext3 blow chunks [17:52] (reiserfs, jfs, afs, cifs, nfs) surprisingly it doesn't hurt xfs too bad (just makes it replay the journal on next reboot for about a half hour :-D) [17:53] okay, go ahead, but the critical part is the quotaon/quotaoff .. for separate vservers and maybe on the host ... [17:53] i'll find a way to make it puke... somehow :-p [17:54] Bertl, have you ever seen a machine with 1000+ load avg on all 3 measurements via top? [17:54] or perhaps an hdparm -Tt output showing in kb/s? [17:55] they're part of my hardware test battery :-D [17:55] Action: infowolfe displays an evil grin [18:05] bertl, i'll email you when i break quotas [18:05] in the mean time i have to go fight alsa... later [18:05] infowolfe (~infowolfe@pcp04891550pcs.frnkmd01.md.comcast.net) left #vserver. [18:05] sladen (paul@starsky.19inch.net) joined #vserver. [18:06] Action: sladen pings Bertl [18:17] hey paul! [18:17] how was your xmas? [18:31] relaxing--I stayed away from the family! :-) [18:31] ahh .. okay .. why did you 'ping' me? [18:31] and popped down to a soup-run on Christmas day [18:32] well, I made the mistake of putting 1.22 on a production machine at 2am on Boxing day morning [18:32] that machine is giving two issues which I haven't tracked down [18:32] and? [18:33] a) all outgoing connections are binding to the default address of the machine (the rebinding based on the iproot doesn't seem to be taking effect) [18:33] the second one is localhost->localhost connection rewriting (same thing) [18:34] and all this worked with ctx what version? [18:34] *cough*. That machine was using ctx10. *cough* [18:35] hmm, well ctx10 had network support ;) [18:35] ;-) [18:35] anyway, having gone through the patch as it stands I haven't noticed anything [18:36] I guess, its a little 'changes' in 2.4.23 here and some 'helpers' in ctx-17 there ... [18:36] so basically the behaviour changed somewhat ... [18:37] I'll be back in a few minutes, we can discuss this ... [18:39] Action: sladen pops off to do something aswell [18:40] okay, back ... [18:41] I'll do that later [18:42] right... 2.4.22/vs1.21 I could probably use .23/1.22 but I grabbed what was handy for the Debian archive [18:43] kestrel (~athomas@dialup51.optus.net.au) left irc: Ping timeout: 480 seconds [18:43] hmm, there is a debian 2.4.23-1 [18:44] ah, couldn't find it on boxing day evening--and I'm not in a hurry to reboot this particular machine again :-( [18:45] okay .. so it's 2.4.22/vs1.21, right? [18:45] (although I might downgrade it to the previous) [18:45] stuff in /proc is fine [18:45] http://www.linux-vserver.org/index.php?page=ChangeLog [18:46] smp machine? [18:47] nope, but SMP compiled in (I vagely remember so issue in the past with uni-processor kernels on that machine) [18:47] vs1.21 * small bugfix in tcp_ipv4_addr_conflict() [18:48] yes, that was a copy paste issue, 1.00 behaviour was restored ... [18:48] do you still have incremental diffs aroun? [18:48] do you still _not_ read my pages? ;) [18:48] http://www.13thfloor.at/vserver/s_release/delta/ [18:55] Action: sladen reads vs1.20->vs1.21 [18:57] ah, you've changed the schmantics of sys_reboot too--a vserver with CAP_SYS_REBOOT can't reboot the machine... (eg, running a watchdog in a vserver) [18:59] yup [19:02] bits of IPv6 are compiled in (but only as modules, and not loaded), I remember that messed up some stuff before (binding of anything that was IPv6 aware) [19:02] yup [19:04] do you have a test machine handy. [I don't] [19:04] well, I use QEMU and vs1.22 patchset is installed on one of my servers ... [19:07] hadn't come across QEMU.. [19:13] mailed you the .config [19:13] hmm, and what I'm supposed to do with it? [19:15] use it for reference; or if you have $time compile it up and see if you can reproduce the errata [19:15] hmm, have I missed the mis-behaviour? [19:16] diff it against something you have handy (eg, CONFIG_SMP is on etc) [19:16] I guess so [19:16] okay, maybe you start with explaining the setup, and how you try to configure routing/etc? [19:16] "all outgoing connections fail to be rewritten with the correct ipv4root" [19:16] vs1.22 works on multi homed vservers for me (FYI) [19:17] there is nothing unusual about the routeing, there is a single ethernet device and no netfilters rules in use [19:17] each context has a *single* IP assigned [19:17] okay, and what happens? [19:17] "all outgoing connections fail to be rewritten with the correct ipv4root" [19:18] an out going connection (TCP or UDP) from any context is left as is [19:18] eg. vs01 -> remote is left as host -> remote [19:19] where "host" is the first IP assigned to the machine [19:19] eg. localhost -> localhost is left as localhost -> localhost (rather than localhost -> vs01) [19:19] okay, sec, I'll try to reproduce ... [19:21] vs01$ sudo tcpdump -n -i eth0 udp & nc -u remote.com 9999 [19:22] vs01$ sudo tcpdump -n -i lo & nc -s 127.0.0.1 127.0.0.1 telnet [19:22] vs01$ sudo tcpdump -n -i eth0 udp & nc remote.com 9999 [19:22] here: vs1.22 on 2.4.23 machine with 10 vserver, one with two ips ... [19:22] ssh outbound conenction uses the first ip assigned to the vserver ... [19:22] I'll try assigning two IPs to one context [19:23] and I'll try with a single ip vserver [19:24] works also on a single ip vserver ... for ssh ... [19:29] right, having assigned one context with two IPs, I now get localhost rewriting in another context [19:31] ah, not always: [19:31] $ sudo /usr/sbin/rndc reload [19:31] 16:32:08.432051 127.0.0.1.49987 > 127.0.0.1.953: SWE 183205867:183205867(0) win 32767 (DF) [19:31] 16:32:08.432081 127.0.0.1.953 > 127.0.0.1.49987: R 0:0(0) ack 183205868 win 0 (DF) [19:32] you should not use 127.0.0.1, did you assign ip 127.0.0.1 for this vserver? [19:33] ah, does lo/127.0.0.1 have to be explictly appened to the IP_ROOT="" list now? [19:33] if you want to use it in vserver, yes but as 127.0.0.1 pelase ... [19:37] I don't /want/ to use it; I want it to be rewritten by anything trying to use it [19:37] hmm, sounds good, but isn't implemented ... [19:39] eg, the [19:39] + if (dst == 0x0100007f && !vx_check(0, VX_ADMIN)) [19:39] + dst = ipv4root; [19:40] yes, this works, but only for the one ip case ... [19:40] which is what that code is /supposed/ to be doing [19:40] but these are only one-ip cases [19:40] well, and this works for me, on my server ... but not for you, right? [19:41] lp (~lpressl@interner.SerNet.DE) joined #vserver. [19:41] indeed :-) [19:42] so how do we build a reproducible test case out of this? [19:43] start with the config file [19:44] hmm, and what? replace my entire vserver host? [19:44] just to find out, that it works too ... [19:44] the think is, that is doesn't work here [19:45] okay, turn that around. How can I more effectively track down what is not working [19:45] hmm, maybe build up a testcase with chbind only? [19:45] all a vserver does is chbind, regarding the network ... [19:46] so get a clean system, without any ip/routing config and no other vservers ... [19:46] then try with chbind --ip /bin/bash [19:47] and see if you can produce a case where tcpdump gives ip != [19:52] dasboot:~# chbind --ip 212.13.202.29 telnet starsky.19inch.net 9999 [19:52] ipv4root is now 212.13.202.29 [19:52] Trying 80.1.73.116... [19:52] 16:52:33.419516 194.153.168.125.50034 > 80.1.73.116.9999: [19:53] the tcpdump is running where? [19:53] on starsky.19inch.net (the remote machine) [19:54] and 212.13.202.29 does exist as alias or local address? [19:56] it wasn't (D'oh). But doing so: [19:56] dasboot:~# /sbin/ifconfig | grep -A1 test [19:56] eth0:test Link encap:Ethernet HWaddr 00:20:ED:2E:F4:D2 [19:56] inet addr:212.13.202.29 Bcast:212.13.202.255 Mask:255.255.255.0 [19:56] makes no difference; [19:56] dasboot:~# chbind --ip 212.13.202.29 telnet starsky.19inch.net 9999 [19:56] ipv4root is now 212.13.202.29 [19:56] Trying 80.1.73.116... [19:56] chbind --ip 212.16.62.51 telnet vstest.13thfloor.at [19:57] 17:58:00.978197 0:1:30:b7:30:40 0:4:75:aa:23:98 0800 74: 212.16.62.51.58816 > 216.114.78.253.23: [19:57] chbind --ip 212.16.62.52 telnet vstest.13thfloor.at [19:57] 17:58:25.868273 0:1:30:b7:30:40 0:4:75:aa:23:98 0800 74: 212.16.62.52.58817 > 216.114.78.253.23: [19:57] which is what should be happening... [19:57] well, which is what happens here ... [19:57] okay, just out of curiosity, what compiler? which version? [19:58] dasboot:~# gcc -v [19:58] Reading specs from /usr/lib/gcc-lib/i386-linux/2.95.4/specs [19:58] gcc version 2.95.4 20011002 (Debian prerelease) [19:58] hmm, well what gives uname -a [19:59] Linux dasboot 2.4.22 #1 SMP Sat Dec 27 01:11:15 GMT 2003 i686 unknown [19:59] hmm, interesting ... [19:59] no extension? no -vs1.21 ? [20:02] no, the EXTRA_VERSION appears to have been removed from all the debian patches--presumable to stop rejects, but that's only a guess [20:03] hmm, okay so you are using a debian patched kernel .. ahhh ... [20:03] those are experimental and not tested, at least not by me ... [20:06] you probably found a debian patch bug .. I'll ahve a look at it .. but not today ... [20:07] Nick change: Doener_aw -> Doener [20:07] hi [20:12] phil__ (~phil@20.22.204.212.sr2.DTM1.ip.versanet.de) joined #vserver. [20:14] I'll wait to the early hours, take that machine down and see what combinations still cause it [20:15] phil_ (~phil@20.22.204.212.sr2.DTM1.ip.versanet.de) left irc: Ping timeout: 480 seconds [20:15] sladen: I would suggest to use 2.4.23-vs1.22 for now .. no debian ... [20:16] until this is resolved ... [20:18] !praise lights out cards [20:19] how much work is getting QEMU to work? [20:19] hey, I said to the debian guy, he has to test it .. he said, he tested it, and _everything_ works perfect ... [20:20] Action: sladen still not decided either way--the testing shall tell... [20:28] Action: sladen grumbles at qemu for building out of the box [20:31] hehe [20:33] s/building/not building/ [20:34] sadly... :( [20:35] mcp (~hightower@wolk-project.de) left irc: Ping timeout: 512 seconds [20:35] kestrel (~athomas@dialup51.optus.net.au) joined #vserver. [20:44] mcp (~hightower@wolk-project.de) joined #vserver. [20:51] okay, cu l8er ... [20:51] Nick change: Bertl -> Bertl_oO [20:52] tanjix (ViRu_@pD904A808.dip.t-dialin.net) joined #vserver. [21:36] shuri (~ipv6@pornstar.at.blacktaboovideo.com) joined #vserver. [21:37] shuri (~ipv6@pornstar.at.blacktaboovideo.com) left irc: Remote host closed the connection [21:54] tanjix (ViRu_@pD904A808.dip.t-dialin.net) left irc: [22:00] hey bert [22:00] you around? [22:00] it was, earlier [22:00] and has answered email since [22:01] ah [22:22] hm [22:22] phil__ (~phil@20.22.204.212.sr2.DTM1.ip.versanet.de) left irc: Quit: Client Exiting [22:22] phil_ (~phil@20.22.204.212.sr2.DTM1.ip.versanet.de) joined #vserver. [22:25] Can't set the ipv4 root (Function not implemented) [22:25] <- what does this mean? [22:25] i just set eth0 to: IPROOT="eth0:192.7.20.1" [22:25] why doesn't this work? [22:26] you're probably using a newer versino of the kernel patches than the userspace tools [22:26] (eg, vs1.xx patches and 0.25 tools or something) [23:53] netrose (~john877@CC3-24.171.21.47.charter-stl.com) left irc: Ping timeout: 480 seconds [00:00] --- Wed Dec 31 2003