About this list Date view Thread view Subject view Author view Attachment view

From: Jan Kara (jack_at_suse.cz)
Date: Thu 21 Oct 2004 - 10:12:06 BST


  Hello,

> * Herbert Poetzl (herbert_at_13thfloor.at) wrote:
> > On Tue, Oct 19, 2004 at 06:01:00PM -0400, Stephen Frost wrote:
> > > Assertion failure in log_do_checkpoint() at fs/jbd/checkpoint.c:361:
> > > "drop_count != 0 || cleanup_ret != 0"
> >
> > you can split up this assertion into
> >
> > - drop_count != 0
> > - cleanup_ret != 0
> >
> > and fail on that (or just output those values
> > before you panic) ... this might give some
> > deeper insight into the issue ...
>
> Hmm, that's a good thought, though I have to say I'd really like to get
> a comment from the ext3 folks. This is also a production server, so I'd
> kind of like to minimize the downtime. :)
>
  I've been looking through the code and I think there might be a
following race (but it looks unlikely):
   Proc 1 Proc 2
 log_do_checkpoint()
   scans the list for buffers to flush
     and flushes everything
   scan again and throw out flushed buffers
                                                        lock_bh_state()
   on the last buffer fails jbd_trylock_bh_state()
     so we retry
                                                        unlock_bh_state()
                                                        lock_buffer()
   scanning again but now buffer is buffer_locked()
     so we cannot throw it out
                                                        mark_buffer_jbddirty()
                                                        unlock_buffer()
   __cleanup_transaction() called
     It finds nothing wrong with the buffer (and
       there is only one) => return 0
   So we have drop_count==0, cleanup_ret==0 => assertion failure

   But in this case IMHO nothing bad happened so maybe the assertion is
just the problem but probably someone with more knowledge of this code should
decide (that's why I CC'd Andrew ;).

                                                                Honza

-- 
Jan Kara <jack_at_suse.cz>
SuSE CR Labs
_______________________________________________
Vserver mailing list
Vserver_at_list.linux-vserver.org
http://list.linux-vserver.org/mailman/listinfo/vserver


About this list Date view Thread view Subject view Author view Attachment view
[Next/Previous Months] [Main vserver Project Homepage] [Howto Subscribe/Unsubscribe] [Paul Sladen's vserver stuff]
Generated on Thu 21 Oct 2004 - 10:22:30 BST by hypermail 2.1.3