Message ID | 1326219175-4529-1-git-send-email-josef@redhat.com |
---|---|
State | New, archived |
Headers | show |
On Tue 10-01-12 13:12:55, Josef Bacik wrote: > If we are journalling data (ie journal=data or big symlinks) we can discard > buffers and move them to different transactions to make sure they get cleaned up > properly. The problem is b_modified could still be set from the last > transaction that touched it, so putting it on the currently running transaction > or setting it up to be put on the next transaction will run into problems if the > buffer gets reused in that transaction as the space accounting logic won't be > done, which will result in panics at commit time because t_nr_buffers will end > up being more than t_outstanding_credits. Thanks to Jan Kara for pointing out > the other part of this problem a few months ago. Thanks, Ho hum, I'm inclined to apply this just because it makes sense. But I still don't see how a transaction can reuse a buffer from BJ_Forget list. We attach there only truncated buffers and their underlying block can be reallocated only after the transaction freeing them is committed. So have you some incentive that this patch indeed fixes the t_outstanding_credits assertion you were hunting? Honza > Signed-off-by: Josef Bacik <josef@redhat.com> > --- > fs/jbd/transaction.c | 5 ++++- > fs/jbd2/transaction.c | 5 ++++- > 2 files changed, 8 insertions(+), 2 deletions(-) > > diff --git a/fs/jbd/transaction.c b/fs/jbd/transaction.c > index 7e59c6e..c968788 100644 > --- a/fs/jbd/transaction.c > +++ b/fs/jbd/transaction.c > @@ -1784,6 +1784,7 @@ static int __dispose_buffer(struct journal_head *jh, transaction_t *transaction) > */ > clear_buffer_dirty(bh); > __journal_file_buffer(jh, transaction, BJ_Forget); > + jh->b_modified = 0; > may_free = 0; > } else { > JBUFFER_TRACE(jh, "on running transaction"); > @@ -1952,8 +1953,10 @@ static int journal_unmap_buffer(journal_t *journal, struct buffer_head *bh) > * clear dirty bits when it is done with the buffer. > */ > set_buffer_freed(bh); > - if (journal->j_running_transaction && buffer_jbddirty(bh)) > + if (journal->j_running_transaction && buffer_jbddirty(bh)) { > + jh->b_modified = 0; > jh->b_next_transaction = journal->j_running_transaction; > + } > journal_put_journal_head(jh); > spin_unlock(&journal->j_list_lock); > jbd_unlock_bh_state(bh); > diff --git a/fs/jbd2/transaction.c b/fs/jbd2/transaction.c > index a0e41a4..094dcd8 100644 > --- a/fs/jbd2/transaction.c > +++ b/fs/jbd2/transaction.c > @@ -1756,6 +1756,7 @@ static int __dispose_buffer(struct journal_head *jh, transaction_t *transaction) > * __journal_file_buffer > */ > clear_buffer_dirty(bh); > + jh->b_modified = 0; > __jbd2_journal_file_buffer(jh, transaction, BJ_Forget); > may_free = 0; > } else { > @@ -1917,8 +1918,10 @@ static int journal_unmap_buffer(journal_t *journal, struct buffer_head *bh) > * clear dirty bits when it is done with the buffer. > */ > set_buffer_freed(bh); > - if (journal->j_running_transaction && buffer_jbddirty(bh)) > + if (journal->j_running_transaction && buffer_jbddirty(bh)) { > + jh->b_modified = 0; > jh->b_next_transaction = journal->j_running_transaction; > + } > jbd2_journal_put_journal_head(jh); > spin_unlock(&journal->j_list_lock); > jbd_unlock_bh_state(bh); > -- > 1.7.5.2 >
On Tue, Jan 10, 2012 at 09:17:06PM +0100, Jan Kara wrote: > On Tue 10-01-12 13:12:55, Josef Bacik wrote: > > If we are journalling data (ie journal=data or big symlinks) we can discard > > buffers and move them to different transactions to make sure they get cleaned up > > properly. The problem is b_modified could still be set from the last > > transaction that touched it, so putting it on the currently running transaction > > or setting it up to be put on the next transaction will run into problems if the > > buffer gets reused in that transaction as the space accounting logic won't be > > done, which will result in panics at commit time because t_nr_buffers will end > > up being more than t_outstanding_credits. Thanks to Jan Kara for pointing out > > the other part of this problem a few months ago. Thanks, > Ho hum, I'm inclined to apply this just because it makes sense. But I > still don't see how a transaction can reuse a buffer from BJ_Forget list. > We attach there only truncated buffers and their underlying block can be > reallocated only after the transaction freeing them is committed. So have > you some incentive that this patch indeed fixes the t_outstanding_credits > assertion you were hunting? > So more the problem is where we set b_next_transaction, since it could be reallocated in the next transaction after the current transaction commits and then we're really screwed. I have no real evidence to prove that this is causing my problem yet, but it's definitely wrong and I want to get it fixed before I forget it :). Thanks, Josef -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Tue 10-01-12 15:21:20, Josef Bacik wrote: > On Tue, Jan 10, 2012 at 09:17:06PM +0100, Jan Kara wrote: > > On Tue 10-01-12 13:12:55, Josef Bacik wrote: > > > If we are journalling data (ie journal=data or big symlinks) we can discard > > > buffers and move them to different transactions to make sure they get cleaned up > > > properly. The problem is b_modified could still be set from the last > > > transaction that touched it, so putting it on the currently running transaction > > > or setting it up to be put on the next transaction will run into problems if the > > > buffer gets reused in that transaction as the space accounting logic won't be > > > done, which will result in panics at commit time because t_nr_buffers will end > > > up being more than t_outstanding_credits. Thanks to Jan Kara for pointing out > > > the other part of this problem a few months ago. Thanks, > > Ho hum, I'm inclined to apply this just because it makes sense. But I > > still don't see how a transaction can reuse a buffer from BJ_Forget list. > > We attach there only truncated buffers and their underlying block can be > > reallocated only after the transaction freeing them is committed. So have > > you some incentive that this patch indeed fixes the t_outstanding_credits > > assertion you were hunting? > > So more the problem is where we set b_next_transaction, since it could be > reallocated in the next transaction after the current transaction commits and > then we're really screwed. I have no real evidence to prove that this is > causing my problem yet, but it's definitely wrong and I want to get it fixed > before I forget it :). I see. But journal_invalidatepage() is called before blocks are freed which means that the freeing of the block happens either in the running transaction (to which we set b_next_transaction) or even in the following one. And we also set buffer_freed() so the buffer should be filed to BJ_Forget list when it is refiled. I agree the logic is kind of fragile so there can be bug somewhere. Just I don't see it (yet). Honza
diff --git a/fs/jbd/transaction.c b/fs/jbd/transaction.c index 7e59c6e..c968788 100644 --- a/fs/jbd/transaction.c +++ b/fs/jbd/transaction.c @@ -1784,6 +1784,7 @@ static int __dispose_buffer(struct journal_head *jh, transaction_t *transaction) */ clear_buffer_dirty(bh); __journal_file_buffer(jh, transaction, BJ_Forget); + jh->b_modified = 0; may_free = 0; } else { JBUFFER_TRACE(jh, "on running transaction"); @@ -1952,8 +1953,10 @@ static int journal_unmap_buffer(journal_t *journal, struct buffer_head *bh) * clear dirty bits when it is done with the buffer. */ set_buffer_freed(bh); - if (journal->j_running_transaction && buffer_jbddirty(bh)) + if (journal->j_running_transaction && buffer_jbddirty(bh)) { + jh->b_modified = 0; jh->b_next_transaction = journal->j_running_transaction; + } journal_put_journal_head(jh); spin_unlock(&journal->j_list_lock); jbd_unlock_bh_state(bh); diff --git a/fs/jbd2/transaction.c b/fs/jbd2/transaction.c index a0e41a4..094dcd8 100644 --- a/fs/jbd2/transaction.c +++ b/fs/jbd2/transaction.c @@ -1756,6 +1756,7 @@ static int __dispose_buffer(struct journal_head *jh, transaction_t *transaction) * __journal_file_buffer */ clear_buffer_dirty(bh); + jh->b_modified = 0; __jbd2_journal_file_buffer(jh, transaction, BJ_Forget); may_free = 0; } else { @@ -1917,8 +1918,10 @@ static int journal_unmap_buffer(journal_t *journal, struct buffer_head *bh) * clear dirty bits when it is done with the buffer. */ set_buffer_freed(bh); - if (journal->j_running_transaction && buffer_jbddirty(bh)) + if (journal->j_running_transaction && buffer_jbddirty(bh)) { + jh->b_modified = 0; jh->b_next_transaction = journal->j_running_transaction; + } jbd2_journal_put_journal_head(jh); spin_unlock(&journal->j_list_lock); jbd_unlock_bh_state(bh);
If we are journalling data (ie journal=data or big symlinks) we can discard buffers and move them to different transactions to make sure they get cleaned up properly. The problem is b_modified could still be set from the last transaction that touched it, so putting it on the currently running transaction or setting it up to be put on the next transaction will run into problems if the buffer gets reused in that transaction as the space accounting logic won't be done, which will result in panics at commit time because t_nr_buffers will end up being more than t_outstanding_credits. Thanks to Jan Kara for pointing out the other part of this problem a few months ago. Thanks, Signed-off-by: Josef Bacik <josef@redhat.com> --- fs/jbd/transaction.c | 5 ++++- fs/jbd2/transaction.c | 5 ++++- 2 files changed, 8 insertions(+), 2 deletions(-)