ext2: do not sleep in ext2_error()

Message ID	20210903090538.GA7283@kili
State	Not Applicable
Headers	show Return-Path: <linux-ext4-owner@vger.kernel.org> Date: Fri, 3 Sep 2021 12:05:38 +0300 From: Dan Carpenter <dan.carpenter@oracle.com> To: Jan Kara <jack@suse.com> Cc: linux-ext4@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-janitors@vger.kernel.org Subject: [PATCH] ext2: do not sleep in ext2_error() Message-ID: <20210903090538.GA7283@kili> Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.10.1 (2018-07-13) MIME-Version: 1.0 Precedence: bulk
Series	ext2: do not sleep in ext2_error() \| expand ext2: do not sleep in ext2_error()

Message ID

20210903090538.GA7283@kili

State

Not Applicable

Headers

Date: Fri, 3 Sep 2021 12:05:38 +0300
From: Dan Carpenter <dan.carpenter@oracle.com>
To: Jan Kara <jack@suse.com>
Cc: linux-ext4@vger.kernel.org, linux-kernel@vger.kernel.org,
        kernel-janitors@vger.kernel.org
Subject: [PATCH] ext2: do not sleep in ext2_error()
Message-ID: <20210903090538.GA7283@kili>
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.10.1 (2018-07-13)
MIME-Version: 1.0
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: 
 aPty/7N/1F+b41RNtjvKKfshs59+7BM9eHtjKzgO7B351WZby7YyfbuBI2ZYhXXIkG0q3RYOUzvSX7B2tZcZsK3nx0JSImWr4PI0uV6btwHt8nS9mVIVk79PhNK7jIzoE7w8jZxaC87vtj7BF8M07u4fDgtl2pHBTC+33PVvJjxhHL5RF866LBAo6WJ/7czXJykqv1wq0BCyczzWigNb2biViQU0NochOnmp8U776fi97Xg994c9qqS4TtB42ob977Jv9vcW4ehrfeHTBObGhw9i3uKO3UHTd0Bdv67ESbUthAGpWGFmFJ2WPzm77b5/bjs1usJwtjMuHrtV3vaA06CpYlPtVjNEt7pn6jWVb0Mt98clBaTyB1l47doDlQm51uuIDQML7qNIIITm+KE/eKLPCkdfqRK49vUx6YTXMEcY3pOxj8LhGTaMV7kTvqCik22Ufmv4BronSuoeO2cDO8UJzhz4uw1GoFScT0jSOR2Hr4PbGI1BiKPktJnYzUJhxJkBQe9Pqr4VZUY1Cz8VGr09fzotElrLv7mo24N9tVgce9MSJ53yP0VxwyuKVkkfQ72u7/chXI8FKOXYFs+P7YaFQuaBzrndA9T5dkNCa355opDoqeFEUknpSOhjiWk3Z0gSyY0DkiNHNd/l1Yfot6XBoXlgNlCDff5KFve8UQmwyUpb7Sx9Es/3HqWK5ZYIiUv7HKaWpXrjSma/d0p7gZreH0C18VumNNEhfrw0qH7LYPXnhyGQPDtWSi5hTuo6hhszXAvVNMniQmtF6GT8uRMJemSVbeWhCeJlXHStqdQb17EDAFugd3pYRKWUyiYGjvVHVJ1kAh5IwPFEd4QStflO42DGyBqZudSX4uuy1ArzJCQNJ+J44Ywpf8RzE8o9lk6+Kos6c6GQ/sipjQ1iCF6VnH4yn1+gx1j9mu82otznyMq7QH14Y+ruKll54mU2S/z8D2v38cJohQdGUQHXbi3CU4k24AReA//TAJsTomKOpQ/N2fAThF6aTkqV+sIklYptQSF7iBEKM0KdMt0HldAWpcAoT7BENuKJfc1dvVYwZ2Tq+Yqym2pYEjXFbnjkRRmXFkH6KkQqdWXmOwjguqj/MKGGQVAbzXUXZMaUCR3fi0pl+FBhFzs0fLF5yToI9cOYPN+HJelcCL+tJRLDonuwFb4JHBeBvU14OYoL0dsrvupSXm3pGnjCGlCAjOFZeV/AY7fIcEvmNmccRCbHUk1Bmteez24AM+g3uvokfhkxL1Z/ZywvwmBnscRpQtVMoJnCD1QluKtib4oqMPE+KbodSvA63NM7IoXGWsAhrpvSXW0G5zzCkHRVf5+PS3kn
X-OriginatorOrg: oracle.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 
 d73d622b-a8c5-459a-d239-08d96eba18b9
X-MS-Exchange-CrossTenant-AuthSource: 
 MWHPR1001MB2365.namprd10.prod.outlook.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 03 Sep 2021 09:06:21.0367
 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: 4e2c6054-71cb-48f1-bd6c-3a9705aca71b
X-MS-Exchange-CrossTenant-MailboxType: HOSTED
X-MS-Exchange-CrossTenant-UserPrincipalName: 
 aq0wwffk8ri6WxKqjhygT81yjJ/SX7jGaTJR5840pP6fcRMB0jiX2krIRpRk5OSDrf2ANwJf8zh3I7MYVZsOaUQCQywy54b6beLyGEmFgh4=
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MWHPR10MB1823
X-Proofpoint-Virus-Version: vendor=nai engine=6300 definitions=10095
 signatures=668682
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0
 phishscore=0 suspectscore=0
 malwarescore=0 mlxscore=0 mlxlogscore=999 bulkscore=0 adultscore=0
 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2108310000
 definitions=main-2109030055
X-Proofpoint-ORIG-GUID: -qk480mRuDXMF2a8rtl314cwHZyrJuKE
X-Proofpoint-GUID: -qk480mRuDXMF2a8rtl314cwHZyrJuKE
Precedence: bulk
List-ID: <linux-ext4.vger.kernel.org>
X-Mailing-List: linux-ext4@vger.kernel.org

Series

ext2: do not sleep in ext2_error() | expand

Commit Message

Dan Carpenter Sept. 3, 2021, 9:05 a.m. UTC

No one expects error logging functions to sleep so sometimes they are
called with spinlocks held.  In this case the problematic call tree is:

ext2_statfs() <- disables preempt
-> ext2_count_free_inodes()
   -> ext2_get_group_desc()
      -> ext2_error()

Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
---
This is just from static analysis.  NOT TESTED!

Probably a safer fix would be to just call pr_err() instead of
ext2_error() in ext2_get_group_desc().  I can send that fix instead if
people want.

 fs/ext2/super.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Comments

Theodore Ts'o Sept. 3, 2021, 12:48 p.m. UTC | #1

On Fri, Sep 03, 2021 at 12:05:38PM +0300, Dan Carpenter wrote:
> No one expects error logging functions to sleep so sometimes they are
> called with spinlocks held.  In this case the problematic call tree is:
> 
> ext2_statfs() <- disables preempt
> -> ext2_count_free_inodes()
>    -> ext2_get_group_desc()
>       -> ext2_error()
> 
> Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
> ---
> This is just from static analysis.  NOT TESTED!
> 
> Probably a safer fix would be to just call pr_err() instead of
> ext2_error() in ext2_get_group_desc().  I can send that fix instead if
> people want.

Looking at both of the ext2_error() calls in ext2_get_group_desc(),
those are really more in the way of assertions rather than warning of
an on-disk corruption issue.  The second "group descriptor not loaded"
should never happen, and the "block_group >= groups_count" should have
been caught via an invalid block number or check by the caller (or an
outright code bug in say ext2_statfs().

So I suspect both of those would be more usefule as a WARN() rather
than a call to ext2_error(), since stack trace would actually provide
more useful data to root causing the issue.  Jan, what do you think?

     	    	    	 	 - Ted

P.S.  The same analysis applies for ext4_get_group_desc(), BTW.  We
don't take a lock in ext4_statfs() so trying to take a lock while
sleeping is not an issue.

For both ext2 and ext4, the caller is not supposed to holding spin
locks when it calls ext[24]_error().  In cases where it is absolutely
not avoidable, special measures are required --- see for example
__ext4_grp_locked_error().

Dan Carpenter Sept. 3, 2021, 1:09 p.m. UTC | #2

On Fri, Sep 03, 2021 at 08:48:38AM -0400, Theodore Ts'o wrote:
> On Fri, Sep 03, 2021 at 12:05:38PM +0300, Dan Carpenter wrote:
> > No one expects error logging functions to sleep so sometimes they are
> > called with spinlocks held.  In this case the problematic call tree is:
> > 
> > ext2_statfs() <- disables preempt
> > -> ext2_count_free_inodes()
> >    -> ext2_get_group_desc()
> >       -> ext2_error()
> > 
> > Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
> > ---
> > This is just from static analysis.  NOT TESTED!
> > 
> > Probably a safer fix would be to just call pr_err() instead of
> > ext2_error() in ext2_get_group_desc().  I can send that fix instead if
> > people want.
> 
> Looking at both of the ext2_error() calls in ext2_get_group_desc(),
> those are really more in the way of assertions rather than warning of
> an on-disk corruption issue.  The second "group descriptor not loaded"
> should never happen, and the "block_group >= groups_count" should have
> been caught via an invalid block number or check by the caller (or an
> outright code bug in say ext2_statfs().
> 
> So I suspect both of those would be more usefule as a WARN() rather
> than a call to ext2_error(), since stack trace would actually provide
> more useful data to root causing the issue.  Jan, what do you think?
> 
>      	    	    	 	 - Ted

Thanks Ted,

I'll resend with the WARN() change.

regards,
dan carpenter

Jan Kara Sept. 16, 2021, 9:48 a.m. UTC | #3

On Fri 03-09-21 08:48:38, Theodore Ts'o wrote:
> On Fri, Sep 03, 2021 at 12:05:38PM +0300, Dan Carpenter wrote:
> > No one expects error logging functions to sleep so sometimes they are
> > called with spinlocks held.  In this case the problematic call tree is:
> > 
> > ext2_statfs() <- disables preempt
> > -> ext2_count_free_inodes()
> >    -> ext2_get_group_desc()
> >       -> ext2_error()
> > 
> > Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
> > ---
> > This is just from static analysis.  NOT TESTED!
> > 
> > Probably a safer fix would be to just call pr_err() instead of
> > ext2_error() in ext2_get_group_desc().  I can send that fix instead if
> > people want.
> 
> Looking at both of the ext2_error() calls in ext2_get_group_desc(),
> those are really more in the way of assertions rather than warning of
> an on-disk corruption issue.  The second "group descriptor not loaded"
> should never happen, and the "block_group >= groups_count" should have
> been caught via an invalid block number or check by the caller (or an
> outright code bug in say ext2_statfs().
> 
> So I suspect both of those would be more usefule as a WARN() rather
> than a call to ext2_error(), since stack trace would actually provide
> more useful data to root causing the issue.  Jan, what do you think?

Yes, I agree. Definitely better than not flushing error on other
ext2_error() calls. BTW, Dan, I don't see a patch with WARN() in my inbox.
Did it get lost somewhere?

								Honza

diff --git a/fs/ext2/super.c b/fs/ext2/super.c
index d8d580b609ba..ba345ab860f0 100644
--- a/fs/ext2/super.c
+++ b/fs/ext2/super.c
@@ -59,7 +59,7 @@  void ext2_error(struct super_block *sb, const char *function,
 		sbi->s_mount_state |= EXT2_ERROR_FS;
 		es->s_state |= cpu_to_le16(EXT2_ERROR_FS);
 		spin_unlock(&sbi->s_lock);
-		ext2_sync_super(sb, es, 1);
+		ext2_sync_super(sb, es, 0);
 	}
 
 	va_start(args, fmt);

ext2: do not sleep in ext2_error()

Commit Message

Comments

Patch