[SRU,Xenial,0/2] Improve our SAUCE for virtio-scsi reqs counter (fix CPU soft lockup)

Message ID 20181016153820.29487-1-mfo@canonical.com

Message

Mauricio Faria de Oliveira Oct. 16, 2018, 3:38 p.m. UTC
BugLink: https://bugs.launchpad.net/bugs/1798110

[Impact] 

 * Detaching virtio-scsi disk in Xenial guest can cause
   CPU soft lockup in guest (and take 100% CPU in host).

 * It may prevent further progress on other tasks that
   depend on resources locked earlier in the SCSI target
   removal stack, and/or impact other SCSI functionality.

 * The fix resolves a corner case in the requests counter
   in the virtio SCSI target, which impacts a downstream
   (SAUCE) patch in the virtio-scsi target removal handler
   that depends on the requests counter value to be zero.

[Test Case]

 * See LP #1798110 (this bug)'s comment #3 (too long for
   this section -- synthetic case with GDB+QEMU) and
   comment #4 (organic test case in cloud instance).

[Regression Potential] 

 * It seems low -- this only affects the SCSI command requeue
   path with regard to the reference counter, and the only real
   chance of problems from that counter is in our downstream
   patch (which is now passing this test case).

 * The other less serious issue would be decrementing it to
   a negative / < 0 value, which is not possible with this
   driver logic (see commit message), because the reqs counter
   is always incremented before calling virtscsi_queuecommand(),
   where this decrement operation is inserted.
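
 * For reference, a simplified sketch of the first patch's idea (not
   the literal diff; names and signatures below are abbreviated from
   the Xenial drivers/scsi/virtio_scsi.c, so see the actual patch for
   the exact code):

       /* In virtscsi_queuecommand(): the callers (queue steering
        * helpers) have already incremented tgt->reqs, so undo that
        * if the command cannot be kicked to the host and will be
        * requeued by the SCSI midlayer (which increments reqs again
        * on the retry). */
       if (virtscsi_kick_cmd(req_vq, cmd, ...) != 0) {
               atomic_dec(&tgt->reqs);
               return SCSI_MLQUEUE_HOST_BUSY;
       }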


Mauricio Faria de Oliveira (2):
  UBUNTU: SAUCE: (no-up) virtio-scsi: Decrement reqs counter before SCSI
    command requeue
  UBUNTU: SAUCE: (no-up) virtio-scsi: (cosmetic) reestablish
    virtscsi_pick_vq_mq() signature

 drivers/scsi/virtio_scsi.c | 10 ++++++++--
 1 file changed, 8 insertions(+), 2 deletions(-)

Comments

Khalid Elmously Oct. 16, 2018, 4:42 p.m. UTC | #1
On 2018-10-16 12:38:18 , Mauricio Faria de Oliveira wrote:
> BugLink: https://bugs.launchpad.net/bugs/1798110
> 
> [Impact] 
> 
>  * Detaching virtio-scsi disk in Xenial guest can cause
>    CPU soft lockup in guest (and take 100% CPU in host).
> 
>  * It may prevent further progress on other tasks that
>    depend on resources locked earlier in the SCSI target
>    removal stack, and/or impact other SCSI functionality.
> 
>  * The fix resolves a corner case in the requests counter
>    in the virtio SCSI target, which impacts a downstream
>    (SAUCE) patch in the virtio-scsi target removal handler
>    that depends on the requests counter value to be zero.
> 
> [Test Case]
> 
>  * See LP #1798110 (this bug)'s comment #3 (too long for
>    this section -- synthetic case with GDB+QEMU) and
>    comment #4 (organic test case in cloud instance).
> 
> [Regression Potential] 
> 
>  * It seems low -- this only affects the SCSI command requeue
>    path with regard to the reference counter, and the only real
>    chance of problems from that counter is in our downstream
>    patch (which is now passing this test case).
> 
>  * The other less serious issue would be decrementing it to
>    a negative / < 0 value, which is not possible with this
>    driver logic (see commit message), because the reqs counter
>    is always incremented before calling virtscsi_queuecommand(),
>    where this decrement operation is inserted.
> 
> 
> Mauricio Faria de Oliveira (2):
>   UBUNTU: SAUCE: (no-up) virtio-scsi: Decrement reqs counter before SCSI
>     command requeue
>   UBUNTU: SAUCE: (no-up) virtio-scsi: (cosmetic) reestablish
>     virtscsi_pick_vq_mq() signature
> 
>  drivers/scsi/virtio_scsi.c | 10 ++++++++--
>  1 file changed, 8 insertions(+), 2 deletions(-)
> 

 - Maybe I'm misunderstanding what you mean by 'no-up'. I assumed it means "no operation", as in, no change in the resulting binary (e.g., whitespace changes, comment changes, casting etc.). Both patches seem to be "op" changes
 - The second patch doesn't actually appear to be related to the 'reqs' counter or any related soft-lockups
 - It seems strange to me that you'd have a loop that repeatedly decrements a counter because you know it will be incremented again when about to retry. I would think it makes more sense to increment the counter only once when queuing the command and decrement only once on success or "giving up" - though I assume you did it this way for a reason?


None of the above points are major enough for me to NAK this patchset, but I wanted to bring them to your attention (in case you missed something).

Acked-by: Khalid Elmously <khalid.elmously@canonical.com>
Mauricio Faria de Oliveira Oct. 16, 2018, 6:08 p.m. UTC | #2
On Tue, Oct 16, 2018 at 1:42 PM Khaled Elmously
<khalid.elmously@canonical.com> wrote:
>
> On 2018-10-16 12:38:18 , Mauricio Faria de Oliveira wrote:
> > BugLink: https://bugs.launchpad.net/bugs/1798110
> >
> > [Impact]
> >
> >  * Detaching virtio-scsi disk in Xenial guest can cause
> >    CPU soft lockup in guest (and take 100% CPU in host).
> >
> >  * It may prevent further progress on other tasks that
> >    depend on resources locked earlier in the SCSI target
> >    removal stack, and/or impact other SCSI functionality.
> >
> >  * The fix resolves a corner case in the requests counter
> >    in the virtio SCSI target, which impacts a downstream
> >    (SAUCE) patch in the virtio-scsi target removal handler
> >    that depends on the requests counter value to be zero.
> >
> > [Test Case]
> >
> >  * See LP #1798110 (this bug)'s comment #3 (too long for
> >    this section -- synthetic case with GDB+QEMU) and
> >    comment #4 (organic test case in cloud instance).
> >
> > [Regression Potential]
> >
> >  * It seems low -- this only affects the SCSI command requeue
> >    path with regard to the reference counter, and the only real
> >    chance of problems from that counter is in our downstream
> >    patch (which is now passing this test case).
> >
> >  * The other less serious issue would be decrementing it to
> >    a negative / < 0 value, which is not possible with this
> >    driver logic (see commit message), because the reqs counter
> >    is always incremented before calling virtscsi_queuecommand(),
> >    where this decrement operation is inserted.
> >
> >
> > Mauricio Faria de Oliveira (2):
> >   UBUNTU: SAUCE: (no-up) virtio-scsi: Decrement reqs counter before SCSI
> >     command requeue
> >   UBUNTU: SAUCE: (no-up) virtio-scsi: (cosmetic) reestablish
> >     virtscsi_pick_vq_mq() signature
> >
> >  drivers/scsi/virtio_scsi.c | 10 ++++++++--
> >  1 file changed, 8 insertions(+), 2 deletions(-)
> >
>

Hi Khalid,

Thanks for reviewing.

>  - Maybe I'm misunderstanding what you mean by 'no-up'. I assumed it means "no operation", as in, no change in the resulting binary (e.g., whitespace changes, comment changes, casting etc.). Both patches seem to be "op" changes

Hm, I meant 'no up-stream' (i.e., a downstream/SAUCE patch that is not
expected to go upstream, as described in the StablePatchFormat wiki
page [3]).

>  - The second patch doesn't actually appear to be related to the 'reqs' counter or any related soft-lockups

It's related to the reqs counter "area", and you're absolutely right
about no relation to the soft lockup.

I should have mentioned the 2nd patch was more of a 'while still here'
patch, sorry.
It is related to the last applied virtio-scsi no-up patch, that
touches the reqs counter functionality as well.

>  - It seems strange to me that you'd have a loop that repeatedly decrements a counter because you know it will be incremented again when about to retry. I would think it makes more sense to increment the counter only once when queuing the command and decrement only once on success or "giving up" - though I assume you did it this way for a reason?

I didn't follow you on the loop scenario (I don't see a loop unless a
SCSI command repeatedly fails to be kicked and gets requeued/resent
by the SCSI layer - is that it?)

I agree, and indeed, there's a reason -- upstream only used the reqs
counter in the .queuecommand path for queue steering
in two cases, 1) single-virtqueue,  and 2) multi-virtqueue without
blk-mq -- so not all cases had it.
(and the first one makes it hard to move the increment call around, see below..)
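
Roughly, from memory (simplified; the names below are approximate, so
see the Xenial virtio_scsi.c for the exact code):

    /* 1) single virtqueue: increment right before queueing */
    atomic_inc(&tgt->reqs);
    virtscsi_queuecommand(vscsi, &vscsi->req_vqs[0], sc);

    /* 2) multi-virtqueue without blk-mq: the increment doubles as the
     * "are there requests in flight?" test used for queue steering,
     * which is what makes it hard to move it elsewhere */
    if (atomic_inc_return(&tgt->reqs) > 1) {
            vq = tgt->req_vq;      /* keep the currently chosen queue */
    } else {
            /* no requests in flight: pick a (possibly different)
             * virtqueue and record it in tgt->req_vq */
    }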

Then we introduced the first reqs-related SAUCE patch [1] (spin-loop
in virtscsi_target_destroy() until reqs is zero),
which fixed a race, but introduced a regression for multi-virtqueue
with blk-mq (which didn't increment the counter).
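
Something along these lines (paraphrasing [1] from memory, not the
exact hunk):

    static void virtscsi_target_destroy(struct scsi_target *starget)
    {
            struct virtio_scsi_target_state *tgt = starget->hostdata;

            /* wait for all requests targeting this scsi_target to
             * complete, so tgt is not freed while still in use */
            while (atomic_read(&tgt->reqs))
                    cpu_relax();

            kfree(tgt);
    }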

The second reqs-related SAUCE patch [2] fixed that regression for
multi-virtqueue blk-mq, but to do so it ended up adding the increment
in the blk-mq related function as well.
(Even there it would be hard to move the atomic_inc() calls around,
as one case uses atomic_inc_return() and checks the counter value
right there, which complicates moving that logic.)

The patch [1] introduced another regression (this CPU softlockup),
which this patch now addresses.

I'd like to investigate later whether there's a way we can accomplish
the synchronization in commit [1] without resorting to the reqs counter,
as it fixed one issue but introduced two so far.
This would allow us to drop these SAUCE patches.

I'll likely have a chat w/ Jay about it this week, but since the fix
was obvious and conservative, I figured it would be better to have it
fixed on top for now, and then refactored/re-fixed later.

[1] f1f609d8015e UBUNTU: SAUCE: (no-up) virtio-scsi: Fix race in target free
[2] a6c038a533f3 UBUNTU: SAUCE: (no-up) virtio-scsi: Increment reqs counter.


> None of the above points are major enough for me to NAK this patchset, but I wanted to bring them to your attention (in case you missed something).

Sure, thanks for the careful review.
Hope this helps clarify things!

> Acked-by: Khalid Elmously <khalid.elmously@canonical.com>
>

[3] https://wiki.ubuntu.com/Kernel/Dev/StablePatchFormat
Khalid Elmously Oct. 17, 2018, 2:59 p.m. UTC | #3
On 2018-10-16 15:08:48 , Mauricio Faria de Oliveira wrote:
> On Tue, Oct 16, 2018 at 1:42 PM Khaled Elmously
> <khalid.elmously@canonical.com> wrote:
> >
> > On 2018-10-16 12:38:18 , Mauricio Faria de Oliveira wrote:
> > > BugLink: https://bugs.launchpad.net/bugs/1798110
> > >
> > > [Impact]
> > >
> > >  * Detaching virtio-scsi disk in Xenial guest can cause
> > >    CPU soft lockup in guest (and take 100% CPU in host).
> > >
> > >  * It may prevent further progress on other tasks that
> > >    depend on resources locked earlier in the SCSI target
> > >    removal stack, and/or impact other SCSI functionality.
> > >
> > >  * The fix resolves a corner case in the requests counter
> > >    in the virtio SCSI target, which impacts a downstream
> > >    (SAUCE) patch in the virtio-scsi target removal handler
> > >    that depends on the requests counter value to be zero.
> > >
> > > [Test Case]
> > >
> > >  * See LP #1798110 (this bug)'s comment #3 (too long for
> > >    this section -- synthetic case with GDB+QEMU) and
> > >    comment #4 (organic test case in cloud instance).
> > >
> > > [Regression Potential]
> > >
> > >  * It seems low -- this only affects the SCSI command requeue
> > >    path with regard to the reference counter, and the only real
> > >    chance of problems from that counter is in our downstream
> > >    patch (which is now passing this test case).
> > >
> > >  * The other less serious issue would be decrementing it to
> > >    a negative / < 0 value, which is not possible with this
> > >    driver logic (see commit message), because the reqs counter
> > >    is always incremented before calling virtscsi_queuecommand(),
> > >    where this decrement operation is inserted.
> > >
> > >
> > > Mauricio Faria de Oliveira (2):
> > >   UBUNTU: SAUCE: (no-up) virtio-scsi: Decrement reqs counter before SCSI
> > >     command requeue
> > >   UBUNTU: SAUCE: (no-up) virtio-scsi: (cosmetic) reestablish
> > >     virtscsi_pick_vq_mq() signature
> > >
> > >  drivers/scsi/virtio_scsi.c | 10 ++++++++--
> > >  1 file changed, 8 insertions(+), 2 deletions(-)
> > >
> >
> 
> Hi Khalid,
> 
> Thanks for reviewing.
> 
> >  - Maybe I'm misunderstanding what you mean by 'no-up'. I assumed it means "no operation", as in, no change in the resulting binary (e.g., whitespace changes, comment changes, casting etc.). Both patches seem to be "op" changes
> 
> Hm, I meant 'no up-stream' (i.e., a downstream/SAUCE patch that is not
> expected to go upstream, as described in the StablePatchFormat wiki
> page [3]).
> 

I had no idea that that's what "no-up" means :) Thanks


> >  - The second patch doesn't actually appear to be related to the 'reqs' counter or any related soft-lockups
> 
> It's related to the reqs counter "area", and you're absolutely right
> about no relation to the soft lockup.
> 
> I should have mentioned the 2nd patch was more of a 'while still here'
> patch, sorry.
> It is related to the last applied virtio-scsi no-up patch, that
> touches the reqs counter functionality as well.
> 
> >  - It seems strange to me that you'd have a loop that repeatedly decrements a counter because you know it will be incremented again when about to retry. I would think it makes more sense to increment the counter only once when queuing the command and decrement only once on success or "giving up" - though I assume you did it this way for a reason?
> 
> I didn't follow you on the loop scenario (I don't see a loop unless a
> SCSI command repeatedly fails to be kicked and gets requeued/resent
> by the SCSI layer - is that it?)

Yep, that's what I meant.


> 
> I agree, and indeed, there's a reason -- upstream only used the reqs
> counter in the .queuecommand path for queue steering
> in two cases, 1) single-virtqueue,  and 2) multi-virtqueue without
> blk-mq -- so not all cases had it.
> (and the first one makes it hard to move the increment call around, see below..)
> 
> Then we introduced the first reqs-related SAUCE patch [1] (spin-loop
> in virtscsi_target_destroy() until reqs is zero),
> which fixed a race, but introduced a regression for multi-virtqueue
> with blk-mq (which didn't increment the counter).
> 
> The second reqs-related SAUCE patch [2] fixed that regression for
> multi-virtqueue blk-mq, but to do so it ended up adding the increment
> in the blk-mq related function as well.
> (Even there it would be hard to move the atomic_inc() calls around,
> as one case uses atomic_inc_return() and checks the counter value
> right there, which complicates moving that logic.)
> 
> The patch [1] introduced another regression (this CPU softlockup),
> which this patch now addresses.
> 
> I'd like to investigate later whether there's a way we can accomplish
> the synchronization in commit [1] without resorting to the reqs counter,
> as it fixed one issue but introduced two so far.
> This would allow us to drop these SAUCE patches.
> 
> I'll likely have a chat w/ Jay about it this week, but since the fix
> was obvious and conservative, I figured it would be better to have it
> fixed on top for now, and then refactored/re-fixed later.

Fair enough - and thanks for clarifying in detail


> 
> [1] f1f609d8015e UBUNTU: SAUCE: (no-up) virtio-scsi: Fix race in target free
> [2] a6c038a533f3 UBUNTU: SAUCE: (no-up) virtio-scsi: Increment reqs counter.
> 
> 
> > None of the above points are major enough for me to NAK this patchset, but I wanted to bring them to your attention (in case you missed something).
> 
> Sure, thanks for the careful review.
> Hope this helps clarify things!
> 
> > Acked-by: Khalid Elmously <khalid.elmously@canonical.com>
> >
> 
> [3] https://wiki.ubuntu.com/Kernel/Dev/StablePatchFormat
> 
> -- 
> Mauricio Faria de Oliveira
Mauricio Faria de Oliveira Oct. 17, 2018, 3:29 p.m. UTC | #4
On Wed, Oct 17, 2018 at 11:59 AM Khaled Elmously
<khalid.elmously@canonical.com> wrote:
[snip]
> > >  - Maybe I'm misunderstanding what you mean by 'no-up'. I assumed it means "no operation", as in, no change in the resulting binary (e.g., whitespace changes, comment changes, casting etc.). Both patches seem to be "op" changes
> >
> > Hm, I meant 'no up-stream' (i.e., a downstream/SAUCE patch that is not
> > expected to go upstream, as described in the StablePatchFormat wiki
> > page [3]).
> >
>
> I had no idea that that's what "no-up" means :) Thanks

Heh, happens to everyone :-) Glad to help!

[snip]
> > >  - It seems strange to me that you'd have a loop that repeatedly decrements a counter because you know it will be incremented again when about to retry. I would think it makes more sense to increment the counter only once when queuing the command and decrement only once on success or "giving up" - though I assume you did it this way for a reason?
> >
> > I didn't follow you on the loop scenario (I don't see a loop unless a
> > SCSI command repeatedly fails to be kicked and gets requeued/resent
> > by the SCSI layer - is that it?)
>
> Yep, that's what I meant.

Okay. So, indeed, that scenario is possible and actually weird,
but fortunately it's neither the common case nor the hot path
(so not much is wasted on these atomic inc/dec calls in the error path).

And yeah, unfortunately we had that 'reason' I mentioned in the previous email,
which didn't easily allow for changes in where the incrementing is done.
(If things were upstream the way they are downstream for us now,
I'd like to have sent something to simplify them, as you said.)

[snip]
> > I'll likely have a chat w/ Jay about it this week, but since the fix
> > was obvious and conservative, I figured it would be better to have it
> > fixed on top for now, and then refactored/re-fixed later.
>
> Fair enough - and thanks for clarifying in detail

Sure, thanks again for your careful review.

>
> >
> > [1] f1f609d8015e UBUNTU: SAUCE: (no-up) virtio-scsi: Fix race in target free
> > [2] a6c038a533f3 UBUNTU: SAUCE: (no-up) virtio-scsi: Increment reqs counter.
> >
> >
> > > None of the above points are major enough for me to NAK this patchset, but I wanted to bring them to your attention (in case you missed something).
> >
> > Sure, thanks for the careful review.
> > Hope this helps clarify things!
> >
> > > Acked-by: Khalid Elmously <khalid.elmously@canonical.com>
> > >
> >
> > [3] https://wiki.ubuntu.com/Kernel/Dev/StablePatchFormat
> >
> > --
> > Mauricio Faria de Oliveira