Schedule by INSN_COST in case of tie

Message ID	5B97C539.1020009@arm.com
State	New
From: Vlad Lazar <vlad.lazar@arm.com>
To: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>
Cc: Jeff Law <law@redhat.com>, vmakarov@redhat.com, gnu@the-meissners.org, wilson@tuliptree.org, nd <nd@arm.com>
Subject: [PATCH] Schedule by INSN_COST in case of tie
Date: Tue, 11 Sep 2018 14:38:01 +0100
Series	Schedule by INSN_COST in case of tie

Vlad Lazar Sept. 11, 2018, 1:38 p.m. UTC

Hi.

This patch makes the scheduler prefer the instruction with the higher cost when two candidate
instructions are otherwise ranked equally. Issuing the more restricted instructions first is
particularly useful on in-order cores because it increases the number of dual-issue opportunities.

For example, on AArch64, instead of:

   add     x11, x2, 96
   mov     x4, x2
   mov     w10, 1
   ldrh    w5, [x0]
   ldrh    w13, [x0, 2]
   ldrh    w9, [x0, 4]
   ldrh    w12, [x0, 6]
   b       .L759

Generate:

   ldrh    w5, [x0]
   add     x11, x2, 96
   ldrh    w13, [x0, 2]
   mov     x4, x2
   ldrh    w9, [x0, 4]
   mov     w10, 1
   ldrh    w12, [x0, 6]
   b       .L759
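
As a rough illustration of why the second ordering can help (not part of the patch), the two sequences can be compared under a toy issue model. Everything in this sketch is an assumption made for illustration: an in-order core that issues at most two adjacent instructions per cycle, a single load pipe (so two loads never pair), no data dependencies between the instructions, and the hypothetical names `enum kind` and `count_cycles`. Real cores and the scheduler's own pipeline descriptions are more detailed than this.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical instruction classes for the toy model. */
enum kind { ALU, LOAD, BRANCH };

/* First listing: add, mov, mov, ldrh x4, b. */
static const enum kind order_before[] = {
    ALU, ALU, ALU, LOAD, LOAD, LOAD, LOAD, BRANCH
};

/* Second listing: each ldrh is followed by a non-load instruction. */
static const enum kind order_after[] = {
    LOAD, ALU, LOAD, ALU, LOAD, ALU, LOAD, BRANCH
};

/* Count issue cycles for an in-order, dual-issue toy core.
   Assumption: two adjacent instructions issue together unless both
   are loads (only one load pipe).  Dependencies are ignored.  */
static int count_cycles(const enum kind *seq, size_t n)
{
    int cycles = 0;
    size_t i = 0;

    assert(seq != NULL || n == 0);
    while (i < n) {
        cycles++;
        if (i + 1 < n && !(seq[i] == LOAD && seq[i + 1] == LOAD))
            i += 2;   /* dual issue: consume both instructions */
        else
            i += 1;   /* single issue this cycle */
    }
    return cycles;
}
```

Under these assumptions, `count_cycles(order_before, 8)` gives 5 and `count_cycles(order_after, 8)` gives 4, because the interleaved order never places two loads next to each other.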

Bootstrapped and regtested on aarch64-none-linux-gnu and there are no regressions.
Ok for trunk?

Thanks,
Vlad

ChangeLog for gcc/ChangeLog:
2018-09-11  Vlad Lazar  <vlad.lazar@arm.com>

	* haifa-sched.c (rank_for_schedule): Schedule by INSN_COST.
	(rfs_decision): New scheduling decision.
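
The diff itself is not reproduced in this thread, so the following is only a minimal sketch of the idea of a cost-based tie-breaker, not the code in haifa-sched.c. The struct `cand`, its fields, the cost values and the functions `rank_cands` and `sort_ready` are all hypothetical; the real rank_for_schedule applies many more heuristics before it would reach such a tie-breaker.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical ready-list entry; not a GCC data structure. */
struct cand {
    const char *name;
    int priority;   /* stand-in for all earlier ranking heuristics */
    int cost;       /* stand-in for the scheduler's latency cost */
    int luid;       /* original position, used as the final tie-breaker */
};

/* qsort comparator: higher priority first; on a tie, higher cost
   first; on a further tie, keep the original order.  */
static int rank_cands(const void *pa, const void *pb)
{
    const struct cand *a = pa;
    const struct cand *b = pb;

    if (a->priority != b->priority)
        return b->priority - a->priority;
    if (a->cost != b->cost)
        return b->cost - a->cost;   /* the new tie-breaker */
    return a->luid - b->luid;
}

/* Sort a ready list so the preferred candidate comes first. */
static void sort_ready(struct cand *ready, size_t n)
{
    assert(ready != NULL || n == 0);
    if (n > 0)
        qsort(ready, n, sizeof *ready, rank_cands);
}
```

With three equally ranked candidates, say an add and a mov with an assumed cost of 1 and an ldrh with an assumed cost of 3, `sort_ready` places the ldrh first and leaves the add and mov in their original relative order.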

Ramana Radhakrishnan Sept. 11, 2018, 2:55 p.m. UTC | #1

On Tue, 11 Sep 2018, 14:38 Vlad Lazar, <vlad.lazar@arm.com> wrote:

> Hi.
>
> This patch makes the scheduler prefer instructions with higher cost if two
> given instructions are equally good.
> Issuing more restricted instructions first is particularly useful on
> in-order cores because it increases the
> number of dual issue opportunities.
>
> For example, on AArch64, instead of:
>
>    add     x11, x2, 96
>    mov     x4, x2
>    mov     w10, 1
>    ldrh    w5, [x0]
>    ldrh    w13, [x0, 2]
>    ldrh    w9, [x0, 4]
>    ldrh    w12, [x0, 6]
>    b       .L759
>
> Generate:
>
>    ldrh    w5, [x0]
>    add     x11, x2, 96
>    ldrh    w13, [x0, 2]
>    mov     x4, x2
>    ldrh    w9, [x0, 4]
>    mov     w10, 1
>    ldrh    w12, [x0, 6]
>    b       .L759
>
> Bootstrapped and regtested on aarch64-none-linux-gnu and there are no
> regressions.
> Ok for trunk?
>

This feels like the wrong approach to me, as you seem to be assuming that
INSN_COST is latency in some way? Surely we shouldn't be introducing
INSN_COST-based logic into the scheduler.

Have you investigated using TARGET_SCHED_ADJUST_COST (IIRC, look for the
right name in the internals documents) and similar hooks that come from the
scheduler, rather than trying to massage INSN_COST into the
target-independent parts of the scheduler?

Ramana


Kyrill Tkachov Sept. 11, 2018, 3 p.m. UTC | #2

Hi Ramana,

On 11/09/18 15:55, Ramana Radhakrishnan wrote:
> On Tue, 11 Sep 2018, 14:38 Vlad Lazar, <vlad.lazar@arm.com> wrote:
>
> > Hi.
> >
> > This patch makes the scheduler prefer instructions with higher cost if two
> > given instructions are equally good.
> > Issuing more restricted instructions first is particularly useful on
> > in-order cores because it increases the
> > number of dual issue opportunities.
> >
> > For example, on AArch64, instead of:
> >
> >    add     x11, x2, 96
> >    mov     x4, x2
> >    mov     w10, 1
> >    ldrh    w5, [x0]
> >    ldrh    w13, [x0, 2]
> >    ldrh    w9, [x0, 4]
> >    ldrh    w12, [x0, 6]
> >    b       .L759
> >
> > Generate:
> >
> >    ldrh    w5, [x0]
> >    add     x11, x2, 96
> >    ldrh    w13, [x0, 2]
> >    mov     x4, x2
> >    ldrh    w9, [x0, 4]
> >    mov     w10, 1
> >    ldrh    w12, [x0, 6]
> >    b       .L759
> >
> > Bootstrapped and regtested on aarch64-none-linux-gnu and there are no
> > regressions.
> > Ok for trunk?
> >
>
> This to me feels like the wrong approach as it feels like you are assuming
> INSN_COST is latency in some way ? Surely, we shouldn't be introducing
> INSN_COST based stuff into the scheduler.
>
> Have you investigated  using TARGET_SCHED_ADJUST_COST (IIRC, look for the
> right name in the internals documents) and such hooks that come from the
> scheduler rather than trying to massage INSN_COST into the target
> independent parts of the scheduler ?
>

In the context of haifa-sched.c, INSN_COST is the latency cost.
It is not the rtx_cost of the insn, as used by combine and others.
So this approach looks reasonable to me (though I haven't done a deep review).

Thanks,
Kyrill


Ramana Radhakrishnan Sept. 11, 2018, 3:03 p.m. UTC | #3

>
> > This to me feels like the wrong approach as it feels like you are assuming
> > INSN_COST is latency in some way ? Surely, we shouldn't be introducing
> > INSN_COST based stuff into the scheduler.
> >
> > Have you investigated  using TARGET_SCHED_ADJUST_COST (IIRC, look for the
> > right name in the internals documents) and such hooks that come from the
> > scheduler rather than trying to massage INSN_COST into the target
> > independent parts of the scheduler ?
> >
>
> In the context of haifa-sched.c, INSN_COST is the latency cost.
> It is not the rtx_cost of the insn, as used by combine and others.

Ah, I was conflating rtx_cost with INSN_COST, sorry about the noise.

Ramana


Jeff Law Sept. 11, 2018, 8:46 p.m. UTC | #4

On 9/11/18 9:00 AM, Kyrill Tkachov wrote:
> Hi Ramana,
> 
> On 11/09/18 15:55, Ramana Radhakrishnan wrote:
>> On Tue, 11 Sep 2018, 14:38 Vlad Lazar, <vlad.lazar@arm.com> wrote:
>>
>> > Hi.
>> >
>> > This patch makes the scheduler prefer instructions with higher cost
>> if two
>> > given instructions are equally good.
>> > Issuing more restricted instructions first is particularly useful on
>> > in-order cores because it increases the
>> > number of dual issue opportunities.
>> >
>> > For example, on AArch64, instead of:
>> >
>> >    add     x11, x2, 96
>> >    mov     x4, x2
>> >    mov     w10, 1
>> >    ldrh    w5, [x0]
>> >    ldrh    w13, [x0, 2]
>> >    ldrh    w9, [x0, 4]
>> >    ldrh    w12, [x0, 6]
>> >    b       .L759
>> >
>> > Generate:
>> >
>> >    ldrh    w5, [x0]
>> >    add     x11, x2, 96
>> >    ldrh    w13, [x0, 2]
>> >    mov     x4, x2
>> >    ldrh    w9, [x0, 4]
>> >    mov     w10, 1
>> >    ldrh    w12, [x0, 6]
>> >    b       .L759
>> >
>> > Bootstrapped and regtested on aarch64-none-linux-gnu and there are no
>> > regressions.
>> > Ok for trunk?
>> >
>>
>> This to me feels like the wrong approach as it feels like you are
>> assuming
>> INSN_COST is latency in some way ? Surely, we shouldn't be introducing
>> INSN_COST based stuff into the scheduler.
>>
>> Have you investigated  using TARGET_SCHED_ADJUST_COST (IIRC, look for the
>> right name in the internals documents) and such hooks that come from the
>> scheduler rather than trying to massage INSN_COST into the target
>> independent parts of the scheduler ?
>>
> 
> In the context of haifa-sched.c, INSN_COST is the latency cost.
> It is not the rtx_cost of the insn, as used by combine and others.
> So this approach looks reasonable to me (though I haven't done a deep
> review).
It looks reasonable to me as well.  Essentially it's a new tie-breaker
if everything else is equal.

jeff

Jeff Law Sept. 11, 2018, 8:46 p.m. UTC | #5

On 9/11/18 7:38 AM, Vlad Lazar wrote:
> Hi.
> 
> This patch makes the scheduler prefer instructions with higher cost if
> two given instructions are equally good.
> Issuing more restricted instructions first is particularly useful on
> in-order cores because it increases the
> number of dual issue opportunities.
> 
> For example, on AArch64, instead of:
> 
>   add     x11, x2, 96
>   mov     x4, x2
>   mov     w10, 1
>   ldrh    w5, [x0]
>   ldrh    w13, [x0, 2]
>   ldrh    w9, [x0, 4]
>   ldrh    w12, [x0, 6]
>   b       .L759
> 
> Generate:
> 
>   ldrh    w5, [x0]
>   add     x11, x2, 96
>   ldrh    w13, [x0, 2]
>   mov     x4, x2
>   ldrh    w9, [x0, 4]
>   mov     w10, 1
>   ldrh    w12, [x0, 6]
>   b       .L759
> 
> Bootstrapped and regtested on aarch64-none-linux-gnu and there are no
> regressions.
> Ok for trunk?
> 
> Thanks,
> Vlad
> 
> gcc/
> Changelog for gcc/Changelog
> 2018-09-11  Vlad Lazar  <vlad.lazar@arm.com>
> 
>     * haifa-sched.c (rank_for_schedule): Schedule by INSN_COST.
>     (rfs_decision): New scheduling decision.
OK.
jeff
