diff mbox

[AArch64] Handle compare of zero_extract form of TST-immediate in rtx costs

Message ID 5693DB32.3010106@foss.arm.com
State New
Headers show

Commit Message

Kyrill Tkachov Jan. 11, 2016, 4:41 p.m. UTC
Hi all,

The test gcc.target/aarch64/tst_3.c fails for an explicit -mcpu=cortex-a53 because we don't
handle the recent compare with zero_extract pattern properly in rtx costs, so we end up recursing
into its operands and end up rejecting the combination for some CPUs, generating an AND-immediate
followed by a comparison against zero, instead of the TST-immediate instruction expected by the test.

This patch adds handling for that pattern so that we properly handle it the same ways as an ANDS instruction.
With this patch the aforementioned test passes for -mcpu=cortex-a53 as well.

Bootstrapped and tested on aarch64-none-linux-gnu.
Ok for trunk?

Thanks,
Kyrill

2016-01-11  Kyrylo Tkachov  <kyrylo.tkachov@arm.com>

     * config/aarch64/aarch64.c (aarch64_rtx_costs, COMPARE case):
     Handle COMPARE of ZERO_EXTRACT against zero form of TST-immediate.

Comments

James Greenhalgh Jan. 15, 2016, 5:21 p.m. UTC | #1
On Mon, Jan 11, 2016 at 04:41:22PM +0000, Kyrill Tkachov wrote:
> Hi all,
> 
> The test gcc.target/aarch64/tst_3.c fails for an explicit -mcpu=cortex-a53
> because we don't handle the recent compare with zero_extract pattern properly
> in rtx costs, so we end up recursing into its operands and end up rejecting
> the combination for some CPUs, generating an AND-immediate followed by a
> comparison against zero, instead of the TST-immediate instruction expected by
> the test.
> 
> This patch adds handling for that pattern so that we properly handle it the
> same ways as an ANDS instruction.  With this patch the aforementioned test
> passes for -mcpu=cortex-a53 as well.
> 
> Bootstrapped and tested on aarch64-none-linux-gnu.
> Ok for trunk?

OK.

Thanks,
James

> 
> Thanks,
> Kyrill
> 
> 2016-01-11  Kyrylo Tkachov  <kyrylo.tkachov@arm.com>
> 
>     * config/aarch64/aarch64.c (aarch64_rtx_costs, COMPARE case):
>     Handle COMPARE of ZERO_EXTRACT against zero form of TST-immediate.
diff mbox

Patch

diff --git a/gcc/config/aarch64/aarch64.c b/gcc/config/aarch64/aarch64.c
index a66508a11fe380e9277d1589ce9dc11eef2fe6a3..8fe433f8f1dac836983d57d5c3dde128697dc2d8 100644
--- a/gcc/config/aarch64/aarch64.c
+++ b/gcc/config/aarch64/aarch64.c
@@ -6490,6 +6490,23 @@  aarch64_rtx_costs (rtx x, machine_mode mode, int outer ATTRIBUTE_UNUSED,
               goto cost_minus;
             }
 
+	  if (GET_CODE (op0) == ZERO_EXTRACT && op1 == const0_rtx
+	      && GET_MODE (x) == CC_NZmode && CONST_INT_P (XEXP (op0, 1))
+	      && CONST_INT_P (XEXP (op0, 2)))
+	    {
+	      /* COMPARE of ZERO_EXTRACT form of TST-immediate.
+		 Handle it here directly rather than going to cost_logic
+		 since we know the immediate generated for the TST is valid
+		 so we can avoid creating an intermediate rtx for it only
+		 for costing purposes.  */
+	      if (speed)
+		*cost += extra_cost->alu.logical;
+
+	      *cost += rtx_cost (XEXP (op0, 0), GET_MODE (op0),
+				 ZERO_EXTRACT, 0, speed);
+	      return true;
+	    }
+
           if (GET_CODE (op1) == NEG)
             {
 	      /* CMN.  */