[AArch64_be] Don't fold reduction intrinsics.

Message ID	1406715564-23681-1-git-send-email-james.greenhalgh@arm.com
State	New
Headers	show Return-Path: <gcc-patches-return-373591-incoming=patchwork.ozlabs.org@gcc.gnu.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:from :to:cc:subject:date:message-id:mime-version:content-type; q=dns; s=default; b=ibB4Ti9QFZHSMKeewUr97jqjczbyw4yr0tmev/sKJRebOtPHk8 1PGxXMbcR/2dDkyD+VuBkuKDsYgU8Q2SSX7lEaZloTl52Jkq2B8LP//TUZuwjLtd zLN5h9xw+A1sb8wRBBINzuGbgkW92R+n4L2eQMBSxfLkPa87Z8Fl11L4c= Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org From: James Greenhalgh <james.greenhalgh@arm.com> To: gcc-patches@gcc.gnu.org Cc: alan.lawrence@arm.com, marcus.shawcroft@arm.com Subject: [AArch64_be] Don't fold reduction intrinsics. Date: Wed, 30 Jul 2014 11:19:24 +0100 Message-Id: <1406715564-23681-1-git-send-email-james.greenhalgh@arm.com> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="------------1.8.3-rc0"

Message ID

1406715564-23681-1-git-send-email-james.greenhalgh@arm.com

State

New

Headers

DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id
	:list-unsubscribe:list-archive:list-post:list-help:sender:from
	:to:cc:subject:date:message-id:mime-version:content-type; q=dns;
	s=default; b=ibB4Ti9QFZHSMKeewUr97jqjczbyw4yr0tmev/sKJRebOtPHk8
	1PGxXMbcR/2dDkyD+VuBkuKDsYgU8Q2SSX7lEaZloTl52Jkq2B8LP//TUZuwjLtd
	zLN5h9xw+A1sb8wRBBINzuGbgkW92R+n4L2eQMBSxfLkPa87Z8Fl11L4c=
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: gcc-patches-owner@gcc.gnu.org
From: James Greenhalgh <james.greenhalgh@arm.com>
To: gcc-patches@gcc.gnu.org
Cc: alan.lawrence@arm.com,	marcus.shawcroft@arm.com
Subject: [AArch64_be] Don't fold reduction intrinsics.
Date: Wed, 30 Jul 2014 11:19:24 +0100
Message-Id: <1406715564-23681-1-git-send-email-james.greenhalgh@arm.com>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="------------1.8.3-rc0"

Commit Message

James Greenhalgh July 30, 2014, 10:19 a.m. UTC

Hi,

Reduction operations are defined in tree.def to

   return a vector of the same type, with the first element in the vector
   holding the result of the reduction of all elements of the operand.  The
   content of the other elements in the returned vector is undefined.

The reduction intrinsics map to AArch64's reduction instructions (addv and
friends). These return their result in architectural lane 0. In GCC's view,
this is at the opposite end of the vector from element 0.

It is therefore not correct to make this fold for BYTES_BIG_ENDIAN.

Tested big/little-endian with no issues on aarch64-none-elf.

OK?

Thanks,
James

---
gcc/

2014-07-28  James Greenhalgh  <james.greenhalgh@arm.com>

	* config/aarch64/aarch64-builtins.c
	(aarch64_gimple_fold_builtin): Don't fold reduction operations for
	BYTES_BIG_ENDIAN.

Comments

Marcus Shawcroft July 31, 2014, 1:36 p.m. UTC | #1

On 30 July 2014 11:19, James Greenhalgh <james.greenhalgh@arm.com> wrote:

> 2014-07-28  James Greenhalgh  <james.greenhalgh@arm.com>
>
>         * config/aarch64/aarch64-builtins.c
>         (aarch64_gimple_fold_builtin): Don't fold reduction operations for
>         BYTES_BIG_ENDIAN.

OK /Marcus

diff --git a/gcc/config/aarch64/aarch64-builtins.c b/gcc/config/aarch64/aarch64-builtins.c
index fee17ec..58db77e 100644
--- a/gcc/config/aarch64/aarch64-builtins.c
+++ b/gcc/config/aarch64/aarch64-builtins.c
@@ -1383,6 +1383,20 @@  aarch64_gimple_fold_builtin (gimple_stmt_iterator *gsi)
   tree call = gimple_call_fn (stmt);
   tree fndecl;
   gimple new_stmt = NULL;
+
+  /* The operations folded below are reduction operations.  These are
+     defined to leave their result in the 0'th element (from the perspective
+     of GCC).  The architectural instruction we are folding will leave the
+     result in the 0'th element (from the perspective of the architecture).
+     For big-endian systems, these perspectives are not aligned.
+
+     It is therefore wrong to perform this fold on big-endian.  There
+     are some tricks we could play with shuffling, but the mid-end is
+     inconsistent in the way it treats reduction operations, so we will
+     end up in difficulty.  Until we fix the ambiguity - just bail out.  */
+  if (BYTES_BIG_ENDIAN)
+    return false;
+
   if (call)
     {
       fndecl = gimple_call_fndecl (stmt);

[AArch64_be] Don't fold reduction intrinsics.

Commit Message

Comments

Patch