Simplify a VEC_SELECT fed by its own inverse

Message ID	1398088880.19378.28.camel@gnopaine
State	New
Headers	show Return-Path: <gcc-patches-return-365438-incoming=patchwork.ozlabs.org@gcc.gnu.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender :message-id:subject:from:to:cc:date:content-type :content-transfer-encoding:mime-version; q=dns; s=default; b=kqv UQfP5aqCXObu1cCwc5Y5Gz2x/mdfDA8mrnIIymqtxwCb6lgZ0PwX6bh/0bFAhdoL kUFc/NN9SfQP2/f/dDJ39v25GKcvPxiHED6QxJwSujx9FzMPVrid7h+h0jfq6mw4 JNY8FXtNAR6P7Npp5bz3M6sW695cP9zBlPNk86tk= Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org Gateway: Authorized Use Only! Violators will be prosecuted for <gcc-patches@gcc.gnu.org> from <wschmidt@linux.vnet.ibm.com>; Mon, 21 Apr 2014 10:01:21 -0400 Gateway: Authorized Use Only! Violators will be prosecuted; Mon, 21 Apr 2014 10:01:19 -0400 Message-ID: <1398088880.19378.28.camel@gnopaine> Subject: [PATCH] Simplify a VEC_SELECT fed by its own inverse From: Bill Schmidt <wschmidt@linux.vnet.ibm.com> To: gcc-patches@gcc.gnu.org Cc: ebotcazou@libertysurf.fr, rth@redhat.com Date: Mon, 21 Apr 2014 09:01:20 -0500 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit Mime-Version: 1.0

Message ID

1398088880.19378.28.camel@gnopaine

State

New

Headers

DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id
	:list-unsubscribe:list-archive:list-post:list-help:sender
	:message-id:subject:from:to:cc:date:content-type
	:content-transfer-encoding:mime-version; q=dns; s=default; b=kqv
	UQfP5aqCXObu1cCwc5Y5Gz2x/mdfDA8mrnIIymqtxwCb6lgZ0PwX6bh/0bFAhdoL
	kUFc/NN9SfQP2/f/dDJ39v25GKcvPxiHED6QxJwSujx9FzMPVrid7h+h0jfq6mw4
	JNY8FXtNAR6P7Npp5bz3M6sW695cP9zBlPNk86tk=
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: gcc-patches-owner@gcc.gnu.org
Message-ID: <1398088880.19378.28.camel@gnopaine>
Subject: [PATCH] Simplify a VEC_SELECT fed by its own inverse
From: Bill Schmidt <wschmidt@linux.vnet.ibm.com>
To: gcc-patches@gcc.gnu.org
Cc: ebotcazou@libertysurf.fr, rth@redhat.com
Date: Mon, 21 Apr 2014 09:01:20 -0500
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0

Commit Message

Bill Schmidt April 21, 2014, 2:01 p.m. UTC

Hi,

This patch adds a small RTL simplification for the case where the first
operand to a VEC_SELECT is another VEC_SELECT with an inverse selection
function.  E.g.,

(vec_select:V4SF (vec_select:V4SF (OP:V4SF) (parallel [2 3 0 1]))
                 (parallel [2 3 0 1]))

may be simplified to (OP:V4SF).  This comes up in practice on
powerpc64le-linux-gnu because of the characteristics of certain loads
and stores in the Power7 and later ISAs.  When running in little endian
modes, one of these loads must be followed by a permute to reorder the
vector elements, and one of these stores must be preceded by a permute
to once again reorder them.  Thus a simple copy may end up with two
permutes whose effect cancels each other out.  The patch ensures those
redundancies are detected and removed.

Note that it would be possible to do a more general transformation here,
in which any vec_select feeding another could be replaced by a
vec_select performing the composite function of the other two.  I have
not done this because I am unaware of this situation arising in
practice.  If it's desirable, I can extend the patch in this direction.

Bootstrapped and tested on powerpc64[,le]-unknown-linux-gnu with no
regressions.  Is this ok for trunk?

Thanks,
Bill


[gcc]

2014-04-21  Bill Schmidt  <wschmidt@linux.vnet.ibm.com>

	* simplify-rtx.c (simplify_binary_operation_1): Optimize case of
	nested VEC_SELECTs that are inverses of each other.

[gcc/testsuite]

2014-04-21  Bill Schmidt  <wschmidt@linux.vnet.ibm.com>

	* gcc.target/powerpc/vsxcopy.c: New test.

Comments

Bill Schmidt April 21, 2014, 5:29 p.m. UTC | #1

Hi Marc,

Good points!  I will rework the patch with your suggestions in mind.

Thanks!
Bill

On Mon, 2014-04-21 at 18:51 +0200, Marc Glisse wrote:
> On Mon, 21 Apr 2014, Bill Schmidt wrote:
> 
> > Note that it would be possible to do a more general transformation here,
> > in which any vec_select feeding another could be replaced by a
> > vec_select performing the composite function of the other two.  I have
> > not done this because I am unaware of this situation arising in
> > practice.  If it's desirable, I can extend the patch in this direction.
> 
> It does arise, but I think it isn't done because not all permutations are 
> (optimally) supported by all targets.
> 
> > Index: gcc/simplify-rtx.c
> > ===================================================================
> > --- gcc/simplify-rtx.c	(revision 209516)
> > +++ gcc/simplify-rtx.c	(working copy)
> > @@ -3673,6 +3673,34 @@ simplify_binary_operation_1 (enum rtx_code code, e
> > 	    }
> > 	}
> >
> > +      /* If we have two nested selects that are inverses of each
> > +	 other, replace them with the source operand.  */
> > +      if (GET_CODE (trueop0) == VEC_SELECT)
> > +	{
> > +	  enum machine_mode reg_mode = GET_MODE (XEXP (trueop0, 0));
> > +	  rtx op0_subop1 = XEXP (trueop0, 1);
> > +	  gcc_assert (VECTOR_MODE_P (reg_mode));
> > +	  gcc_assert (GET_MODE_INNER (mode) == GET_MODE_INNER (reg_mode));
> > +	  gcc_assert (GET_CODE (op0_subop1) == PARALLEL);
> > +
> > +	  if (XVECLEN (trueop1, 0) == XVECLEN (op0_subop1, 0))
> > +	    {
> > +	      /* Apply the second ordering vector to the first.
> > +		 If the result is { 0, 1, ..., n-1 } then the
> > +		 two VEC_SELECTs cancel.  */
> > +	      for (int i = 0; i < XVECLEN (trueop1, 0); ++i)
> > +		{
> > +		  rtx x = XVECEXP (trueop1, 0, i);
> > +		  gcc_assert (CONST_INT_P (x));
> > +		  rtx y = XVECEXP (op0_subop1, 0, INTVAL (x));
> > +		  gcc_assert (CONST_INT_P (y));
> > +		  if (i != INTVAL (y))
> > +		    return 0;
> > +		}
> > +	      return XEXP (trueop0, 0);
> > +	    }
> > +	}
> 
> I may have missed it, but don't you want to check that what you are 
> returning has the right mode/length (or generate the obvious vec_select 
> otherwise)? I don't know if any platform has such constructions (probably 
> not), but in principle you could start from a vector of size 4, extract 
> {1,0} from it, extract {1,0} from that, and you don't want to return the 
> initial vector as is. On the other hand, I don't think you really care 
> whether trueop1 is smaller than op0_subop1. Starting from a vector of size 
> 2, extracting {1,0,1,0} then {3,0} gives the initial vector just fine.
>

Index: gcc/simplify-rtx.c
===================================================================
--- gcc/simplify-rtx.c	(revision 209516)
+++ gcc/simplify-rtx.c	(working copy)
@@ -3673,6 +3673,34 @@  simplify_binary_operation_1 (enum rtx_code code, e
 	    }
 	}
 
+      /* If we have two nested selects that are inverses of each
+	 other, replace them with the source operand.  */
+      if (GET_CODE (trueop0) == VEC_SELECT)
+	{
+	  enum machine_mode reg_mode = GET_MODE (XEXP (trueop0, 0));
+	  rtx op0_subop1 = XEXP (trueop0, 1);
+	  gcc_assert (VECTOR_MODE_P (reg_mode));
+	  gcc_assert (GET_MODE_INNER (mode) == GET_MODE_INNER (reg_mode));
+	  gcc_assert (GET_CODE (op0_subop1) == PARALLEL);
+
+	  if (XVECLEN (trueop1, 0) == XVECLEN (op0_subop1, 0))
+	    {
+	      /* Apply the second ordering vector to the first.
+		 If the result is { 0, 1, ..., n-1 } then the
+		 two VEC_SELECTs cancel.  */
+	      for (int i = 0; i < XVECLEN (trueop1, 0); ++i)
+		{
+		  rtx x = XVECEXP (trueop1, 0, i);
+		  gcc_assert (CONST_INT_P (x));
+		  rtx y = XVECEXP (op0_subop1, 0, INTVAL (x));
+		  gcc_assert (CONST_INT_P (y));
+		  if (i != INTVAL (y))
+		    return 0;
+		}
+	      return XEXP (trueop0, 0);
+	    }
+	}
+
       return 0;
     case VEC_CONCAT:
       {
Index: gcc/testsuite/gcc.target/powerpc/vsxcopy.c
===================================================================
--- gcc/testsuite/gcc.target/powerpc/vsxcopy.c	(revision 0)
+++ gcc/testsuite/gcc.target/powerpc/vsxcopy.c	(working copy)
@@ -0,0 +1,15 @@ 
+/* { dg-do compile { target { powerpc64*-*-* } } } */
+/* { dg-require-effective-target powerpc_vsx_ok } */
+/* { dg-options "-O1" } */
+/* { dg-final { scan-assembler "lxvd2x" } } */
+/* { dg-final { scan-assembler "stxvd2x" } } */
+/* { dg-final { scan-assembler-not "xxpermdi" } } */
+
+typedef float vecf __attribute__ ((vector_size (16)));
+extern vecf j, k;
+
+void fun (void)
+{
+  j = k;
+}
+

Simplify a VEC_SELECT fed by its own inverse

Commit Message

Comments

Patch