diff mbox

RFA: Clean up ADDRESS handling in alias.c

Message ID 87y5pxxc6e.fsf@talisman.home
State New
Headers show

Commit Message

Richard Sandiford April 15, 2012, 3:11 p.m. UTC
The comment in alias.c says:

   The contents of an ADDRESS is not normally used, the mode of the
   ADDRESS determines whether the ADDRESS is a function argument or some
   other special value.  Pointer equality, not rtx_equal_p, determines whether
   two ADDRESS expressions refer to the same base address.

   The only use of the contents of an ADDRESS is for determining if the
   current function performs nonlocal memory memory references for the
   purposes of marking the function as a constant function.  */

The first paragraph is a bit misleading IMO.  AFAICT, rtx_equal_p has
always given ADDRESS the full recursive treatment, rather than saying
that pointer equality determines ADDRESS equality.  (This is in contrast
to something like VALUE, where pointer equality is used.)  And AFAICT
we've always had:

static int
base_alias_check (rtx x, rtx y, enum machine_mode x_mode,
		  enum machine_mode y_mode)
{
  ...
  /* If the base addresses are equal nothing is known about aliasing.  */
  if (rtx_equal_p (x_base, y_base))
    return 1;
  ...
}

So I think the contents of an ADDRESS _are_ used to distinguish
between different bases.

The second paragraph ceased to be true in 2005 when the pure/const
analysis moved to its own IPA pass.  Nothing now looks at the contents
beyond rtx_equal_p.

Also, base_alias_check effectively treats all arguments as a single base.
That makes conceptual sense, because this analysis isn't strong enough
to determine whether arguments are base values at all, never mind whether
accesses based on different arguments conflict.  But the fact that we have
a single base isn't obvious from the way the code is written, because we
create several separate, non-rtx_equal_p, ADDRESSes to represent arguments.
See:

  for (i = 0; i < FIRST_PSEUDO_REGISTER; i++)
    /* Check whether this register can hold an incoming pointer
       argument.  FUNCTION_ARG_REGNO_P tests outgoing register
       numbers, so translate if necessary due to register windows.  */
    if (FUNCTION_ARG_REGNO_P (OUTGOING_REGNO (i))
	&& HARD_REGNO_MODE_OK (i, Pmode))
      static_reg_base_value[i]
	= gen_rtx_ADDRESS (VOIDmode, gen_rtx_REG (Pmode, i));

and:

      /* Check for an argument passed in memory.  Only record in the
	 copying-arguments block; it is too hard to track changes
	 otherwise.  */
      if (copying_arguments
	  && (XEXP (src, 0) == arg_pointer_rtx
	      || (GET_CODE (XEXP (src, 0)) == PLUS
		  && XEXP (XEXP (src, 0), 0) == arg_pointer_rtx)))
	return gen_rtx_ADDRESS (VOIDmode, src);

I think it would be cleaner and less wasteful to use a single rtx for
the single "base" (really "potential base").

So if we wanted to, we could now remove the operand from ADDRESS and
simply rely on pointer equality.  I'm a bit reluctant to do that though.
It would make debugging harder, and it would mean either adding knowledge
of this alias-specific code to other files (specifically rtl.c:rtx_equal_p),
or adding special ADDRESS shortcuts to alias.c.  But I think the code
would be more obvious if we replaced the rtx operand with a unique id,
which is what we already use for the REG_NOALIAS case:

      new_reg_base_value[regno] = gen_rtx_ADDRESS (Pmode,
						   GEN_INT (unique_id++));

And if we do that, we can make the id a direct operand of the ADDRESS,
rather than a CONST_INT subrtx[*].  That should make rtx_equal_p cheaper too.

  [*] I'm trying to get rid of CONST_INTs like these that have
      no obvious mode.

All of which led to the patch below.  I checked that it didn't change
the code generated at -O2 for a recent set of cc1 .ii files.  Also
bootstrapped & regression-tested on x86_64-linux-gnu.  OK to install?

To cover my back: I'm just trying to rewrite the current code according
to its current assumptions.  Whether those assumptions are correct or not
is always open to debate...

Richard


gcc/
	* rtl.def (ADDRESS): Turn operand into a HOST_WIDE_INT.
	* alias.c (reg_base_value): Expand and update comment.
	(arg_base_value): New variable.
	(unique_id): Move up file.
	(unique_base_value, unique_base_value_p, known_base_value_p): New.
	(find_base_value): Use arg_base_value and known_base_value_p.
	(record_set): Document REG_NOALIAS handling.  Use unique_base_value.
	(find_base_term): Use known_base_value_p.
	(base_alias_check): Use unique_base_value_p.
	(init_alias_target): Initialize arg_base_value.  Use unique_base_value.
	(init_alias_analysis): Use 1 as the first id for REG_NOALIAS bases.

Comments

Richard Biener April 17, 2012, 2:12 p.m. UTC | #1
On Sun, Apr 15, 2012 at 5:11 PM, Richard Sandiford
<rdsandiford@googlemail.com> wrote:
> The comment in alias.c says:
>
>   The contents of an ADDRESS is not normally used, the mode of the
>   ADDRESS determines whether the ADDRESS is a function argument or some
>   other special value.  Pointer equality, not rtx_equal_p, determines whether
>   two ADDRESS expressions refer to the same base address.
>
>   The only use of the contents of an ADDRESS is for determining if the
>   current function performs nonlocal memory memory references for the
>   purposes of marking the function as a constant function.  */
>
> The first paragraph is a bit misleading IMO.  AFAICT, rtx_equal_p has
> always given ADDRESS the full recursive treatment, rather than saying
> that pointer equality determines ADDRESS equality.  (This is in contrast
> to something like VALUE, where pointer equality is used.)  And AFAICT
> we've always had:
>
> static int
> base_alias_check (rtx x, rtx y, enum machine_mode x_mode,
>                  enum machine_mode y_mode)
> {
>  ...
>  /* If the base addresses are equal nothing is known about aliasing.  */
>  if (rtx_equal_p (x_base, y_base))
>    return 1;
>  ...
> }
>
> So I think the contents of an ADDRESS _are_ used to distinguish
> between different bases.
>
> The second paragraph ceased to be true in 2005 when the pure/const
> analysis moved to its own IPA pass.  Nothing now looks at the contents
> beyond rtx_equal_p.
>
> Also, base_alias_check effectively treats all arguments as a single base.
> That makes conceptual sense, because this analysis isn't strong enough
> to determine whether arguments are base values at all, never mind whether
> accesses based on different arguments conflict.  But the fact that we have
> a single base isn't obvious from the way the code is written, because we
> create several separate, non-rtx_equal_p, ADDRESSes to represent arguments.
> See:
>
>  for (i = 0; i < FIRST_PSEUDO_REGISTER; i++)
>    /* Check whether this register can hold an incoming pointer
>       argument.  FUNCTION_ARG_REGNO_P tests outgoing register
>       numbers, so translate if necessary due to register windows.  */
>    if (FUNCTION_ARG_REGNO_P (OUTGOING_REGNO (i))
>        && HARD_REGNO_MODE_OK (i, Pmode))
>      static_reg_base_value[i]
>        = gen_rtx_ADDRESS (VOIDmode, gen_rtx_REG (Pmode, i));
>
> and:
>
>      /* Check for an argument passed in memory.  Only record in the
>         copying-arguments block; it is too hard to track changes
>         otherwise.  */
>      if (copying_arguments
>          && (XEXP (src, 0) == arg_pointer_rtx
>              || (GET_CODE (XEXP (src, 0)) == PLUS
>                  && XEXP (XEXP (src, 0), 0) == arg_pointer_rtx)))
>        return gen_rtx_ADDRESS (VOIDmode, src);
>
> I think it would be cleaner and less wasteful to use a single rtx for
> the single "base" (really "potential base").
>
> So if we wanted to, we could now remove the operand from ADDRESS and
> simply rely on pointer equality.  I'm a bit reluctant to do that though.
> It would make debugging harder, and it would mean either adding knowledge
> of this alias-specific code to other files (specifically rtl.c:rtx_equal_p),
> or adding special ADDRESS shortcuts to alias.c.  But I think the code
> would be more obvious if we replaced the rtx operand with a unique id,
> which is what we already use for the REG_NOALIAS case:
>
>      new_reg_base_value[regno] = gen_rtx_ADDRESS (Pmode,
>                                                   GEN_INT (unique_id++));
>
> And if we do that, we can make the id a direct operand of the ADDRESS,
> rather than a CONST_INT subrtx[*].  That should make rtx_equal_p cheaper too.
>
>  [*] I'm trying to get rid of CONST_INTs like these that have
>      no obvious mode.
>
> All of which led to the patch below.  I checked that it didn't change
> the code generated at -O2 for a recent set of cc1 .ii files.  Also
> bootstrapped & regression-tested on x86_64-linux-gnu.  OK to install?
>
> To cover my back: I'm just trying to rewrite the current code according
> to its current assumptions.  Whether those assumptions are correct or not
> is always open to debate...

This all looks reasonable and matches what I discovered by reverse
engineering the last time I ran into ADDRESSes ...

So, ok, given that nobody else has commented yet.

Thanks,
Richard.

> Richard
>
>
> gcc/
>        * rtl.def (ADDRESS): Turn operand into a HOST_WIDE_INT.
>        * alias.c (reg_base_value): Expand and update comment.
>        (arg_base_value): New variable.
>        (unique_id): Move up file.
>        (unique_base_value, unique_base_value_p, known_base_value_p): New.
>        (find_base_value): Use arg_base_value and known_base_value_p.
>        (record_set): Document REG_NOALIAS handling.  Use unique_base_value.
>        (find_base_term): Use known_base_value_p.
>        (base_alias_check): Use unique_base_value_p.
>        (init_alias_target): Initialize arg_base_value.  Use unique_base_value.
>        (init_alias_analysis): Use 1 as the first id for REG_NOALIAS bases.
>
> Index: gcc/rtl.def
> ===================================================================
> --- gcc/rtl.def 2012-04-15 15:23:52.747632394 +0100
> +++ gcc/rtl.def 2012-04-15 15:58:48.234515667 +0100
> @@ -109,8 +109,8 @@ DEF_RTL_EXPR(INSN_LIST, "insn_list", "ue
>    `emit_insn' takes the SEQUENCE apart and makes separate insns.  */
>  DEF_RTL_EXPR(SEQUENCE, "sequence", "E", RTX_EXTRA)
>
> -/* Refers to the address of its argument.  This is only used in alias.c.  */
> -DEF_RTL_EXPR(ADDRESS, "address", "e", RTX_MATCH)
> +/* Represents a non-global base address.  This is only used in alias.c.  */
> +DEF_RTL_EXPR(ADDRESS, "address", "w", RTX_EXTRA)
>
>  /* ----------------------------------------------------------------------
>    Expression types used for things in the instruction chain.
> Index: gcc/alias.c
> ===================================================================
> --- gcc/alias.c 2012-04-15 15:23:52.748632394 +0100
> +++ gcc/alias.c 2012-04-15 15:59:45.073512500 +0100
> @@ -187,21 +187,42 @@ #define MAX_ALIAS_LOOP_PASSES 10
>    of the first set.
>
>    A base address can be an ADDRESS, SYMBOL_REF, or LABEL_REF.  ADDRESS
> -   expressions represent certain special values: function arguments and
> -   the stack, frame, and argument pointers.
> +   expressions represent three types of base:
>
> -   The contents of an ADDRESS is not normally used, the mode of the
> -   ADDRESS determines whether the ADDRESS is a function argument or some
> -   other special value.  Pointer equality, not rtx_equal_p, determines whether
> -   two ADDRESS expressions refer to the same base address.
> -
> -   The only use of the contents of an ADDRESS is for determining if the
> -   current function performs nonlocal memory memory references for the
> -   purposes of marking the function as a constant function.  */
> +     1. incoming arguments.  There is just one ADDRESS to represent all
> +       arguments, since we do not know at this level whether accesses
> +       based on different arguments can alias.  The ADDRESS has id 0.
> +
> +     2. stack_pointer_rtx, frame_pointer_rtx, hard_frame_pointer_rtx
> +       (if distinct from frame_pointer_rtx) and arg_pointer_rtx.
> +       Each of these rtxes has a separate ADDRESS associated with it,
> +       each with a negative id.
> +
> +       GCC is (and is required to be) precise in which register it
> +       chooses to access a particular region of stack.  We can therefore
> +       assume that accesses based on one of these rtxes do not alias
> +       accesses based on another of these rtxes.
> +
> +     3. bases that are derived from malloc()ed memory (REG_NOALIAS).
> +       Each such piece of memory has a separate ADDRESS associated
> +       with it, each with an id greater than 0.
> +
> +   Accesses based on one ADDRESS do not alias accesses based on other
> +   ADDRESSes.  Accesses based on ADDRESSes in groups (2) and (3) do not
> +   alias globals either; the ADDRESSes have Pmode to indicate this.
> +   The ADDRESS in group (1) _may_ alias globals; it has VOIDmode to
> +   indicate this.  */
>
>  static GTY(()) VEC(rtx,gc) *reg_base_value;
>  static rtx *new_reg_base_value;
>
> +/* The single VOIDmode ADDRESS that represents all argument bases.
> +   It has id 0.  */
> +static GTY(()) rtx arg_base_value;
> +
> +/* Used to allocate unique ids to each REG_NOALIAS ADDRESS.  */
> +static int unique_id;
> +
>  /* We preserve the copy of old array around to avoid amount of garbage
>    produced.  About 8% of garbage produced were attributed to this
>    array.  */
> @@ -993,6 +1014,43 @@ get_frame_alias_set (void)
>   return frame_set;
>  }
>
> +/* Create a new, unique base with id ID.  */
> +
> +static rtx
> +unique_base_value (HOST_WIDE_INT id)
> +{
> +  return gen_rtx_ADDRESS (Pmode, id);
> +}
> +
> +/* Return true if accesses based on any other base value cannot access
> +   those based on X.  */
> +
> +static bool
> +unique_base_value_p (rtx x)
> +{
> +  return GET_CODE (x) == ADDRESS && GET_MODE (x) == Pmode;
> +}
> +
> +/* Return true if X is known to be a base value.  */
> +
> +static bool
> +known_base_value_p (rtx x)
> +{
> +  switch (GET_CODE (x))
> +    {
> +    case LABEL_REF:
> +    case SYMBOL_REF:
> +      return true;
> +
> +    case ADDRESS:
> +      /* Arguments may or may not be bases; we don't know for sure.  */
> +      return GET_MODE (x) != VOIDmode;
> +
> +    default:
> +      return false;
> +    }
> +}
> +
>  /* Inside SRC, the source of a SET, find a base address.  */
>
>  static rtx
> @@ -1049,7 +1107,7 @@ find_base_value (rtx src)
>          && (XEXP (src, 0) == arg_pointer_rtx
>              || (GET_CODE (XEXP (src, 0)) == PLUS
>                  && XEXP (XEXP (src, 0), 0) == arg_pointer_rtx)))
> -       return gen_rtx_ADDRESS (VOIDmode, src);
> +       return arg_base_value;
>       return 0;
>
>     case CONST:
> @@ -1090,18 +1148,10 @@ find_base_value (rtx src)
>        /* If either base is named object or a special address
>           (like an argument or stack reference), then use it for the
>           base term.  */
> -       if (src_0 != 0
> -           && (GET_CODE (src_0) == SYMBOL_REF
> -               || GET_CODE (src_0) == LABEL_REF
> -               || (GET_CODE (src_0) == ADDRESS
> -                   && GET_MODE (src_0) != VOIDmode)))
> +       if (src_0 != 0 && known_base_value_p (src_0))
>          return src_0;
>
> -       if (src_1 != 0
> -           && (GET_CODE (src_1) == SYMBOL_REF
> -               || GET_CODE (src_1) == LABEL_REF
> -               || (GET_CODE (src_1) == ADDRESS
> -                   && GET_MODE (src_1) != VOIDmode)))
> +       if (src_1 != 0 && known_base_value_p (src_1))
>          return src_1;
>
>        /* Guess which operand is the base address:
> @@ -1169,16 +1219,14 @@ find_base_value (rtx src)
>   return 0;
>  }
>
> -/* Called from init_alias_analysis indirectly through note_stores.  */
> +/* Called from init_alias_analysis indirectly through note_stores,
> +   or directly if DEST is a register with a REG_NOALIAS note attached.
> +   SET is null in the latter case.  */
>
>  /* While scanning insns to find base values, reg_seen[N] is nonzero if
>    register N has been set in this function.  */
>  static char *reg_seen;
>
> -/* Addresses which are known not to alias anything else are identified
> -   by a unique integer.  */
> -static int unique_id;
> -
>  static void
>  record_set (rtx dest, const_rtx set, void *data ATTRIBUTE_UNUSED)
>  {
> @@ -1223,14 +1271,14 @@ record_set (rtx dest, const_rtx set, voi
>     }
>   else
>     {
> +      /* There's a REG_NOALIAS note against DEST.  */
>       if (reg_seen[regno])
>        {
>          new_reg_base_value[regno] = 0;
>          return;
>        }
>       reg_seen[regno] = 1;
> -      new_reg_base_value[regno] = gen_rtx_ADDRESS (Pmode,
> -                                                  GEN_INT (unique_id++));
> +      new_reg_base_value[regno] = unique_base_value (unique_id++);
>       return;
>     }
>
> @@ -1666,18 +1714,10 @@ find_base_term (rtx x)
>        /* If either base term is named object or a special address
>           (like an argument or stack reference), then use it for the
>           base term.  */
> -       if (tmp1 != 0
> -           && (GET_CODE (tmp1) == SYMBOL_REF
> -               || GET_CODE (tmp1) == LABEL_REF
> -               || (GET_CODE (tmp1) == ADDRESS
> -                   && GET_MODE (tmp1) != VOIDmode)))
> +       if (tmp1 != 0 && known_base_value_p (tmp1))
>          return tmp1;
>
> -       if (tmp2 != 0
> -           && (GET_CODE (tmp2) == SYMBOL_REF
> -               || GET_CODE (tmp2) == LABEL_REF
> -               || (GET_CODE (tmp2) == ADDRESS
> -                   && GET_MODE (tmp2) != VOIDmode)))
> +       if (tmp2 != 0 && known_base_value_p (tmp2))
>          return tmp2;
>
>        /* We could not determine which of the two operands was the
> @@ -1762,12 +1802,7 @@ base_alias_check (rtx x, rtx y, enum mac
>   if (GET_CODE (x_base) != ADDRESS && GET_CODE (y_base) != ADDRESS)
>     return 0;
>
> -  /* If one address is a stack reference there can be no alias:
> -     stack references using different base registers do not alias,
> -     a stack reference can not alias a parameter, and a stack reference
> -     can not alias a global.  */
> -  if ((GET_CODE (x_base) == ADDRESS && GET_MODE (x_base) == Pmode)
> -      || (GET_CODE (y_base) == ADDRESS && GET_MODE (y_base) == Pmode))
> +  if (unique_base_value_p (x_base) || unique_base_value_p (y_base))
>     return 0;
>
>   return 1;
> @@ -2686,6 +2721,9 @@ init_alias_target (void)
>  {
>   int i;
>
> +  if (!arg_base_value)
> +    arg_base_value = gen_rtx_ADDRESS (VOIDmode, 0);
> +
>   memset (static_reg_base_value, 0, sizeof static_reg_base_value);
>
>   for (i = 0; i < FIRST_PSEUDO_REGISTER; i++)
> @@ -2694,18 +2732,13 @@ init_alias_target (void)
>        numbers, so translate if necessary due to register windows.  */
>     if (FUNCTION_ARG_REGNO_P (OUTGOING_REGNO (i))
>        && HARD_REGNO_MODE_OK (i, Pmode))
> -      static_reg_base_value[i]
> -       = gen_rtx_ADDRESS (VOIDmode, gen_rtx_REG (Pmode, i));
> +      static_reg_base_value[i] = arg_base_value;
>
> -  static_reg_base_value[STACK_POINTER_REGNUM]
> -    = gen_rtx_ADDRESS (Pmode, stack_pointer_rtx);
> -  static_reg_base_value[ARG_POINTER_REGNUM]
> -    = gen_rtx_ADDRESS (Pmode, arg_pointer_rtx);
> -  static_reg_base_value[FRAME_POINTER_REGNUM]
> -    = gen_rtx_ADDRESS (Pmode, frame_pointer_rtx);
> +  static_reg_base_value[STACK_POINTER_REGNUM] = unique_base_value (-1);
> +  static_reg_base_value[ARG_POINTER_REGNUM] = unique_base_value (-2);
> +  static_reg_base_value[FRAME_POINTER_REGNUM] = unique_base_value (-3);
>  #if !HARD_FRAME_POINTER_IS_FRAME_POINTER
> -  static_reg_base_value[HARD_FRAME_POINTER_REGNUM]
> -    = gen_rtx_ADDRESS (Pmode, hard_frame_pointer_rtx);
> +  static_reg_base_value[HARD_FRAME_POINTER_REGNUM] = unique_base_value (-4);
>  #endif
>  }
>
> @@ -2791,8 +2824,8 @@ init_alias_analysis (void)
>       changed = 0;
>
>       /* We want to assign the same IDs each iteration of this loop, so
> -        start counting from zero each iteration of the loop.  */
> -      unique_id = 0;
> +        start counting from one each iteration of the loop.  */
> +      unique_id = 1;
>
>       /* We're at the start of the function each iteration through the
>         loop, so we're copying arguments.  */
H.J. Lu April 17, 2012, 6:27 p.m. UTC | #2
On Sun, Apr 15, 2012 at 8:11 AM, Richard Sandiford
<rdsandiford@googlemail.com> wrote:
> The comment in alias.c says:
>
>   The contents of an ADDRESS is not normally used, the mode of the
>   ADDRESS determines whether the ADDRESS is a function argument or some
>   other special value.  Pointer equality, not rtx_equal_p, determines whether
>   two ADDRESS expressions refer to the same base address.
>
>   The only use of the contents of an ADDRESS is for determining if the
>   current function performs nonlocal memory memory references for the
>   purposes of marking the function as a constant function.  */
>
> The first paragraph is a bit misleading IMO.  AFAICT, rtx_equal_p has
> always given ADDRESS the full recursive treatment, rather than saying
> that pointer equality determines ADDRESS equality.  (This is in contrast
> to something like VALUE, where pointer equality is used.)  And AFAICT
> we've always had:
>
> static int
> base_alias_check (rtx x, rtx y, enum machine_mode x_mode,
>                  enum machine_mode y_mode)
> {
>  ...
>  /* If the base addresses are equal nothing is known about aliasing.  */
>  if (rtx_equal_p (x_base, y_base))
>    return 1;
>  ...
> }
>
> So I think the contents of an ADDRESS _are_ used to distinguish
> between different bases.
>
> The second paragraph ceased to be true in 2005 when the pure/const
> analysis moved to its own IPA pass.  Nothing now looks at the contents
> beyond rtx_equal_p.
>
> Also, base_alias_check effectively treats all arguments as a single base.
> That makes conceptual sense, because this analysis isn't strong enough
> to determine whether arguments are base values at all, never mind whether
> accesses based on different arguments conflict.  But the fact that we have
> a single base isn't obvious from the way the code is written, because we
> create several separate, non-rtx_equal_p, ADDRESSes to represent arguments.
> See:
>
>  for (i = 0; i < FIRST_PSEUDO_REGISTER; i++)
>    /* Check whether this register can hold an incoming pointer
>       argument.  FUNCTION_ARG_REGNO_P tests outgoing register
>       numbers, so translate if necessary due to register windows.  */
>    if (FUNCTION_ARG_REGNO_P (OUTGOING_REGNO (i))
>        && HARD_REGNO_MODE_OK (i, Pmode))
>      static_reg_base_value[i]
>        = gen_rtx_ADDRESS (VOIDmode, gen_rtx_REG (Pmode, i));
>
> and:
>
>      /* Check for an argument passed in memory.  Only record in the
>         copying-arguments block; it is too hard to track changes
>         otherwise.  */
>      if (copying_arguments
>          && (XEXP (src, 0) == arg_pointer_rtx
>              || (GET_CODE (XEXP (src, 0)) == PLUS
>                  && XEXP (XEXP (src, 0), 0) == arg_pointer_rtx)))
>        return gen_rtx_ADDRESS (VOIDmode, src);
>
> I think it would be cleaner and less wasteful to use a single rtx for
> the single "base" (really "potential base").
>
> So if we wanted to, we could now remove the operand from ADDRESS and
> simply rely on pointer equality.  I'm a bit reluctant to do that though.
> It would make debugging harder, and it would mean either adding knowledge
> of this alias-specific code to other files (specifically rtl.c:rtx_equal_p),
> or adding special ADDRESS shortcuts to alias.c.  But I think the code
> would be more obvious if we replaced the rtx operand with a unique id,
> which is what we already use for the REG_NOALIAS case:
>
>      new_reg_base_value[regno] = gen_rtx_ADDRESS (Pmode,
>                                                   GEN_INT (unique_id++));
>
> And if we do that, we can make the id a direct operand of the ADDRESS,
> rather than a CONST_INT subrtx[*].  That should make rtx_equal_p cheaper too.
>
>  [*] I'm trying to get rid of CONST_INTs like these that have
>      no obvious mode.
>
> All of which led to the patch below.  I checked that it didn't change
> the code generated at -O2 for a recent set of cc1 .ii files.  Also
> bootstrapped & regression-tested on x86_64-linux-gnu.  OK to install?
>
> To cover my back: I'm just trying to rewrite the current code according
> to its current assumptions.  Whether those assumptions are correct or not
> is always open to debate...
>
> Richard
>
>
> gcc/
>        * rtl.def (ADDRESS): Turn operand into a HOST_WIDE_INT.
>        * alias.c (reg_base_value): Expand and update comment.
>        (arg_base_value): New variable.
>        (unique_id): Move up file.
>        (unique_base_value, unique_base_value_p, known_base_value_p): New.
>        (find_base_value): Use arg_base_value and known_base_value_p.
>        (record_set): Document REG_NOALIAS handling.  Use unique_base_value.
>        (find_base_term): Use known_base_value_p.
>        (base_alias_check): Use unique_base_value_p.
>        (init_alias_target): Initialize arg_base_value.  Use unique_base_value.
>        (init_alias_analysis): Use 1 as the first id for REG_NOALIAS bases.
>

This breaks bootstrap on Linux/x86:


home/regress/tbox/native/build/./gcc/xgcc
-B/home/regress/tbox/native/build/./gcc/
-B/home/regress/tbox/objs/i686-pc-linux-gnu/bin/
-B/home/regress/tbox/objs/i686-pc-linux-gnu/lib/ -isystem
/home/regress/tbox/objs/i686-pc-linux-gnu/include -isystem
/home/regress/tbox/objs/i686-pc-linux-gnu/sys-include    -g -O2 -O2
-g -O2 -DIN_GCC   -W -Wall -Wwrite-strings -Wcast-qual
-Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition
-isystem ./include   -fpic -g -DIN_LIBGCC2 -fbuilding-libgcc
-fno-stack-protector   -fpic -I. -I. -I../.././gcc
-I/home/regress/tbox/svn-gcc/libgcc
-I/home/regress/tbox/svn-gcc/libgcc/.
-I/home/regress/tbox/svn-gcc/libgcc/../gcc
-I/home/regress/tbox/svn-gcc/libgcc/../include
-I/home/regress/tbox/svn-gcc/libgcc/config/libbid
-DENABLE_DECIMAL_BID_FORMAT -DHAVE_CC_TLS  -DUSE_TLS -o __main_s.o -MT
__main_s.o -MD -MP -MF __main_s.dep -DSHARED -DL__main -c
/home/regress/tbox/svn-gcc/libgcc/libgcc2.c
In file included from
/home/regress/tbox/svn-gcc/libgcc/unwind-dw2-fde-dip.c:79:0:
/home/regress/tbox/svn-gcc/libgcc/unwind-dw2-fde.c: In function 'search_object':
/home/regress/tbox/svn-gcc/libgcc/unwind-dw2-fde.c:997:1: internal
compiler error: Segmentation fault
 }
 ^
Please submit a full bug report,
with preprocessed source if appropriate.
See <http://gcc.gnu.org/bugs.html> for instructions.
make[3]: *** [unwind-dw2-fde-dip.o] Error 1
make[3]: *** Waiting for unfinished jobs....
make[3]: Leaving directory
`/home/regress/tbox/native/build/i686-pc-linux-gnu/libgcc'
make[2]: *** [all-stage1-target-libgcc] Error 2
make[2]: Leaving directory `/home/regress/tbox/native/build'
make[1]: *** [stage1-bubble] Error 2
make[1]: Leaving directory `/home/regress/tbox/native/build'
make: *** [bootstrap] Error 2
diff mbox

Patch

Index: gcc/rtl.def
===================================================================
--- gcc/rtl.def	2012-04-15 15:23:52.747632394 +0100
+++ gcc/rtl.def	2012-04-15 15:58:48.234515667 +0100
@@ -109,8 +109,8 @@  DEF_RTL_EXPR(INSN_LIST, "insn_list", "ue
    `emit_insn' takes the SEQUENCE apart and makes separate insns.  */
 DEF_RTL_EXPR(SEQUENCE, "sequence", "E", RTX_EXTRA)
 
-/* Refers to the address of its argument.  This is only used in alias.c.  */
-DEF_RTL_EXPR(ADDRESS, "address", "e", RTX_MATCH)
+/* Represents a non-global base address.  This is only used in alias.c.  */
+DEF_RTL_EXPR(ADDRESS, "address", "w", RTX_EXTRA)
 
 /* ----------------------------------------------------------------------
    Expression types used for things in the instruction chain.
Index: gcc/alias.c
===================================================================
--- gcc/alias.c	2012-04-15 15:23:52.748632394 +0100
+++ gcc/alias.c	2012-04-15 15:59:45.073512500 +0100
@@ -187,21 +187,42 @@  #define MAX_ALIAS_LOOP_PASSES 10
    of the first set.
 
    A base address can be an ADDRESS, SYMBOL_REF, or LABEL_REF.  ADDRESS
-   expressions represent certain special values: function arguments and
-   the stack, frame, and argument pointers.
+   expressions represent three types of base:
 
-   The contents of an ADDRESS is not normally used, the mode of the
-   ADDRESS determines whether the ADDRESS is a function argument or some
-   other special value.  Pointer equality, not rtx_equal_p, determines whether
-   two ADDRESS expressions refer to the same base address.
-
-   The only use of the contents of an ADDRESS is for determining if the
-   current function performs nonlocal memory memory references for the
-   purposes of marking the function as a constant function.  */
+     1. incoming arguments.  There is just one ADDRESS to represent all
+	arguments, since we do not know at this level whether accesses
+	based on different arguments can alias.  The ADDRESS has id 0.
+
+     2. stack_pointer_rtx, frame_pointer_rtx, hard_frame_pointer_rtx
+	(if distinct from frame_pointer_rtx) and arg_pointer_rtx.
+	Each of these rtxes has a separate ADDRESS associated with it,
+	each with a negative id.
+
+	GCC is (and is required to be) precise in which register it
+	chooses to access a particular region of stack.  We can therefore
+	assume that accesses based on one of these rtxes do not alias
+	accesses based on another of these rtxes.
+
+     3. bases that are derived from malloc()ed memory (REG_NOALIAS).
+	Each such piece of memory has a separate ADDRESS associated
+	with it, each with an id greater than 0.
+
+   Accesses based on one ADDRESS do not alias accesses based on other
+   ADDRESSes.  Accesses based on ADDRESSes in groups (2) and (3) do not
+   alias globals either; the ADDRESSes have Pmode to indicate this.
+   The ADDRESS in group (1) _may_ alias globals; it has VOIDmode to
+   indicate this.  */
 
 static GTY(()) VEC(rtx,gc) *reg_base_value;
 static rtx *new_reg_base_value;
 
+/* The single VOIDmode ADDRESS that represents all argument bases.
+   It has id 0.  */
+static GTY(()) rtx arg_base_value;
+
+/* Used to allocate unique ids to each REG_NOALIAS ADDRESS.  */
+static int unique_id;
+
 /* We preserve the copy of old array around to avoid amount of garbage
    produced.  About 8% of garbage produced were attributed to this
    array.  */
@@ -993,6 +1014,43 @@  get_frame_alias_set (void)
   return frame_set;
 }
 
+/* Create a new, unique base with id ID.  */
+
+static rtx
+unique_base_value (HOST_WIDE_INT id)
+{
+  return gen_rtx_ADDRESS (Pmode, id);
+}
+
+/* Return true if accesses based on any other base value cannot access
+   those based on X.  */
+
+static bool
+unique_base_value_p (rtx x)
+{
+  return GET_CODE (x) == ADDRESS && GET_MODE (x) == Pmode;
+}
+
+/* Return true if X is known to be a base value.  */
+
+static bool
+known_base_value_p (rtx x)
+{
+  switch (GET_CODE (x))
+    {
+    case LABEL_REF:
+    case SYMBOL_REF:
+      return true;
+
+    case ADDRESS:
+      /* Arguments may or may not be bases; we don't know for sure.  */
+      return GET_MODE (x) != VOIDmode;
+
+    default:
+      return false;
+    }
+}
+
 /* Inside SRC, the source of a SET, find a base address.  */
 
 static rtx
@@ -1049,7 +1107,7 @@  find_base_value (rtx src)
 	  && (XEXP (src, 0) == arg_pointer_rtx
 	      || (GET_CODE (XEXP (src, 0)) == PLUS
 		  && XEXP (XEXP (src, 0), 0) == arg_pointer_rtx)))
-	return gen_rtx_ADDRESS (VOIDmode, src);
+	return arg_base_value;
       return 0;
 
     case CONST:
@@ -1090,18 +1148,10 @@  find_base_value (rtx src)
 	/* If either base is named object or a special address
 	   (like an argument or stack reference), then use it for the
 	   base term.  */
-	if (src_0 != 0
-	    && (GET_CODE (src_0) == SYMBOL_REF
-		|| GET_CODE (src_0) == LABEL_REF
-		|| (GET_CODE (src_0) == ADDRESS
-		    && GET_MODE (src_0) != VOIDmode)))
+	if (src_0 != 0 && known_base_value_p (src_0))
 	  return src_0;
 
-	if (src_1 != 0
-	    && (GET_CODE (src_1) == SYMBOL_REF
-		|| GET_CODE (src_1) == LABEL_REF
-		|| (GET_CODE (src_1) == ADDRESS
-		    && GET_MODE (src_1) != VOIDmode)))
+	if (src_1 != 0 && known_base_value_p (src_1))
 	  return src_1;
 
 	/* Guess which operand is the base address:
@@ -1169,16 +1219,14 @@  find_base_value (rtx src)
   return 0;
 }
 
-/* Called from init_alias_analysis indirectly through note_stores.  */
+/* Called from init_alias_analysis indirectly through note_stores,
+   or directly if DEST is a register with a REG_NOALIAS note attached.
+   SET is null in the latter case.  */
 
 /* While scanning insns to find base values, reg_seen[N] is nonzero if
    register N has been set in this function.  */
 static char *reg_seen;
 
-/* Addresses which are known not to alias anything else are identified
-   by a unique integer.  */
-static int unique_id;
-
 static void
 record_set (rtx dest, const_rtx set, void *data ATTRIBUTE_UNUSED)
 {
@@ -1223,14 +1271,14 @@  record_set (rtx dest, const_rtx set, voi
     }
   else
     {
+      /* There's a REG_NOALIAS note against DEST.  */
       if (reg_seen[regno])
 	{
 	  new_reg_base_value[regno] = 0;
 	  return;
 	}
       reg_seen[regno] = 1;
-      new_reg_base_value[regno] = gen_rtx_ADDRESS (Pmode,
-						   GEN_INT (unique_id++));
+      new_reg_base_value[regno] = unique_base_value (unique_id++);
       return;
     }
 
@@ -1666,18 +1714,10 @@  find_base_term (rtx x)
 	/* If either base term is named object or a special address
 	   (like an argument or stack reference), then use it for the
 	   base term.  */
-	if (tmp1 != 0
-	    && (GET_CODE (tmp1) == SYMBOL_REF
-		|| GET_CODE (tmp1) == LABEL_REF
-		|| (GET_CODE (tmp1) == ADDRESS
-		    && GET_MODE (tmp1) != VOIDmode)))
+	if (tmp1 != 0 && known_base_value_p (tmp1))
 	  return tmp1;
 
-	if (tmp2 != 0
-	    && (GET_CODE (tmp2) == SYMBOL_REF
-		|| GET_CODE (tmp2) == LABEL_REF
-		|| (GET_CODE (tmp2) == ADDRESS
-		    && GET_MODE (tmp2) != VOIDmode)))
+	if (tmp2 != 0 && known_base_value_p (tmp2))
 	  return tmp2;
 
 	/* We could not determine which of the two operands was the
@@ -1762,12 +1802,7 @@  base_alias_check (rtx x, rtx y, enum mac
   if (GET_CODE (x_base) != ADDRESS && GET_CODE (y_base) != ADDRESS)
     return 0;
 
-  /* If one address is a stack reference there can be no alias:
-     stack references using different base registers do not alias,
-     a stack reference can not alias a parameter, and a stack reference
-     can not alias a global.  */
-  if ((GET_CODE (x_base) == ADDRESS && GET_MODE (x_base) == Pmode)
-      || (GET_CODE (y_base) == ADDRESS && GET_MODE (y_base) == Pmode))
+  if (unique_base_value_p (x_base) || unique_base_value_p (y_base))
     return 0;
 
   return 1;
@@ -2686,6 +2721,9 @@  init_alias_target (void)
 {
   int i;
 
+  if (!arg_base_value)
+    arg_base_value = gen_rtx_ADDRESS (VOIDmode, 0);
+
   memset (static_reg_base_value, 0, sizeof static_reg_base_value);
 
   for (i = 0; i < FIRST_PSEUDO_REGISTER; i++)
@@ -2694,18 +2732,13 @@  init_alias_target (void)
        numbers, so translate if necessary due to register windows.  */
     if (FUNCTION_ARG_REGNO_P (OUTGOING_REGNO (i))
 	&& HARD_REGNO_MODE_OK (i, Pmode))
-      static_reg_base_value[i]
-	= gen_rtx_ADDRESS (VOIDmode, gen_rtx_REG (Pmode, i));
+      static_reg_base_value[i] = arg_base_value;
 
-  static_reg_base_value[STACK_POINTER_REGNUM]
-    = gen_rtx_ADDRESS (Pmode, stack_pointer_rtx);
-  static_reg_base_value[ARG_POINTER_REGNUM]
-    = gen_rtx_ADDRESS (Pmode, arg_pointer_rtx);
-  static_reg_base_value[FRAME_POINTER_REGNUM]
-    = gen_rtx_ADDRESS (Pmode, frame_pointer_rtx);
+  static_reg_base_value[STACK_POINTER_REGNUM] = unique_base_value (-1);
+  static_reg_base_value[ARG_POINTER_REGNUM] = unique_base_value (-2);
+  static_reg_base_value[FRAME_POINTER_REGNUM] = unique_base_value (-3);
 #if !HARD_FRAME_POINTER_IS_FRAME_POINTER
-  static_reg_base_value[HARD_FRAME_POINTER_REGNUM]
-    = gen_rtx_ADDRESS (Pmode, hard_frame_pointer_rtx);
+  static_reg_base_value[HARD_FRAME_POINTER_REGNUM] = unique_base_value (-4);
 #endif
 }
 
@@ -2791,8 +2824,8 @@  init_alias_analysis (void)
       changed = 0;
 
       /* We want to assign the same IDs each iteration of this loop, so
-	 start counting from zero each iteration of the loop.  */
-      unique_id = 0;
+	 start counting from one each iteration of the loop.  */
+      unique_id = 1;
 
       /* We're at the start of the function each iteration through the
 	 loop, so we're copying arguments.  */