Add support for masked load/store_lanes

Message ID 87efp8wwu2.fsf@linaro.org
State New
Series Add support for masked load/store_lanes

Commit Message

Richard Sandiford Nov. 8, 2017, 4:37 p.m. UTC
This patch adds support for vectorising groups of IFN_MASK_LOADs
and IFN_MASK_STOREs using conditional load/store-lanes instructions.
This requires new internal functions to represent the result
(IFN_MASK_{LOAD,STORE}_LANES), as well as associated optabs.

The normal IFN_{LOAD,STORE}_LANES functions are const operations
that logically just perform the permute: the load or store is
encoded as a MEM operand to the call statement.  In contrast,
the IFN_MASK_{LOAD,STORE}_LANES functions use the same kind of
interface as IFN_MASK_{LOAD,STORE}, since the memory is only
conditionally accessed.
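
As a rough sketch of the difference (mirroring the comments added to
vectorizable_load and vectorizable_store below; the exact GIMPLE dump
syntax may differ slightly), the two call forms are:

    /* Unconditional form: the memory access is the MEM_REF operand.  */
    VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]);
    MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY);

    /* Masked form: the pointer, alias pointer and mask are explicit
       arguments, as for IFN_MASK_LOAD and IFN_MASK_STORE.  */
    VEC_ARRAY = MASK_LOAD_LANES (DATAREF_PTR, ALIAS_PTR, VEC_MASK);
    MASK_STORE_LANES (DATAREF_PTR, ALIAS_PTR, VEC_MASK, VEC_ARRAY);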

The AArch64 patterns were added as part of the main LD[234]/ST[234] patch.

Tested on aarch64-linux-gnu (both with and without SVE), x86_64-linux-gnu
and powerpc64le-linux-gnu.  OK to install?

Thanks,
Richard


2017-11-08  Richard Sandiford  <richard.sandiford@linaro.org>
	    Alan Hayward  <alan.hayward@arm.com>
	    David Sherwood  <david.sherwood@arm.com>

gcc/
	* optabs.def (vec_mask_load_lanes_optab): New optab.
	(vec_mask_store_lanes_optab): Likewise.
	* internal-fn.def (MASK_LOAD_LANES): New internal function.
	(MASK_STORE_LANES): Likewise.
	* internal-fn.c (mask_load_lanes_direct): New macro.
	(mask_store_lanes_direct): Likewise.
	(expand_mask_load_optab_fn): Handle masked operations.
	(expand_mask_load_lanes_optab_fn): New macro.
	(expand_mask_store_optab_fn): Handle masked operations.
	(expand_mask_store_lanes_optab_fn): New macro.
	(direct_mask_load_lanes_optab_supported_p): Likewise.
	(direct_mask_store_lanes_optab_supported_p): Likewise.
	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
	parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-data-refs.c (strip_conversion): New function.
	(can_group_stmts_p): Likewise.
	(vect_analyze_data_ref_accesses): Use it instead of checking
	for a pair of assignments.
	(vect_store_lanes_supported): Take a masked_p parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
	vect_store_lanes_supported and vect_load_lanes_supported.
	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
	* tree-vect-stmts.c (replace_mask_load): New function, split
	out from vectorizable_mask_load_store.  Keep the group information
	up-to-date.
	(get_store_op): New function.
	(get_group_load_store_type): Take a masked_p parameter.  Don't
	allow gaps for masked accesses.  Use get_store_op.  Update calls
	to vect_store_lanes_supported and vect_load_lanes_supported.
	(get_load_store_type): Take a masked_p parameter and update
	call to get_group_load_store_type.
	(init_stored_values, advance_stored_values): New functions,
	split out from vectorizable_store.
	(do_load_lanes, do_store_lanes): New functions.
	(get_masked_group_alias_ptr_type): New function.
	(vectorizable_mask_load_store): Update call to get_load_store_type.
	Handle masked VMAT_LOAD_STORE_LANES.  Update GROUP_STORE_COUNT
	when vectorizing a group of stores and only vectorize when we
	reach the last statement in the group.  Vectorize the first
	statement in a group of loads.  Use an array aggregate type
	rather than a vector type for load/store_lanes.  Use
	init_stored_values, advance_stored_values, do_load_lanes,
	do_store_lanes, get_masked_group_alias_ptr_type and replace_mask_load.
	(vectorizable_store): Update call to get_load_store_type.
	Use init_stored_values, advance_stored_values and do_store_lanes.
	(vectorizable_load): Update call to get_load_store_type.
	Use do_load_lanes.
	(vect_transform_stmt): Set grouped_store for grouped IFN_MASK_STOREs.
	Only set is_store for the last element in the group.

gcc/testsuite/
	* gcc.dg/vect/vect-ooo-group-1.c: New test.
	* gcc.target/aarch64/sve_mask_struct_load_1.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_1_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_2.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_2_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_3.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_3_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_4.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_5.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_6.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_7.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_8.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_1.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_1_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_2.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_2_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_3.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_3_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_4.c: Likewise.

Comments

Richard Sandiford Nov. 17, 2017, 9:36 a.m. UTC | #1
Richard Sandiford <richard.sandiford@linaro.org> writes:
> This patch adds support for vectorising groups of IFN_MASK_LOADs
> and IFN_MASK_STOREs using conditional load/store-lanes instructions.
> This requires new internal functions to represent the result
> (IFN_MASK_{LOAD,STORE}_LANES), as well as associated optabs.
>
> The normal IFN_{LOAD,STORE}_LANES functions are const operations
> that logically just perform the permute: the load or store is
> encoded as a MEM operand to the call statement.  In contrast,
> the IFN_MASK_{LOAD,STORE}_LANES functions use the same kind of
> interface as IFN_MASK_{LOAD,STORE}, since the memory is only
> conditionally accessed.
>
> The AArch64 patterns were added as part of the main LD[234]/ST[234] patch.
>
> Tested on aarch64-linux-gnu (both with and without SVE), x86_64-linux-gnu
> and powerpc64le-linux-gnu.  OK to install?

Here's an updated (and much simpler) version that applies on top of the
series I just posted to remove vectorizable_mask_load_store.  Tested as
before.

Thanks,
Richard


2017-11-17  Richard Sandiford  <richard.sandiford@linaro.org>
	    Alan Hayward  <alan.hayward@arm.com>
	    David Sherwood  <david.sherwood@arm.com>

gcc/
	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
	* optabs.def (vec_mask_load_lanes_optab): New optab.
	(vec_mask_store_lanes_optab): Likewise.
	* internal-fn.def (MASK_LOAD_LANES): New internal function.
	(MASK_STORE_LANES): Likewise.
	* internal-fn.c (mask_load_lanes_direct): New macro.
	(mask_store_lanes_direct): Likewise.
	(expand_mask_load_optab_fn): Handle masked operations.
	(expand_mask_load_lanes_optab_fn): New macro.
	(expand_mask_store_optab_fn): Handle masked operations.
	(expand_mask_store_lanes_optab_fn): New macro.
	(direct_mask_load_lanes_optab_supported_p): Likewise.
	(direct_mask_store_lanes_optab_supported_p): Likewise.
	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
	parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-data-refs.c (strip_conversion): New function.
	(can_group_stmts_p): Likewise.
	(vect_analyze_data_ref_accesses): Use it instead of checking
	for a pair of assignments.
	(vect_store_lanes_supported): Take a masked_p parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
	vect_store_lanes_supported and vect_load_lanes_supported.
	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
	parameter.  Don't allow gaps for masked accesses.
	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
	and vect_load_lanes_supported.
	(get_load_store_type): Take a masked_p parameter and update
	call to get_group_load_store_type.
	(vectorizable_store): Update call to get_load_store_type.
	Handle IFN_MASK_STORE_LANES.
	(vectorizable_load): Update call to get_load_store_type.
	Handle IFN_MASK_LOAD_LANES.

gcc/testsuite/
	* gcc.dg/vect/vect-ooo-group-1.c: New test.
	* gcc.target/aarch64/sve_mask_struct_load_1.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_1_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_2.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_2_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_3.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_3_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_4.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_5.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_6.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_7.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_load_8.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_1.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_1_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_2.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_2_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_3.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_3_run.c: Likewise.
	* gcc.target/aarch64/sve_mask_struct_store_4.c: Likewise.

Index: gcc/doc/md.texi
===================================================================
--- gcc/doc/md.texi	2017-11-17 09:06:19.783260344 +0000
+++ gcc/doc/md.texi	2017-11-17 09:35:23.400133274 +0000
@@ -4855,6 +4855,23 @@ loads for vectors of mode @var{n}.
 
 This pattern is not allowed to @code{FAIL}.
 
+@cindex @code{vec_mask_load_lanes@var{m}@var{n}} instruction pattern
+@item @samp{vec_mask_load_lanes@var{m}@var{n}}
+Like @samp{vec_load_lanes@var{m}@var{n}}, but takes an additional
+mask operand (operand 2) that specifies which elements of the destination
+vectors should be loaded.  Other elements of the destination
+vectors are set to zero.  The operation is equivalent to:
+
+@smallexample
+int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
+for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
+  if (operand2[j])
+    for (i = 0; i < c; i++)
+      operand0[i][j] = operand1[j * c + i];
+@end smallexample
+
+This pattern is not allowed to @code{FAIL}.
+
 @cindex @code{vec_store_lanes@var{m}@var{n}} instruction pattern
 @item @samp{vec_store_lanes@var{m}@var{n}}
 Equivalent to @samp{vec_load_lanes@var{m}@var{n}}, with the memory
@@ -4872,6 +4889,22 @@ for a memory operand 0 and register oper
 
 This pattern is not allowed to @code{FAIL}.
 
+@cindex @code{vec_mask_store_lanes@var{m}@var{n}} instruction pattern
+@item @samp{vec_mask_store_lanes@var{m}@var{n}}
+Like @samp{vec_store_lanes@var{m}@var{n}}, but takes an additional
+mask operand (operand 2) that specifies which elements of the source
+vectors should be stored.  The operation is equivalent to:
+
+@smallexample
+int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
+for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
+  if (operand2[j])
+    for (i = 0; i < c; i++)
+      operand0[j * c + i] = operand1[i][j];
+@end smallexample
+
+This pattern is not allowed to @code{FAIL}.
+
 @cindex @code{vec_set@var{m}} instruction pattern
 @item @samp{vec_set@var{m}}
 Set given field in the vector value.  Operand 0 is the vector to modify,
Index: gcc/optabs.def
===================================================================
--- gcc/optabs.def	2017-11-17 09:35:23.086033247 +0000
+++ gcc/optabs.def	2017-11-17 09:35:23.401033274 +0000
@@ -80,6 +80,8 @@ OPTAB_CD(ssmsub_widen_optab, "ssmsub$b$a
 OPTAB_CD(usmsub_widen_optab, "usmsub$a$b4")
 OPTAB_CD(vec_load_lanes_optab, "vec_load_lanes$a$b")
 OPTAB_CD(vec_store_lanes_optab, "vec_store_lanes$a$b")
+OPTAB_CD(vec_mask_load_lanes_optab, "vec_mask_load_lanes$a$b")
+OPTAB_CD(vec_mask_store_lanes_optab, "vec_mask_store_lanes$a$b")
 OPTAB_CD(vcond_optab, "vcond$a$b")
 OPTAB_CD(vcondu_optab, "vcondu$a$b")
 OPTAB_CD(vcondeq_optab, "vcondeq$a$b")
Index: gcc/internal-fn.def
===================================================================
--- gcc/internal-fn.def	2017-11-17 09:35:23.086033247 +0000
+++ gcc/internal-fn.def	2017-11-17 09:35:23.401033274 +0000
@@ -45,9 +45,11 @@ along with GCC; see the file COPYING3.
 
    - mask_load: currently just maskload
    - load_lanes: currently just vec_load_lanes
+   - mask_load_lanes: currently just vec_mask_load_lanes
 
    - mask_store: currently just maskstore
    - store_lanes: currently just vec_store_lanes
+   - mask_store_lanes: currently just vec_mask_store_lanes
 
    DEF_INTERNAL_FLT_FN is like DEF_INTERNAL_OPTAB_FN, but in addition,
    the function implements the computational part of a built-in math
@@ -92,9 +94,13 @@ along with GCC; see the file COPYING3.
 
 DEF_INTERNAL_OPTAB_FN (MASK_LOAD, ECF_PURE, maskload, mask_load)
 DEF_INTERNAL_OPTAB_FN (LOAD_LANES, ECF_CONST, vec_load_lanes, load_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_LOAD_LANES, ECF_PURE,
+		       vec_mask_load_lanes, mask_load_lanes)
 
 DEF_INTERNAL_OPTAB_FN (MASK_STORE, 0, maskstore, mask_store)
 DEF_INTERNAL_OPTAB_FN (STORE_LANES, ECF_CONST, vec_store_lanes, store_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_STORE_LANES, 0,
+		       vec_mask_store_lanes, mask_store_lanes)
 
 DEF_INTERNAL_OPTAB_FN (RSQRT, ECF_CONST, rsqrt, unary)
 
Index: gcc/internal-fn.c
===================================================================
--- gcc/internal-fn.c	2017-11-17 09:35:23.086033247 +0000
+++ gcc/internal-fn.c	2017-11-17 09:35:23.401033274 +0000
@@ -82,8 +82,10 @@ #define DEF_INTERNAL_FN(CODE, FLAGS, FNS
 #define not_direct { -2, -2, false }
 #define mask_load_direct { -1, 2, false }
 #define load_lanes_direct { -1, -1, false }
+#define mask_load_lanes_direct { -1, -1, false }
 #define mask_store_direct { 3, 2, false }
 #define store_lanes_direct { 0, 0, false }
+#define mask_store_lanes_direct { 0, 0, false }
 #define unary_direct { 0, 0, true }
 #define binary_direct { 0, 0, true }
 
@@ -2363,7 +2365,7 @@ expand_LOOP_DIST_ALIAS (internal_fn, gca
   gcc_unreachable ();
 }
 
-/* Expand MASK_LOAD call STMT using optab OPTAB.  */
+/* Expand MASK_LOAD{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_load_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2372,6 +2374,7 @@ expand_mask_load_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, target, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   lhs = gimple_call_lhs (stmt);
@@ -2384,6 +2387,12 @@ expand_mask_load_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   rhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_load_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (rhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2391,12 +2400,12 @@ expand_mask_load_optab_fn (internal_fn,
   create_output_operand (&ops[0], target, TYPE_MODE (type));
   create_fixed_operand (&ops[1], mem);
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
-/* Expand MASK_STORE call STMT using optab OPTAB.  */
+#define expand_mask_load_lanes_optab_fn expand_mask_load_optab_fn
+
+/* Expand MASK_STORE{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_store_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2405,6 +2414,7 @@ expand_mask_store_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, reg, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   rhs = gimple_call_arg (stmt, 3);
@@ -2415,6 +2425,12 @@ expand_mask_store_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   lhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_store_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2422,11 +2438,11 @@ expand_mask_store_optab_fn (internal_fn,
   create_fixed_operand (&ops[0], mem);
   create_input_operand (&ops[1], reg, TYPE_MODE (type));
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
+#define expand_mask_store_lanes_optab_fn expand_mask_store_optab_fn
+
 static void
 expand_ABNORMAL_DISPATCHER (internal_fn, gcall *)
 {
@@ -2818,8 +2834,10 @@ #define direct_unary_optab_supported_p d
 #define direct_binary_optab_supported_p direct_optab_supported_p
 #define direct_mask_load_optab_supported_p direct_optab_supported_p
 #define direct_load_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_load_lanes_optab_supported_p multi_vector_optab_supported_p
 #define direct_mask_store_optab_supported_p direct_optab_supported_p
 #define direct_store_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_store_lanes_optab_supported_p multi_vector_optab_supported_p
 
 /* Return true if FN is supported for the types in TYPES when the
    optimization type is OPT_TYPE.  The types are those associated with
Index: gcc/tree-vectorizer.h
===================================================================
--- gcc/tree-vectorizer.h	2017-11-17 09:35:23.086033247 +0000
+++ gcc/tree-vectorizer.h	2017-11-17 09:35:23.406433274 +0000
@@ -1292,9 +1292,9 @@ extern tree bump_vector_ptr (tree, gimpl
 			     tree);
 extern tree vect_create_destination_var (tree, tree);
 extern bool vect_grouped_store_supported (tree, unsigned HOST_WIDE_INT);
-extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern bool vect_grouped_load_supported (tree, bool, unsigned HOST_WIDE_INT);
-extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern void vect_permute_store_chain (vec<tree> ,unsigned int, gimple *,
                                     gimple_stmt_iterator *, vec<tree> *);
 extern tree vect_setup_realignment (gimple *, gimple_stmt_iterator *, tree *,
Index: gcc/tree-vect-data-refs.c
===================================================================
--- gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.085133247 +0000
+++ gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.404633274 +0000
@@ -2791,6 +2791,62 @@ dr_group_sort_cmp (const void *dra_, con
   return cmp;
 }
 
+/* If OP is the result of a conversion, return the unconverted value,
+   otherwise return null.  */
+
+static tree
+strip_conversion (tree op)
+{
+  if (TREE_CODE (op) != SSA_NAME)
+    return NULL_TREE;
+  gimple *stmt = SSA_NAME_DEF_STMT (op);
+  if (!is_gimple_assign (stmt)
+      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
+    return NULL_TREE;
+  return gimple_assign_rhs1 (stmt);
+}
+
+/* Return true if vectorizable_* routines can handle statements STMT1
+   and STMT2 being in a single group.  */
+
+static bool
+can_group_stmts_p (gimple *stmt1, gimple *stmt2)
+{
+  if (gimple_assign_single_p (stmt1))
+    return gimple_assign_single_p (stmt2);
+
+  if (is_gimple_call (stmt1) && gimple_call_internal_p (stmt1))
+    {
+      /* Check for two masked loads or two masked stores.  */
+      if (!is_gimple_call (stmt2) || !gimple_call_internal_p (stmt2))
+	return false;
+      internal_fn ifn = gimple_call_internal_fn (stmt1);
+      if (ifn != IFN_MASK_LOAD && ifn != IFN_MASK_STORE)
+	return false;
+      if (ifn != gimple_call_internal_fn (stmt2))
+	return false;
+
+      /* Check that the masks are the same.  Cope with casts of masks,
+	 like those created by build_mask_conversion.  */
+      tree mask1 = gimple_call_arg (stmt1, 2);
+      tree mask2 = gimple_call_arg (stmt2, 2);
+      if (!operand_equal_p (mask1, mask2, 0))
+	{
+	  mask1 = strip_conversion (mask1);
+	  if (!mask1)
+	    return false;
+	  mask2 = strip_conversion (mask2);
+	  if (!mask2)
+	    return false;
+	  if (!operand_equal_p (mask1, mask2, 0))
+	    return false;
+	}
+      return true;
+    }
+
+  return false;
+}
+
 /* Function vect_analyze_data_ref_accesses.
 
    Analyze the access pattern of all the data references in the loop.
@@ -2857,8 +2913,7 @@ vect_analyze_data_ref_accesses (vec_info
 	      || data_ref_compare_tree (DR_BASE_ADDRESS (dra),
 					DR_BASE_ADDRESS (drb)) != 0
 	      || data_ref_compare_tree (DR_OFFSET (dra), DR_OFFSET (drb)) != 0
-	      || !gimple_assign_single_p (DR_STMT (dra))
-	      || !gimple_assign_single_p (DR_STMT (drb)))
+	      || !can_group_stmts_p (DR_STMT (dra), DR_STMT (drb)))
 	    break;
 
 	  /* Check that the data-refs have the same constant size.  */
@@ -4662,15 +4717,21 @@ vect_grouped_store_supported (tree vecty
 }
 
 
-/* Return TRUE if vec_store_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{mask_}store_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			    bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_store_lanes",
-				       vec_store_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_store_lanes",
+					 vec_mask_store_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_store_lanes",
+					 vec_store_lanes_optab,
+					 vectype, count);
 }
 
 
@@ -5238,15 +5299,21 @@ vect_grouped_load_supported (tree vectyp
   return false;
 }
 
-/* Return TRUE if vec_load_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{mask_}load_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			   bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_load_lanes",
-				       vec_load_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_load_lanes",
+					 vec_mask_load_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_load_lanes",
+					 vec_load_lanes_optab,
+					 vectype, count);
 }
 
 /* Function vect_permute_load_chain.
Index: gcc/tree-vect-loop.c
===================================================================
--- gcc/tree-vect-loop.c	2017-11-17 09:35:23.086033247 +0000
+++ gcc/tree-vect-loop.c	2017-11-17 09:35:23.404633274 +0000
@@ -2247,7 +2247,7 @@ vect_analyze_loop_2 (loop_vec_info loop_
       vinfo = vinfo_for_stmt (STMT_VINFO_GROUP_FIRST_ELEMENT (vinfo));
       unsigned int size = STMT_VINFO_GROUP_SIZE (vinfo);
       tree vectype = STMT_VINFO_VECTYPE (vinfo);
-      if (! vect_store_lanes_supported (vectype, size)
+      if (! vect_store_lanes_supported (vectype, size, false)
 	  && ! vect_grouped_store_supported (vectype, size))
 	return false;
       FOR_EACH_VEC_ELT (SLP_INSTANCE_LOADS (instance), j, node)
@@ -2257,7 +2257,7 @@ vect_analyze_loop_2 (loop_vec_info loop_
 	  bool single_element_p = !STMT_VINFO_GROUP_NEXT_ELEMENT (vinfo);
 	  size = STMT_VINFO_GROUP_SIZE (vinfo);
 	  vectype = STMT_VINFO_VECTYPE (vinfo);
-	  if (! vect_load_lanes_supported (vectype, size)
+	  if (! vect_load_lanes_supported (vectype, size, false)
 	      && ! vect_grouped_load_supported (vectype, single_element_p,
 						size))
 	    return false;
Index: gcc/tree-vect-slp.c
===================================================================
--- gcc/tree-vect-slp.c	2017-11-17 09:35:23.086033247 +0000
+++ gcc/tree-vect-slp.c	2017-11-17 09:35:23.405533274 +0000
@@ -2175,7 +2175,7 @@ vect_analyze_slp_instance (vec_info *vin
 	 instructions do not generate this SLP instance.  */
       if (is_a <loop_vec_info> (vinfo)
 	  && loads_permuted
-	  && dr && vect_store_lanes_supported (vectype, group_size))
+	  && dr && vect_store_lanes_supported (vectype, group_size, false))
 	{
 	  slp_tree load_node;
 	  FOR_EACH_VEC_ELT (loads, i, load_node)
@@ -2188,7 +2188,7 @@ vect_analyze_slp_instance (vec_info *vin
 	      if (STMT_VINFO_STRIDED_P (stmt_vinfo)
 		  || ! vect_load_lanes_supported
 			(STMT_VINFO_VECTYPE (stmt_vinfo),
-			 GROUP_SIZE (stmt_vinfo)))
+			 GROUP_SIZE (stmt_vinfo), false))
 		break;
 	    }
 	  if (i == loads.length ())
Index: gcc/tree-vect-stmts.c
===================================================================
--- gcc/tree-vect-stmts.c	2017-11-17 09:35:23.086033247 +0000
+++ gcc/tree-vect-stmts.c	2017-11-17 09:35:23.405533274 +0000
@@ -1756,7 +1756,7 @@ vect_get_store_rhs (gimple *stmt)
 
 static bool
 get_group_load_store_type (gimple *stmt, tree vectype, bool slp,
-			   vec_load_store_type vls_type,
+			   bool masked_p, vec_load_store_type vls_type,
 			   vect_memory_access_type *memory_access_type)
 {
   stmt_vec_info stmt_info = vinfo_for_stmt (stmt);
@@ -1777,7 +1777,10 @@ get_group_load_store_type (gimple *stmt,
 
   /* True if we can cope with such overrun by peeling for gaps, so that
      there is at least one final scalar iteration after the vector loop.  */
-  bool can_overrun_p = (vls_type == VLS_LOAD && loop_vinfo && !loop->inner);
+  bool can_overrun_p = (!masked_p
+			&& vls_type == VLS_LOAD
+			&& loop_vinfo
+			&& !loop->inner);
 
   /* There can only be a gap at the end of the group if the stride is
      known at compile time.  */
@@ -1840,6 +1843,7 @@ get_group_load_store_type (gimple *stmt,
 	 and so we are guaranteed to access a non-gap element in the
 	 same B-sized block.  */
       if (would_overrun_p
+	  && !masked_p
 	  && gap < (vect_known_alignment_in_bytes (first_dr)
 		    / vect_get_scalar_dr_size (first_dr)))
 	would_overrun_p = false;
@@ -1850,8 +1854,8 @@ get_group_load_store_type (gimple *stmt,
 	{
 	  /* First try using LOAD/STORE_LANES.  */
 	  if (vls_type == VLS_LOAD
-	      ? vect_load_lanes_supported (vectype, group_size)
-	      : vect_store_lanes_supported (vectype, group_size))
+	      ? vect_load_lanes_supported (vectype, group_size, masked_p)
+	      : vect_store_lanes_supported (vectype, group_size, masked_p))
 	    {
 	      *memory_access_type = VMAT_LOAD_STORE_LANES;
 	      overrun_p = would_overrun_p;
@@ -1877,8 +1881,7 @@ get_group_load_store_type (gimple *stmt,
       gimple *next_stmt = GROUP_NEXT_ELEMENT (stmt_info);
       while (next_stmt)
 	{
-	  gcc_assert (gimple_assign_single_p (next_stmt));
-	  tree op = gimple_assign_rhs1 (next_stmt);
+	  tree op = vect_get_store_rhs (next_stmt);
 	  gimple *def_stmt;
 	  enum vect_def_type dt;
 	  if (!vect_is_simple_use (op, vinfo, &def_stmt, &dt))
@@ -1962,11 +1965,12 @@ get_negative_load_store_type (gimple *st
    or scatters, fill in GS_INFO accordingly.
 
    SLP says whether we're performing SLP rather than loop vectorization.
+   MASKED_P is true if the statement is conditional on a vectorized mask.
    VECTYPE is the vector type that the vectorized statements will use.
    NCOPIES is the number of vector statements that will be needed.  */
 
 static bool
-get_load_store_type (gimple *stmt, tree vectype, bool slp,
+get_load_store_type (gimple *stmt, tree vectype, bool slp, bool masked_p,
 		     vec_load_store_type vls_type, unsigned int ncopies,
 		     vect_memory_access_type *memory_access_type,
 		     gather_scatter_info *gs_info)
@@ -1994,7 +1998,7 @@ get_load_store_type (gimple *stmt, tree
     }
   else if (STMT_VINFO_GROUPED_ACCESS (stmt_info))
     {
-      if (!get_group_load_store_type (stmt, vectype, slp, vls_type,
+      if (!get_group_load_store_type (stmt, vectype, slp, masked_p, vls_type,
 				      memory_access_type))
 	return false;
     }
@@ -5733,23 +5737,26 @@ vectorizable_store (gimple *stmt, gimple
     return false;
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, vls_type, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, mask, vls_type, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
   if (mask)
     {
-      if (memory_access_type != VMAT_CONTIGUOUS)
+      if (memory_access_type == VMAT_CONTIGUOUS)
+	{
+	  if (!VECTOR_MODE_P (vec_mode)
+	      || !can_vec_mask_load_store_p (vec_mode,
+					     TYPE_MODE (mask_vectype), false))
+	    return false;
+	}
+      else if (memory_access_type != VMAT_LOAD_STORE_LANES)
 	{
 	  if (dump_enabled_p ())
 	    dump_printf_loc (MSG_MISSED_OPTIMIZATION, vect_location,
 			     "unsupported access type for masked store.\n");
 	  return false;
 	}
-      if (!VECTOR_MODE_P (vec_mode)
-	  || !can_vec_mask_load_store_p (vec_mode, TYPE_MODE (mask_vectype),
-					 false))
-	return false;
     }
   else
     {
@@ -6389,12 +6396,27 @@ vectorizable_store (gimple *stmt, gimple
 	      write_vector_array (stmt, gsi, vec_oprnd, vec_array, i);
 	    }
 
-	  /* Emit:
-	       MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_STORE_LANES, 1,
-						    vec_array);
-	  gimple_call_set_lhs (call, data_ref);
+	  gcall *call;
+	  if (mask)
+	    {
+	      /* Emit:
+		   MASK_STORE_LANES (DATAREF_PTR, ALIAS_PTR, VEC_MASK,
+				     VEC_ARRAY).  */
+	      unsigned int align = TYPE_ALIGN_UNIT (TREE_TYPE (vectype));
+	      tree alias_ptr = build_int_cst (ref_type, align);
+	      call = gimple_build_call_internal (IFN_MASK_STORE_LANES, 4,
+						 dataref_ptr, alias_ptr,
+						 vec_mask, vec_array);
+	    }
+	  else
+	    {
+	      /* Emit:
+		   MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
+	      data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
+	      call = gimple_build_call_internal (IFN_STORE_LANES, 1,
+						 vec_array);
+	      gimple_call_set_lhs (call, data_ref);
+	    }
 	  gimple_call_set_nothrow (call, true);
 	  new_stmt = call;
 	  vect_finish_stmt_generation (stmt, new_stmt, gsi);
@@ -6842,7 +6864,7 @@ vectorizable_load (gimple *stmt, gimple_
     }
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, VLS_LOAD, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, mask, VLS_LOAD, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
@@ -6850,8 +6872,9 @@ vectorizable_load (gimple *stmt, gimple_
     {
       if (memory_access_type == VMAT_CONTIGUOUS)
 	{
-	  if (!VECTOR_MODE_P (TYPE_MODE (vectype))
-	      || !can_vec_mask_load_store_p (TYPE_MODE (vectype),
+	  machine_mode vec_mode = TYPE_MODE (vectype);
+	  if (!VECTOR_MODE_P (vec_mode)
+	      || !can_vec_mask_load_store_p (vec_mode,
 					     TYPE_MODE (mask_vectype), true))
 	    return false;
 	}
@@ -6869,7 +6892,7 @@ vectorizable_load (gimple *stmt, gimple_
 	      return false;
 	    }
 	}
-      else
+      else if (memory_access_type != VMAT_LOAD_STORE_LANES)
 	{
 	  if (dump_enabled_p ())
 	    dump_printf_loc (MSG_MISSED_OPTIMIZATION, vect_location,
@@ -7419,11 +7442,25 @@ vectorizable_load (gimple *stmt, gimple_
 
 	  vec_array = create_vector_array (vectype, vec_num);
 
-	  /* Emit:
-	       VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_LOAD_LANES, 1,
-						    data_ref);
+	  gcall *call;
+	  if (mask)
+	    {
+	      /* Emit:
+		   VEC_ARRAY = MASK_LOAD_LANES (DATAREF_PTR, ALIAS_PTR,
+		                                VEC_MASK).  */
+	      unsigned int align = TYPE_ALIGN_UNIT (TREE_TYPE (vectype));
+	      tree alias_ptr = build_int_cst (ref_type, align);
+	      call = gimple_build_call_internal (IFN_MASK_LOAD_LANES, 3,
+						 dataref_ptr, alias_ptr,
+						 vec_mask);
+	    }
+	  else
+	    {
+	      /* Emit:
+		   VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
+	      data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
+	      call = gimple_build_call_internal (IFN_LOAD_LANES, 1, data_ref);
+	    }
 	  gimple_call_set_lhs (call, vec_array);
 	  gimple_call_set_nothrow (call, true);
 	  new_stmt = call;
Index: gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c	2017-11-17 09:35:23.401033274 +0000
@@ -0,0 +1,12 @@
+/* { dg-do compile } */
+
+void
+f (int *restrict a, int *restrict b, int *restrict c)
+{
+  for (int i = 0; i < 100; ++i)
+    if (c[i])
+      {
+	a[i * 2] = b[i * 5 + 2];
+	a[i * 2 + 1] = b[i * 5];
+      }
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1.c	2017-11-17 09:35:23.401033274 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2] + src[i * 2 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1_run.c	2017-11-17 09:35:23.401033274 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 2];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 2; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_2 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = in[i * 2] + in[i * 2 + 1];	\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,69 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 3]					\
+		   + src[i * 3 + 1]				\
+		   + src[i * 3 + 2]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2_run.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,40 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 3];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 3; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_3 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 3]			\
+			   + in[i * 3 + 1]		\
+			   + in[i * 3 + 2]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,70 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 4]					\
+		   + src[i * 4 + 1]				\
+		   + src[i * 4 + 2]				\
+		   + src[i * 4 + 3]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3_run.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,41 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 4];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 4; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_4 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 4]			\
+			   + in[i * 4 + 1]		\
+			   + in[i * 4 + 2]		\
+			   + in[i * 4 + 3]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_4.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_4.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_5.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_5.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 3];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_6.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_6.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2];					\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld2b\t} } } */
+/* { dg-final { scan-assembler-not {\tld2h\t} } } */
+/* { dg-final { scan-assembler-not {\tld2w\t} } } */
+/* { dg-final { scan-assembler-not {\tld2d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_7.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_7.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld3b\t} } } */
+/* { dg-final { scan-assembler-not {\tld3h\t} } } */
+/* { dg-final { scan-assembler-not {\tld3w\t} } } */
+/* { dg-final { scan-assembler-not {\tld3d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_8.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_8.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld4b\t} } } */
+/* { dg-final { scan-assembler-not {\tld4h\t} } } */
+/* { dg-final { scan-assembler-not {\tld4w\t} } } */
+/* { dg-final { scan-assembler-not {\tld4d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1.c	2017-11-17 09:35:23.402833274 +0000
@@ -0,0 +1,73 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 2] = value;				\
+	    dest[i * 2 + 1] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1_run.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 2];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 2; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_2 (out, in, mask, 17, N);				\
+    for (int i = 0; i < N * 2; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 2] + 17);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 2] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,74 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 3] = value;				\
+	    dest[i * 3 + 1] = value;				\
+	    dest[i * 3 + 2] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2_run.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 3];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 3; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_3 (out, in, mask, 11, N);				\
+    for (int i = 0; i < N * 3; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 3] + 11);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 3] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,75 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 4] = value;				\
+	    dest[i * 4 + 1] = value;				\
+	    dest[i * 4 + 2] = value;				\
+	    dest[i * 4 + 3] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3_run.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3_run.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 4];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 4; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_4 (out, in, mask, 42, N);				\
+    for (int i = 0; i < N * 4; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 4] + 42);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 4] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_4.c
===================================================================
--- /dev/null	2017-11-14 14:28:07.424493901 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_4.c	2017-11-17 09:35:23.403733274 +0000
@@ -0,0 +1,44 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	if (cond[i] < 8)					\
+	  dest[i * 2] = src[i];					\
+	if (cond[i] > 2)					\
+	  dest[i * 2 + 1] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tst2b\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2h\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2w\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2d\t.z[0-9]} } } */
Jeff Law Dec. 12, 2017, 12:59 a.m. UTC | #2
On 11/17/2017 02:36 AM, Richard Sandiford wrote:
> Richard Sandiford <richard.sandiford@linaro.org> writes:
>> This patch adds support for vectorising groups of IFN_MASK_LOADs
>> and IFN_MASK_STOREs using conditional load/store-lanes instructions.
>> This requires new internal functions to represent the result
>> (IFN_MASK_{LOAD,STORE}_LANES), as well as associated optabs.
>>
>> The normal IFN_{LOAD,STORE}_LANES functions are const operations
>> that logically just perform the permute: the load or store is
>> encoded as a MEM operand to the call statement.  In contrast,
>> the IFN_MASK_{LOAD,STORE}_LANES functions use the same kind of
>> interface as IFN_MASK_{LOAD,STORE}, since the memory is only
>> conditionally accessed.
>>
>> The AArch64 patterns were added as part of the main LD[234]/ST[234] patch.
>>
>> Tested on aarch64-linux-gnu (both with and without SVE), x86_64-linux-gnu
>> and powerpc64le-linux-gnu.  OK to install?
> 
> Here's an updated (and much simpler) version that applies on top of the
> series I just posted to remove vectorizable_mask_load_store.  Tested as
> before.
> 
> Thanks,
> Richard
> 
> 
> 2017-11-17  Richard Sandiford  <richard.sandiford@linaro.org>
> 	    Alan Hayward  <alan.hayward@arm.com>
> 	    David Sherwood  <david.sherwood@arm.com>
> 
> gcc/
> 	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
> 	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
> 	* optabs.def (vec_mask_load_lanes_optab): New optab.
> 	(vec_mask_store_lanes_optab): Likewise.
> 	* internal-fn.def (MASK_LOAD_LANES): New internal function.
> 	(MASK_STORE_LANES): Likewise.
> 	* internal-fn.c (mask_load_lanes_direct): New macro.
> 	(mask_store_lanes_direct): Likewise.
> 	(expand_mask_load_optab_fn): Handle masked operations.
> 	(expand_mask_load_lanes_optab_fn): New macro.
> 	(expand_mask_store_optab_fn): Handle masked operations.
> 	(expand_mask_store_lanes_optab_fn): New macro.
> 	(direct_mask_load_lanes_optab_supported_p): Likewise.
> 	(direct_mask_store_lanes_optab_supported_p): Likewise.
> 	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
> 	parameter.
> 	(vect_load_lanes_supported): Likewise.
> 	* tree-vect-data-refs.c (strip_conversion): New function.
> 	(can_group_stmts_p): Likewise.
> 	(vect_analyze_data_ref_accesses): Use it instead of checking
> 	for a pair of assignments.
> 	(vect_store_lanes_supported): Take a masked_p parameter.
> 	(vect_load_lanes_supported): Likewise.
> 	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
> 	vect_store_lanes_supported and vect_load_lanes_supported.
> 	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
> 	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
> 	parameter.  Don't allow gaps for masked accesses.
> 	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
> 	and vect_load_lanes_supported.
> 	(get_load_store_type): Take a masked_p parameter and update
> 	call to get_group_load_store_type.
> 	(vectorizable_store): Update call to get_load_store_type.
> 	Handle IFN_MASK_STORE_LANES.
> 	(vectorizable_load): Update call to get_load_store_type.
> 	Handle IFN_MASK_LOAD_LANES.
> 
> gcc/testsuite/
> 	* gcc.dg/vect/vect-ooo-group-1.c: New test.
> 	* gcc.target/aarch64/sve_mask_struct_load_1.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_1_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_2.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_2_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_3.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_3_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_4.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_5.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_6.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_7.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_load_8.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_1.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_1_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_2.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_2_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_3.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_3_run.c: Likewise.
> 	* gcc.target/aarch64/sve_mask_struct_store_4.c: Likewise.
> 
> Index: gcc/doc/md.texi
> ===================================================================
> --- gcc/doc/md.texi	2017-11-17 09:06:19.783260344 +0000
> +++ gcc/doc/md.texi	2017-11-17 09:35:23.400133274 +0000
> @@ -4855,6 +4855,23 @@ loads for vectors of mode @var{n}.
>  
>  This pattern is not allowed to @code{FAIL}.
>  
> +@cindex @code{vec_mask_load_lanes@var{m}@var{n}} instruction pattern
> +@item @samp{vec_mask_load_lanes@var{m}@var{n}}
> +Like @samp{vec_load_lanes@var{m}@var{n}}, but takes an additional
> +mask operand (operand 2) that specifies which elements of the destination
> +vectors should be loaded.  Other elements of the destination
> +vectors are set to zero.  The operation is equivalent to:
> +
> +@smallexample
> +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
> +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
> +  if (operand2[j])
> +    for (i = 0; i < c; i++)
> +      operand0[i][j] = operand1[j * c + i];
> +@end smallexample
Don't you need to set operand0[i][j] to zero if operand2[j] is zero for
this to be correct?  And if that's the case, don't you need to expose
the set to zero as a side effect?



> +@cindex @code{vec_mask_store_lanes@var{m}@var{n}} instruction pattern
> +@item @samp{vec_mask_store_lanes@var{m}@var{n}}
> +Like @samp{vec_store_lanes@var{m}@var{n}}, but takes an additional
> +mask operand (operand 2) that specifies which elements of the source
> +vectors should be stored.  The operation is equivalent to:
> +
> +@smallexample
> +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
> +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
> +  if (operand2[j])
> +    for (i = 0; i < c; i++)
> +      operand0[j * c + i] = operand1[i][j];
> +@end smallexample
> +
> +This pattern is not allowed to @code{FAIL}.
Is the asymmetry between loads and stores intentional?  In particular,
for loads the documentation says "Other elements of the destination
vectors are set to zero".



> Index: gcc/tree-vect-data-refs.c
> ===================================================================
> --- gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.085133247 +0000
> +++ gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.404633274 +0000
> @@ -2791,6 +2791,62 @@ dr_group_sort_cmp (const void *dra_, con
>    return cmp;
>  }
>  
> +/* If OP is the result of a conversion, return the unconverted value,
> +   otherwise return null.  */
> +
> +static tree
> +strip_conversion (tree op)
> +{
> +  if (TREE_CODE (op) != SSA_NAME)
> +    return NULL_TREE;
> +  gimple *stmt = SSA_NAME_DEF_STMT (op);
> +  if (!is_gimple_assign (stmt)
> +      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
> +    return NULL_TREE;
> +  return gimple_assign_rhs1 (stmt);
> +}
Do you have any desire to walk back through multiple conversions?
They're only used for masks when comparing whether the masks are the
same, so it probably doesn't matter in practice whether we handle
multiple conversions, I guess.
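
Something along these lines is what I had in mind (untested, and just
to illustrate the question rather than to suggest the patch needs it):

/* Hypothetical variant of strip_conversion: walk back through any
   chain of conversions and return the innermost unconverted value,
   or null if OP is not the result of a conversion at all.  */

static tree
strip_conversions (tree op)
{
  tree inner = NULL_TREE;
  while (TREE_CODE (op) == SSA_NAME)
    {
      gimple *stmt = SSA_NAME_DEF_STMT (op);
      if (!is_gimple_assign (stmt)
          || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
        break;
      op = gimple_assign_rhs1 (stmt);
      inner = op;
    }
  return inner;
}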

Somehow I know we've got to have an equivalent of this routine lying
around somewhere :-)  Though I don't think it's worth the time to find.

Not an ACK or NAK.  I'm a bit hung up on the doc issue and how it
potentially impacts how we support this capability.

jeff
James Greenhalgh Jan. 7, 2018, 8:51 p.m. UTC | #3
On Tue, Dec 12, 2017 at 12:59:33AM +0000, Jeff Law wrote:
> On 11/17/2017 02:36 AM, Richard Sandiford wrote:
> > Richard Sandiford <richard.sandiford@linaro.org> writes:
> >> This patch adds support for vectorising groups of IFN_MASK_LOADs
> >> and IFN_MASK_STOREs using conditional load/store-lanes instructions.
> >> This requires new internal functions to represent the result
> >> (IFN_MASK_{LOAD,STORE}_LANES), as well as associated optabs.
> >>
> >> The normal IFN_{LOAD,STORE}_LANES functions are const operations
> >> that logically just perform the permute: the load or store is
> >> encoded as a MEM operand to the call statement.  In contrast,
> >> the IFN_MASK_{LOAD,STORE}_LANES functions use the same kind of
> >> interface as IFN_MASK_{LOAD,STORE}, since the memory is only
> >> conditionally accessed.
> >>
> >> The AArch64 patterns were added as part of the main LD[234]/ST[234] patch.
> >>
> >> Tested on aarch64-linux-gnu (both with and without SVE), x86_64-linux-gnu
> >> and powerpc64le-linux-gnu.  OK to install?
> > 
> > Here's an updated (and much simpler) version that applies on top of the
> > series I just posted to remove vectorizable_mask_load_store.  Tested as
> > before.
> > 
> > Thanks,
> > Richard
> > 
> > 
> > 2017-11-17  Richard Sandiford  <richard.sandiford@linaro.org>
> > 	    Alan Hayward  <alan.hayward@arm.com>
> > 	    David Sherwood  <david.sherwood@arm.com>
> > 
> > gcc/
> > 	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
> > 	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
> > 	* optabs.def (vec_mask_load_lanes_optab): New optab.
> > 	(vec_mask_store_lanes_optab): Likewise.
> > 	* internal-fn.def (MASK_LOAD_LANES): New internal function.
> > 	(MASK_STORE_LANES): Likewise.
> > 	* internal-fn.c (mask_load_lanes_direct): New macro.
> > 	(mask_store_lanes_direct): Likewise.
> > 	(expand_mask_load_optab_fn): Handle masked operations.
> > 	(expand_mask_load_lanes_optab_fn): New macro.
> > 	(expand_mask_store_optab_fn): Handle masked operations.
> > 	(expand_mask_store_lanes_optab_fn): New macro.
> > 	(direct_mask_load_lanes_optab_supported_p): Likewise.
> > 	(direct_mask_store_lanes_optab_supported_p): Likewise.
> > 	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
> > 	parameter.
> > 	(vect_load_lanes_supported): Likewise.
> > 	* tree-vect-data-refs.c (strip_conversion): New function.
> > 	(can_group_stmts_p): Likewise.
> > 	(vect_analyze_data_ref_accesses): Use it instead of checking
> > 	for a pair of assignments.
> > 	(vect_store_lanes_supported): Take a masked_p parameter.
> > 	(vect_load_lanes_supported): Likewise.
> > 	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
> > 	vect_store_lanes_supported and vect_load_lanes_supported.
> > 	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
> > 	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
> > 	parameter.  Don't allow gaps for masked accesses.
> > 	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
> > 	and vect_load_lanes_supported.
> > 	(get_load_store_type): Take a masked_p parameter and update
> > 	call to get_group_load_store_type.
> > 	(vectorizable_store): Update call to get_load_store_type.
> > 	Handle IFN_MASK_STORE_LANES.
> > 	(vectorizable_load): Update call to get_load_store_type.
> > 	Handle IFN_MASK_LOAD_LANES.
> > 
> > gcc/testsuite/
> > 	* gcc.dg/vect/vect-ooo-group-1.c: New test.
> > 	* gcc.target/aarch64/sve_mask_struct_load_1.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_1_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_2.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_2_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_3.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_3_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_4.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_5.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_6.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_7.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_load_8.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_1.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_1_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_2.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_2_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_3.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_3_run.c: Likewise.
> > 	* gcc.target/aarch64/sve_mask_struct_store_4.c: Likewise.
> > 
> > Index: gcc/doc/md.texi
> > ===================================================================
> > --- gcc/doc/md.texi	2017-11-17 09:06:19.783260344 +0000
> > +++ gcc/doc/md.texi	2017-11-17 09:35:23.400133274 +0000
> > @@ -4855,6 +4855,23 @@ loads for vectors of mode @var{n}.
> >  
> >  This pattern is not allowed to @code{FAIL}.
> >  
> > +@cindex @code{vec_mask_load_lanes@var{m}@var{n}} instruction pattern
> > +@item @samp{vec_mask_load_lanes@var{m}@var{n}}
> > +Like @samp{vec_load_lanes@var{m}@var{n}}, but takes an additional
> > +mask operand (operand 2) that specifies which elements of the destination
> > +vectors should be loaded.  Other elements of the destination
> > +vectors are set to zero.  The operation is equivalent to:
> > +
> > +@smallexample
> > +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
> > +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
> > +  if (operand2[j])
> > +    for (i = 0; i < c; i++)
> > +      operand0[i][j] = operand1[j * c + i];
> > +@end smallexample
> Don't you need to set operand0[i][j] to zero if operand2[j] is zero for
> this to be correct?  And if that's the case, don't you need to expose
> the set to zero as a side effect?
> 
> 
> 
> > +@cindex @code{vec_mask_store_lanes@var{m}@var{n}} instruction pattern
> > +@item @samp{vec_mask_store_lanes@var{m}@var{n}}
> > +Like @samp{vec_store_lanes@var{m}@var{n}}, but takes an additional
> > +mask operand (operand 2) that specifies which elements of the source
> > +vectors should be stored.  The operation is equivalent to:
> > +
> > +@smallexample
> > +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
> > +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
> > +  if (operand2[j])
> > +    for (i = 0; i < c; i++)
> > +      operand0[j * c + i] = operand1[i][j];
> > +@end smallexample
> > +
> > +This pattern is not allowed to @code{FAIL}.
> Is the asymmetry between loads and stores intentional?  In particular
> for loads "Other elements of the destination vectors are set to zero"
> 
> 
> 
> > Index: gcc/tree-vect-data-refs.c
> > ===================================================================
> > --- gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.085133247 +0000
> > +++ gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.404633274 +0000
> > @@ -2791,6 +2791,62 @@ dr_group_sort_cmp (const void *dra_, con
> >    return cmp;
> >  }
> >  
> > +/* If OP is the result of a conversion, return the unconverted value,
> > +   otherwise return null.  */
> > +
> > +static tree
> > +strip_conversion (tree op)
> > +{
> > +  if (TREE_CODE (op) != SSA_NAME)
> > +    return NULL_TREE;
> > +  gimple *stmt = SSA_NAME_DEF_STMT (op);
> > +  if (!is_gimple_assign (stmt)
> > +      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
> > +    return NULL_TREE;
> > +  return gimple_assign_rhs1 (stmt);
> > +}
> DO you have any desire to walk back through multiple conversions?
> They're only used for masks when comparing if masks are the same, so it
> probably doesn't matter in practice if we handle multiple conversions I
> guess.
> 
> Somehow I know we've got to have an equivalent of this routine lying
> around somewhere :-)  Though I don't think it's worth the time to find.
> 
> Not an ACK or NAK.  I'm a bit hung up on the doc issue and how it
> potentially impacts how we support this capability.

I have no comment on the midend parts, but once you get the OK for
those, the AArch64 tests in this patch are OK.

Thanks,
James
Richard Sandiford Jan. 12, 2018, 4:28 p.m. UTC | #4
Sorry, just realised this wasn't ACKed.

Jeff Law <law@redhat.com> writes:
> On 11/17/2017 02:36 AM, Richard Sandiford wrote:
>> Richard Sandiford <richard.sandiford@linaro.org> writes:
>>> This patch adds support for vectorising groups of IFN_MASK_LOADs
>>> and IFN_MASK_STOREs using conditional load/store-lanes instructions.
>>> This requires new internal functions to represent the result
>>> (IFN_MASK_{LOAD,STORE}_LANES), as well as associated optabs.
>>>
>>> The normal IFN_{LOAD,STORE}_LANES functions are const operations
>>> that logically just perform the permute: the load or store is
>>> encoded as a MEM operand to the call statement.  In contrast,
>>> the IFN_MASK_{LOAD,STORE}_LANES functions use the same kind of
>>> interface as IFN_MASK_{LOAD,STORE}, since the memory is only
>>> conditionally accessed.
>>>
>>> The AArch64 patterns were added as part of the main LD[234]/ST[234] patch.
>>>
>>> Tested on aarch64-linux-gnu (both with and without SVE), x86_64-linux-gnu
>>> and powerpc64le-linux-gnu.  OK to install?
>> 
>> Here's an updated (and much simpler) version that applies on top of the
>> series I just posted to remove vectorizable_mask_load_store.  Tested as
>> before.
>> 
>> Thanks,
>> Richard
>> 
>> 
>> 2017-11-17  Richard Sandiford  <richard.sandiford@linaro.org>
>> 	    Alan Hayward  <alan.hayward@arm.com>
>> 	    David Sherwood  <david.sherwood@arm.com>
>> 
>> gcc/
>> 	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
>> 	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
>> 	* optabs.def (vec_mask_load_lanes_optab): New optab.
>> 	(vec_mask_store_lanes_optab): Likewise.
>> 	* internal-fn.def (MASK_LOAD_LANES): New internal function.
>> 	(MASK_STORE_LANES): Likewise.
>> 	* internal-fn.c (mask_load_lanes_direct): New macro.
>> 	(mask_store_lanes_direct): Likewise.
>> 	(expand_mask_load_optab_fn): Handle masked operations.
>> 	(expand_mask_load_lanes_optab_fn): New macro.
>> 	(expand_mask_store_optab_fn): Handle masked operations.
>> 	(expand_mask_store_lanes_optab_fn): New macro.
>> 	(direct_mask_load_lanes_optab_supported_p): Likewise.
>> 	(direct_mask_store_lanes_optab_supported_p): Likewise.
>> 	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
>> 	parameter.
>> 	(vect_load_lanes_supported): Likewise.
>> 	* tree-vect-data-refs.c (strip_conversion): New function.
>> 	(can_group_stmts_p): Likewise.
>> 	(vect_analyze_data_ref_accesses): Use it instead of checking
>> 	for a pair of assignments.
>> 	(vect_store_lanes_supported): Take a masked_p parameter.
>> 	(vect_load_lanes_supported): Likewise.
>> 	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
>> 	vect_store_lanes_supported and vect_load_lanes_supported.
>> 	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
>> 	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
>> 	parameter.  Don't allow gaps for masked accesses.
>> 	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
>> 	and vect_load_lanes_supported.
>> 	(get_load_store_type): Take a masked_p parameter and update
>> 	call to get_group_load_store_type.
>> 	(vectorizable_store): Update call to get_load_store_type.
>> 	Handle IFN_MASK_STORE_LANES.
>> 	(vectorizable_load): Update call to get_load_store_type.
>> 	Handle IFN_MASK_LOAD_LANES.
>> 
>> gcc/testsuite/
>> 	* gcc.dg/vect/vect-ooo-group-1.c: New test.
>> 	* gcc.target/aarch64/sve_mask_struct_load_1.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_1_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_2.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_2_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_3.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_3_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_4.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_5.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_6.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_7.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_load_8.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_1.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_1_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_2.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_2_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_3.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_3_run.c: Likewise.
>> 	* gcc.target/aarch64/sve_mask_struct_store_4.c: Likewise.
>> 
>> Index: gcc/doc/md.texi
>> ===================================================================
>> --- gcc/doc/md.texi	2017-11-17 09:06:19.783260344 +0000
>> +++ gcc/doc/md.texi	2017-11-17 09:35:23.400133274 +0000
>> @@ -4855,6 +4855,23 @@ loads for vectors of mode @var{n}.
>>  
>>  This pattern is not allowed to @code{FAIL}.
>>  
>> +@cindex @code{vec_mask_load_lanes@var{m}@var{n}} instruction pattern
>> +@item @samp{vec_mask_load_lanes@var{m}@var{n}}
>> +Like @samp{vec_load_lanes@var{m}@var{n}}, but takes an additional
>> +mask operand (operand 2) that specifies which elements of the destination
>> +vectors should be loaded.  Other elements of the destination
>> +vectors are set to zero.  The operation is equivalent to:
>> +
>> +@smallexample
>> +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
>> +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
>> +  if (operand2[j])
>> +    for (i = 0; i < c; i++)
>> +      operand0[i][j] = operand1[j * c + i];
>> +@end smallexample
> Don't you need to set operand0[i][j] to zero if operand2[j] is zero for
> this to be correct?  And if that's the case, don't you need to expose
> the set to zero as a side effect?

Oops, good catch.

>> +@cindex @code{vec_mask_store_lanes@var{m}@var{n}} instruction pattern
>> +@item @samp{vec_mask_store_lanes@var{m}@var{n}}
>> +Like @samp{vec_store_lanes@var{m}@var{n}}, but takes an additional
>> +mask operand (operand 2) that specifies which elements of the source
>> +vectors should be stored.  The operation is equivalent to:
>> +
>> +@smallexample
>> +int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
>> +for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
>> +  if (operand2[j])
>> +    for (i = 0; i < c; i++)
>> +      operand0[j * c + i] = operand1[i][j];
>> +@end smallexample
>> +
>> +This pattern is not allowed to @code{FAIL}.
> Is the asymmetry between loads and stores intentional?  In particular
> for loads "Other elements of the destination vectors are set to zero"

Yeah, the stores don't modify memory locations for which operand2[j]
is false.  They help to vectorise stores that are conditionally executed,
rather than unconditional stores that have a conditional rhs.
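
To make the distinction concrete, compare the following two loops
(a hand-written sketch in the same style as the new tests, not
something taken from the patch):

/* Conditionally-executed stores: dest[] is only written when
   cond[i] is true, so vectorising the group needs a masked ST2.  */
void
masked_group_store (int *__restrict dest, int *__restrict src,
                    int *__restrict cond, int n)
{
  for (int i = 0; i < n; ++i)
    if (cond[i])
      {
        dest[i * 2] = src[i];
        dest[i * 2 + 1] = src[i];
      }
}

/* Unconditional stores with a conditional rhs: every element of
   dest[] is written and only the stored value depends on cond[i],
   so the store itself doesn't need to be masked for correctness.  */
void
conditional_rhs_store (int *__restrict dest, int *__restrict src,
                       int *__restrict cond, int n)
{
  for (int i = 0; i < n; ++i)
    {
      dest[i * 2] = cond[i] ? src[i] : 0;
      dest[i * 2 + 1] = cond[i] ? src[i] : 0;
    }
}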

>> Index: gcc/tree-vect-data-refs.c
>> ===================================================================
>> --- gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.085133247 +0000
>> +++ gcc/tree-vect-data-refs.c	2017-11-17 09:35:23.404633274 +0000
>> @@ -2791,6 +2791,62 @@ dr_group_sort_cmp (const void *dra_, con
>>    return cmp;
>>  }
>>  
>> +/* If OP is the result of a conversion, return the unconverted value,
>> +   otherwise return null.  */
>> +
>> +static tree
>> +strip_conversion (tree op)
>> +{
>> +  if (TREE_CODE (op) != SSA_NAME)
>> +    return NULL_TREE;
>> +  gimple *stmt = SSA_NAME_DEF_STMT (op);
>> +  if (!is_gimple_assign (stmt)
>> +      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
>> +    return NULL_TREE;
>> +  return gimple_assign_rhs1 (stmt);
>> +}
> DO you have any desire to walk back through multiple conversions?
> They're only used for masks when comparing if masks are the same, so it
> probably doesn't matter in practice if we handle multiple conversions I
> guess.

I think one level is enough, since this is really just there to
cope with conversions introduced by build_mask_conversion in
tree-vect-patterns.c, which always builds a single conversion.
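
E.g. in loops like the new tests, where the condition type differs
from the data type (sketch only, not one of the actual test loops):

void
f (long *__restrict dest, long *__restrict src,
   short *__restrict cond, int n)
{
  for (int i = 0; i < n; ++i)
    if (cond[i])
      dest[i] = src[i * 2] + src[i * 2 + 1];
}

Here the mask is computed from cond[] and has to be converted before
it can be used by the wider masked loads of src[], so the two loads in
the group can end up with masks that are separate single conversions
of the same value; stripping one conversion from each is enough to see
that they match.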

> Somehow I know we've got to have an equivalent of this routine lying
> around somewhere :-)  Though I don't think it's worth the time to find.

Yeah, quite possibly :-)

> Not an ACK or NAK.  I'm a bit hung up on the doc issue and how it
> potentially impacts how we support this capability.

Here's the patch with the updated docs.  Does this version look OK?

Thanks,
Richard


2018-01-12  Richard Sandiford  <richard.sandiford@linaro.org>
	    Alan Hayward  <alan.hayward@arm.com>
	    David Sherwood  <david.sherwood@arm.com>

gcc/
	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
	* optabs.def (vec_mask_load_lanes_optab): New optab.
	(vec_mask_store_lanes_optab): Likewise.
	* internal-fn.def (MASK_LOAD_LANES): New internal function.
	(MASK_STORE_LANES): Likewise.
	* internal-fn.c (mask_load_lanes_direct): New macro.
	(mask_store_lanes_direct): Likewise.
	(expand_mask_load_optab_fn): Handle masked operations.
	(expand_mask_load_lanes_optab_fn): New macro.
	(expand_mask_store_optab_fn): Handle masked operations.
	(expand_mask_store_lanes_optab_fn): New macro.
	(direct_mask_load_lanes_optab_supported_p): Likewise.
	(direct_mask_store_lanes_optab_supported_p): Likewise.
	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
	parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-data-refs.c (strip_conversion): New function.
	(can_group_stmts_p): Likewise.
	(vect_analyze_data_ref_accesses): Use it instead of checking
	for a pair of assignments.
	(vect_store_lanes_supported): Take a masked_p parameter.
	(vect_load_lanes_supported): Likewise.
	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
	vect_store_lanes_supported and vect_load_lanes_supported.
	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
	parameter.  Don't allow gaps for masked accesses.
	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
	and vect_load_lanes_supported.
	(get_load_store_type): Take a masked_p parameter and update
	call to get_group_load_store_type.
	(vectorizable_store): Update call to get_load_store_type.
	Handle IFN_MASK_STORE_LANES.
	(vectorizable_load): Update call to get_load_store_type.
	Handle IFN_MASK_LOAD_LANES.

gcc/testsuite/
	* gcc.dg/vect/vect-ooo-group-1.c: New test.
	* gcc.target/aarch64/sve/mask_struct_load_1.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_1_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_2.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_2_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_3.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_3_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_4.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_5.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_6.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_7.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_load_8.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_1.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_1_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_2.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_2_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_3.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_3_run.c: Likewise.
	* gcc.target/aarch64/sve/mask_struct_store_4.c: Likewise.

Index: gcc/doc/md.texi
===================================================================
--- gcc/doc/md.texi	2018-01-12 16:05:09.907387289 +0000
+++ gcc/doc/md.texi	2018-01-12 16:19:47.086865940 +0000
@@ -4855,6 +4855,26 @@ loads for vectors of mode @var{n}.
 
 This pattern is not allowed to @code{FAIL}.
 
+@cindex @code{vec_mask_load_lanes@var{m}@var{n}} instruction pattern
+@item @samp{vec_mask_load_lanes@var{m}@var{n}}
+Like @samp{vec_load_lanes@var{m}@var{n}}, but takes an additional
+mask operand (operand 2) that specifies which elements of the destination
+vectors should be loaded.  Other elements of the destination
+vectors are set to zero.  The operation is equivalent to:
+
+@smallexample
+int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
+for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
+  if (operand2[j])
+    for (i = 0; i < c; i++)
+      operand0[i][j] = operand1[j * c + i];
+  else
+    for (i = 0; i < c; i++)
+      operand0[i][j] = 0;
+@end smallexample
+
+This pattern is not allowed to @code{FAIL}.
+
 @cindex @code{vec_store_lanes@var{m}@var{n}} instruction pattern
 @item @samp{vec_store_lanes@var{m}@var{n}}
 Equivalent to @samp{vec_load_lanes@var{m}@var{n}}, with the memory
@@ -4872,6 +4892,22 @@ for a memory operand 0 and register oper
 
 This pattern is not allowed to @code{FAIL}.
 
+@cindex @code{vec_mask_store_lanes@var{m}@var{n}} instruction pattern
+@item @samp{vec_mask_store_lanes@var{m}@var{n}}
+Like @samp{vec_store_lanes@var{m}@var{n}}, but takes an additional
+mask operand (operand 2) that specifies which elements of the source
+vectors should be stored.  The operation is equivalent to:
+
+@smallexample
+int c = GET_MODE_SIZE (@var{m}) / GET_MODE_SIZE (@var{n});
+for (j = 0; j < GET_MODE_NUNITS (@var{n}); j++)
+  if (operand2[j])
+    for (i = 0; i < c; i++)
+      operand0[j * c + i] = operand1[i][j];
+@end smallexample
+
+This pattern is not allowed to @code{FAIL}.
+
 @cindex @code{vec_set@var{m}} instruction pattern
 @item @samp{vec_set@var{m}}
 Set given field in the vector value.  Operand 0 is the vector to modify,
Index: gcc/optabs.def
===================================================================
--- gcc/optabs.def	2018-01-09 15:46:34.439449019 +0000
+++ gcc/optabs.def	2018-01-12 16:19:47.086865940 +0000
@@ -80,6 +80,8 @@ OPTAB_CD(ssmsub_widen_optab, "ssmsub$b$a
 OPTAB_CD(usmsub_widen_optab, "usmsub$a$b4")
 OPTAB_CD(vec_load_lanes_optab, "vec_load_lanes$a$b")
 OPTAB_CD(vec_store_lanes_optab, "vec_store_lanes$a$b")
+OPTAB_CD(vec_mask_load_lanes_optab, "vec_mask_load_lanes$a$b")
+OPTAB_CD(vec_mask_store_lanes_optab, "vec_mask_store_lanes$a$b")
 OPTAB_CD(vcond_optab, "vcond$a$b")
 OPTAB_CD(vcondu_optab, "vcondu$a$b")
 OPTAB_CD(vcondeq_optab, "vcondeq$a$b")
Index: gcc/internal-fn.def
===================================================================
--- gcc/internal-fn.def	2018-01-09 15:46:34.439449019 +0000
+++ gcc/internal-fn.def	2018-01-12 16:19:47.086865940 +0000
@@ -47,9 +47,11 @@ along with GCC; see the file COPYING3.
 
    - mask_load: currently just maskload
    - load_lanes: currently just vec_load_lanes
+   - mask_load_lanes: currently just vec_mask_load_lanes
 
    - mask_store: currently just maskstore
    - store_lanes: currently just vec_store_lanes
+   - mask_store_lanes: currently just vec_mask_store_lanes
 
    DEF_INTERNAL_SIGNED_OPTAB_FN defines an internal function that
    maps to one of two optabs, depending on the signedness of an input.
@@ -106,9 +108,13 @@ along with GCC; see the file COPYING3.
 
 DEF_INTERNAL_OPTAB_FN (MASK_LOAD, ECF_PURE, maskload, mask_load)
 DEF_INTERNAL_OPTAB_FN (LOAD_LANES, ECF_CONST, vec_load_lanes, load_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_LOAD_LANES, ECF_PURE,
+		       vec_mask_load_lanes, mask_load_lanes)
 
 DEF_INTERNAL_OPTAB_FN (MASK_STORE, 0, maskstore, mask_store)
 DEF_INTERNAL_OPTAB_FN (STORE_LANES, ECF_CONST, vec_store_lanes, store_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_STORE_LANES, 0,
+		       vec_mask_store_lanes, mask_store_lanes)
 
 DEF_INTERNAL_OPTAB_FN (RSQRT, ECF_CONST, rsqrt, unary)
 
Index: gcc/internal-fn.c
===================================================================
--- gcc/internal-fn.c	2018-01-09 15:46:34.439449019 +0000
+++ gcc/internal-fn.c	2018-01-12 16:19:47.086865940 +0000
@@ -82,8 +82,10 @@ #define DEF_INTERNAL_FN(CODE, FLAGS, FNS
 #define not_direct { -2, -2, false }
 #define mask_load_direct { -1, 2, false }
 #define load_lanes_direct { -1, -1, false }
+#define mask_load_lanes_direct { -1, -1, false }
 #define mask_store_direct { 3, 2, false }
 #define store_lanes_direct { 0, 0, false }
+#define mask_store_lanes_direct { 0, 0, false }
 #define unary_direct { 0, 0, true }
 #define binary_direct { 0, 0, true }
 
@@ -2408,7 +2410,7 @@ expand_LOOP_DIST_ALIAS (internal_fn, gca
   gcc_unreachable ();
 }
 
-/* Expand MASK_LOAD call STMT using optab OPTAB.  */
+/* Expand MASK_LOAD{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_load_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2417,6 +2419,7 @@ expand_mask_load_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, target, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   lhs = gimple_call_lhs (stmt);
@@ -2429,6 +2432,12 @@ expand_mask_load_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   rhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_load_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (rhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2436,12 +2445,12 @@ expand_mask_load_optab_fn (internal_fn,
   create_output_operand (&ops[0], target, TYPE_MODE (type));
   create_fixed_operand (&ops[1], mem);
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
-/* Expand MASK_STORE call STMT using optab OPTAB.  */
+#define expand_mask_load_lanes_optab_fn expand_mask_load_optab_fn
+
+/* Expand MASK_STORE{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_store_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2450,6 +2459,7 @@ expand_mask_store_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, reg, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   rhs = gimple_call_arg (stmt, 3);
@@ -2460,6 +2470,12 @@ expand_mask_store_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   lhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_store_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2467,11 +2483,11 @@ expand_mask_store_optab_fn (internal_fn,
   create_fixed_operand (&ops[0], mem);
   create_input_operand (&ops[1], reg, TYPE_MODE (type));
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
+#define expand_mask_store_lanes_optab_fn expand_mask_store_optab_fn
+
 static void
 expand_ABNORMAL_DISPATCHER (internal_fn, gcall *)
 {
@@ -2871,8 +2887,10 @@ #define direct_unary_optab_supported_p d
 #define direct_binary_optab_supported_p direct_optab_supported_p
 #define direct_mask_load_optab_supported_p direct_optab_supported_p
 #define direct_load_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_load_lanes_optab_supported_p multi_vector_optab_supported_p
 #define direct_mask_store_optab_supported_p direct_optab_supported_p
 #define direct_store_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_store_lanes_optab_supported_p multi_vector_optab_supported_p
 
 /* Return the optab used by internal function FN.  */
 
Index: gcc/tree-vectorizer.h
===================================================================
--- gcc/tree-vectorizer.h	2018-01-12 14:45:51.039434496 +0000
+++ gcc/tree-vectorizer.h	2018-01-12 16:19:47.091865737 +0000
@@ -1293,9 +1293,9 @@ extern tree bump_vector_ptr (tree, gimpl
 			     tree);
 extern tree vect_create_destination_var (tree, tree);
 extern bool vect_grouped_store_supported (tree, unsigned HOST_WIDE_INT);
-extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern bool vect_grouped_load_supported (tree, bool, unsigned HOST_WIDE_INT);
-extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern void vect_permute_store_chain (vec<tree> ,unsigned int, gimple *,
                                     gimple_stmt_iterator *, vec<tree> *);
 extern tree vect_setup_realignment (gimple *, gimple_stmt_iterator *, tree *,
Index: gcc/tree-vect-data-refs.c
===================================================================
--- gcc/tree-vect-data-refs.c	2018-01-12 16:05:09.983384123 +0000
+++ gcc/tree-vect-data-refs.c	2018-01-12 16:19:47.089865818 +0000
@@ -2780,6 +2780,62 @@ dr_group_sort_cmp (const void *dra_, con
   return cmp;
 }
 
+/* If OP is the result of a conversion, return the unconverted value,
+   otherwise return null.  */
+
+static tree
+strip_conversion (tree op)
+{
+  if (TREE_CODE (op) != SSA_NAME)
+    return NULL_TREE;
+  gimple *stmt = SSA_NAME_DEF_STMT (op);
+  if (!is_gimple_assign (stmt)
+      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
+    return NULL_TREE;
+  return gimple_assign_rhs1 (stmt);
+}
+
+/* Return true if vectorizable_* routines can handle statements STMT1
+   and STMT2 being in a single group.  */
+
+static bool
+can_group_stmts_p (gimple *stmt1, gimple *stmt2)
+{
+  if (gimple_assign_single_p (stmt1))
+    return gimple_assign_single_p (stmt2);
+
+  if (is_gimple_call (stmt1) && gimple_call_internal_p (stmt1))
+    {
+      /* Check for two masked loads or two masked stores.  */
+      if (!is_gimple_call (stmt2) || !gimple_call_internal_p (stmt2))
+	return false;
+      internal_fn ifn = gimple_call_internal_fn (stmt1);
+      if (ifn != IFN_MASK_LOAD && ifn != IFN_MASK_STORE)
+	return false;
+      if (ifn != gimple_call_internal_fn (stmt2))
+	return false;
+
+      /* Check that the masks are the same.  Cope with casts of masks,
+	 like those created by build_mask_conversion.  */
+      tree mask1 = gimple_call_arg (stmt1, 2);
+      tree mask2 = gimple_call_arg (stmt2, 2);
+      if (!operand_equal_p (mask1, mask2, 0))
+	{
+	  mask1 = strip_conversion (mask1);
+	  if (!mask1)
+	    return false;
+	  mask2 = strip_conversion (mask2);
+	  if (!mask2)
+	    return false;
+	  if (!operand_equal_p (mask1, mask2, 0))
+	    return false;
+	}
+      return true;
+    }
+
+  return false;
+}
+
 /* Function vect_analyze_data_ref_accesses.
 
    Analyze the access pattern of all the data references in the loop.
@@ -2846,8 +2902,7 @@ vect_analyze_data_ref_accesses (vec_info
 	      || data_ref_compare_tree (DR_BASE_ADDRESS (dra),
 					DR_BASE_ADDRESS (drb)) != 0
 	      || data_ref_compare_tree (DR_OFFSET (dra), DR_OFFSET (drb)) != 0
-	      || !gimple_assign_single_p (DR_STMT (dra))
-	      || !gimple_assign_single_p (DR_STMT (drb)))
+	      || !can_group_stmts_p (DR_STMT (dra), DR_STMT (drb)))
 	    break;
 
 	  /* Check that the data-refs have the same constant size.  */
@@ -4684,15 +4739,21 @@ vect_grouped_store_supported (tree vecty
 }
 
 
-/* Return TRUE if vec_store_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{mask_}store_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			    bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_store_lanes",
-				       vec_store_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_store_lanes",
+					 vec_mask_store_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_store_lanes",
+					 vec_store_lanes_optab,
+					 vectype, count);
 }
 
 
@@ -5283,15 +5344,21 @@ vect_grouped_load_supported (tree vectyp
   return false;
 }
 
-/* Return TRUE if vec_load_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{masked_}load_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			   bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_load_lanes",
-				       vec_load_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_load_lanes",
+					 vec_mask_load_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_load_lanes",
+					 vec_load_lanes_optab,
+					 vectype, count);
 }
 
 /* Function vect_permute_load_chain.
Index: gcc/tree-vect-loop.c
===================================================================
--- gcc/tree-vect-loop.c	2018-01-12 14:45:51.039434496 +0000
+++ gcc/tree-vect-loop.c	2018-01-12 16:19:47.089865818 +0000
@@ -2250,7 +2250,7 @@ vect_analyze_loop_2 (loop_vec_info loop_
       vinfo = vinfo_for_stmt (STMT_VINFO_GROUP_FIRST_ELEMENT (vinfo));
       unsigned int size = STMT_VINFO_GROUP_SIZE (vinfo);
       tree vectype = STMT_VINFO_VECTYPE (vinfo);
-      if (! vect_store_lanes_supported (vectype, size)
+      if (! vect_store_lanes_supported (vectype, size, false)
 	  && ! vect_grouped_store_supported (vectype, size))
 	return false;
       FOR_EACH_VEC_ELT (SLP_INSTANCE_LOADS (instance), j, node)
@@ -2260,7 +2260,7 @@ vect_analyze_loop_2 (loop_vec_info loop_
 	  bool single_element_p = !STMT_VINFO_GROUP_NEXT_ELEMENT (vinfo);
 	  size = STMT_VINFO_GROUP_SIZE (vinfo);
 	  vectype = STMT_VINFO_VECTYPE (vinfo);
-	  if (! vect_load_lanes_supported (vectype, size)
+	  if (! vect_load_lanes_supported (vectype, size, false)
 	      && ! vect_grouped_load_supported (vectype, single_element_p,
 						size))
 	    return false;
Index: gcc/tree-vect-slp.c
===================================================================
--- gcc/tree-vect-slp.c	2018-01-09 15:46:34.439449019 +0000
+++ gcc/tree-vect-slp.c	2018-01-12 16:19:47.090865778 +0000
@@ -2189,7 +2189,7 @@ vect_analyze_slp_instance (vec_info *vin
 	 instructions do not generate this SLP instance.  */
       if (is_a <loop_vec_info> (vinfo)
 	  && loads_permuted
-	  && dr && vect_store_lanes_supported (vectype, group_size))
+	  && dr && vect_store_lanes_supported (vectype, group_size, false))
 	{
 	  slp_tree load_node;
 	  FOR_EACH_VEC_ELT (loads, i, load_node)
@@ -2202,7 +2202,7 @@ vect_analyze_slp_instance (vec_info *vin
 	      if (STMT_VINFO_STRIDED_P (stmt_vinfo)
 		  || ! vect_load_lanes_supported
 			(STMT_VINFO_VECTYPE (stmt_vinfo),
-			 GROUP_SIZE (stmt_vinfo)))
+			 GROUP_SIZE (stmt_vinfo), false))
 		break;
 	    }
 	  if (i == loads.length ())
Index: gcc/tree-vect-stmts.c
===================================================================
--- gcc/tree-vect-stmts.c	2018-01-12 14:45:51.040434457 +0000
+++ gcc/tree-vect-stmts.c	2018-01-12 16:19:47.090865778 +0000
@@ -1757,7 +1757,7 @@ vect_get_store_rhs (gimple *stmt)
 
 static bool
 get_group_load_store_type (gimple *stmt, tree vectype, bool slp,
-			   vec_load_store_type vls_type,
+			   bool masked_p, vec_load_store_type vls_type,
 			   vect_memory_access_type *memory_access_type)
 {
   stmt_vec_info stmt_info = vinfo_for_stmt (stmt);
@@ -1778,7 +1778,10 @@ get_group_load_store_type (gimple *stmt,
 
   /* True if we can cope with such overrun by peeling for gaps, so that
      there is at least one final scalar iteration after the vector loop.  */
-  bool can_overrun_p = (vls_type == VLS_LOAD && loop_vinfo && !loop->inner);
+  bool can_overrun_p = (!masked_p
+			&& vls_type == VLS_LOAD
+			&& loop_vinfo
+			&& !loop->inner);
 
   /* There can only be a gap at the end of the group if the stride is
      known at compile time.  */
@@ -1841,6 +1844,7 @@ get_group_load_store_type (gimple *stmt,
 	 and so we are guaranteed to access a non-gap element in the
 	 same B-sized block.  */
       if (would_overrun_p
+	  && !masked_p
 	  && gap < (vect_known_alignment_in_bytes (first_dr)
 		    / vect_get_scalar_dr_size (first_dr)))
 	would_overrun_p = false;
@@ -1857,8 +1861,9 @@ get_group_load_store_type (gimple *stmt,
 	  /* Otherwise try using LOAD/STORE_LANES.  */
 	  if (*memory_access_type == VMAT_ELEMENTWISE
 	      && (vls_type == VLS_LOAD
-		  ? vect_load_lanes_supported (vectype, group_size)
-		  : vect_store_lanes_supported (vectype, group_size)))
+		  ? vect_load_lanes_supported (vectype, group_size, masked_p)
+		  : vect_store_lanes_supported (vectype, group_size,
+						masked_p)))
 	    {
 	      *memory_access_type = VMAT_LOAD_STORE_LANES;
 	      overrun_p = would_overrun_p;
@@ -1884,8 +1889,7 @@ get_group_load_store_type (gimple *stmt,
       gimple *next_stmt = GROUP_NEXT_ELEMENT (stmt_info);
       while (next_stmt)
 	{
-	  gcc_assert (gimple_assign_single_p (next_stmt));
-	  tree op = gimple_assign_rhs1 (next_stmt);
+	  tree op = vect_get_store_rhs (next_stmt);
 	  gimple *def_stmt;
 	  enum vect_def_type dt;
 	  if (!vect_is_simple_use (op, vinfo, &def_stmt, &dt))
@@ -1969,11 +1973,12 @@ get_negative_load_store_type (gimple *st
    or scatters, fill in GS_INFO accordingly.
 
    SLP says whether we're performing SLP rather than loop vectorization.
+   MASKED_P is true if the statement is conditional on a vectorized mask.
    VECTYPE is the vector type that the vectorized statements will use.
    NCOPIES is the number of vector statements that will be needed.  */
 
 static bool
-get_load_store_type (gimple *stmt, tree vectype, bool slp,
+get_load_store_type (gimple *stmt, tree vectype, bool slp, bool masked_p,
 		     vec_load_store_type vls_type, unsigned int ncopies,
 		     vect_memory_access_type *memory_access_type,
 		     gather_scatter_info *gs_info)
@@ -2001,7 +2006,7 @@ get_load_store_type (gimple *stmt, tree
     }
   else if (STMT_VINFO_GROUPED_ACCESS (stmt_info))
     {
-      if (!get_group_load_store_type (stmt, vectype, slp, vls_type,
+      if (!get_group_load_store_type (stmt, vectype, slp, masked_p, vls_type,
 				      memory_access_type))
 	return false;
     }
@@ -5762,23 +5767,26 @@ vectorizable_store (gimple *stmt, gimple
     return false;
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, vls_type, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, mask, vls_type, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
   if (mask)
     {
-      if (memory_access_type != VMAT_CONTIGUOUS)
+      if (memory_access_type == VMAT_CONTIGUOUS)
+	{
+	  if (!VECTOR_MODE_P (vec_mode)
+	      || !can_vec_mask_load_store_p (vec_mode,
+					     TYPE_MODE (mask_vectype), false))
+	    return false;
+	}
+      else if (memory_access_type != VMAT_LOAD_STORE_LANES)
 	{
 	  if (dump_enabled_p ())
 	    dump_printf_loc (MSG_MISSED_OPTIMIZATION, vect_location,
 			     "unsupported access type for masked store.\n");
 	  return false;
 	}
-      if (!VECTOR_MODE_P (vec_mode)
-	  || !can_vec_mask_load_store_p (vec_mode, TYPE_MODE (mask_vectype),
-					 false))
-	return false;
     }
   else
     {
@@ -6421,12 +6429,27 @@ vectorizable_store (gimple *stmt, gimple
 	      write_vector_array (stmt, gsi, vec_oprnd, vec_array, i);
 	    }
 
-	  /* Emit:
-	       MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_STORE_LANES, 1,
-						    vec_array);
-	  gimple_call_set_lhs (call, data_ref);
+	  gcall *call;
+	  if (mask)
+	    {
+	      /* Emit:
+		   MASK_STORE_LANES (DATAREF_PTR, ALIAS_PTR, VEC_MASK,
+				     VEC_ARRAY).  */
+	      unsigned int align = TYPE_ALIGN_UNIT (TREE_TYPE (vectype));
+	      tree alias_ptr = build_int_cst (ref_type, align);
+	      call = gimple_build_call_internal (IFN_MASK_STORE_LANES, 4,
+						 dataref_ptr, alias_ptr,
+						 vec_mask, vec_array);
+	    }
+	  else
+	    {
+	      /* Emit:
+		   MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
+	      data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
+	      call = gimple_build_call_internal (IFN_STORE_LANES, 1,
+						 vec_array);
+	      gimple_call_set_lhs (call, data_ref);
+	    }
 	  gimple_call_set_nothrow (call, true);
 	  new_stmt = call;
 	  vect_finish_stmt_generation (stmt, new_stmt, gsi);
@@ -6870,7 +6893,7 @@ vectorizable_load (gimple *stmt, gimple_
     }
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, VLS_LOAD, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, mask, VLS_LOAD, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
@@ -6878,8 +6901,9 @@ vectorizable_load (gimple *stmt, gimple_
     {
       if (memory_access_type == VMAT_CONTIGUOUS)
 	{
-	  if (!VECTOR_MODE_P (TYPE_MODE (vectype))
-	      || !can_vec_mask_load_store_p (TYPE_MODE (vectype),
+	  machine_mode vec_mode = TYPE_MODE (vectype);
+	  if (!VECTOR_MODE_P (vec_mode)
+	      || !can_vec_mask_load_store_p (vec_mode,
 					     TYPE_MODE (mask_vectype), true))
 	    return false;
 	}
@@ -6897,7 +6921,7 @@ vectorizable_load (gimple *stmt, gimple_
 	      return false;
 	    }
 	}
-      else
+      else if (memory_access_type != VMAT_LOAD_STORE_LANES)
 	{
 	  if (dump_enabled_p ())
 	    dump_printf_loc (MSG_MISSED_OPTIMIZATION, vect_location,
@@ -7447,11 +7471,25 @@ vectorizable_load (gimple *stmt, gimple_
 
 	  vec_array = create_vector_array (vectype, vec_num);
 
-	  /* Emit:
-	       VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_LOAD_LANES, 1,
-						    data_ref);
+	  gcall *call;
+	  if (mask)
+	    {
+	      /* Emit:
+		   VEC_ARRAY = MASK_LOAD_LANES (DATAREF_PTR, ALIAS_PTR,
+		                                VEC_MASK).  */
+	      unsigned int align = TYPE_ALIGN_UNIT (TREE_TYPE (vectype));
+	      tree alias_ptr = build_int_cst (ref_type, align);
+	      call = gimple_build_call_internal (IFN_MASK_LOAD_LANES, 3,
+						 dataref_ptr, alias_ptr,
+						 vec_mask);
+	    }
+	  else
+	    {
+	      /* Emit:
+		   VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
+	      data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
+	      call = gimple_build_call_internal (IFN_LOAD_LANES, 1, data_ref);
+	    }
 	  gimple_call_set_lhs (call, vec_array);
 	  gimple_call_set_nothrow (call, true);
 	  new_stmt = call;
Index: gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,12 @@
+/* { dg-do compile } */
+
+void
+f (int *restrict a, int *restrict b, int *restrict c)
+{
+  for (int i = 0; i < 100; ++i)
+    if (c[i])
+      {
+	a[i * 2] = b[i * 5 + 2];
+	a[i * 2 + 1] = b[i * 5];
+      }
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_1.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_1.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2] + src[i * 2 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_1_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_1_run.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_load_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 2];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 2; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_2 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = in[i * 2] + in[i * 2 + 1];	\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_2.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_2.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,69 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 3]					\
+		   + src[i * 3 + 1]				\
+		   + src[i * 3 + 2]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_2_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_2_run.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,40 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_load_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 3];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 3; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_3 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 3]			\
+			   + in[i * 3 + 1]		\
+			   + in[i * 3 + 2]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_3.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_3.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,70 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 4]					\
+		   + src[i * 4 + 1]				\
+		   + src[i * 4 + 2]				\
+		   + src[i * 4 + 3]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_3_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_3_run.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,41 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_load_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 4];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 4; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_4 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 4]			\
+			   + in[i * 4 + 1]		\
+			   + in[i * 4 + 2]		\
+			   + in[i * 4 + 3]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_4.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_4.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_5.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_5.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,67 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 3];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_6.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_6.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2];					\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld2b\t} } } */
+/* { dg-final { scan-assembler-not {\tld2h\t} } } */
+/* { dg-final { scan-assembler-not {\tld2w\t} } } */
+/* { dg-final { scan-assembler-not {\tld2d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_7.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_7.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld3b\t} } } */
+/* { dg-final { scan-assembler-not {\tld3h\t} } } */
+/* { dg-final { scan-assembler-not {\tld3w\t} } } */
+/* { dg-final { scan-assembler-not {\tld3d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_8.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_load_8.c	2018-01-12 16:19:47.087865899 +0000
@@ -0,0 +1,40 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld4b\t} } } */
+/* { dg-final { scan-assembler-not {\tld4h\t} } } */
+/* { dg-final { scan-assembler-not {\tld4w\t} } } */
+/* { dg-final { scan-assembler-not {\tld4d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_1.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_1.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,73 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 2] = value;				\
+	    dest[i * 2 + 1] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_1_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_1_run.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_store_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 2];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 2; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_2 (out, in, mask, 17, N);				\
+    for (int i = 0; i < N * 2; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 2] + 17);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 2] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_2.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_2.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,74 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 3] = value;				\
+	    dest[i * 3 + 1] = value;				\
+	    dest[i * 3 + 2] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_2_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_2_run.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_store_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 3];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 3; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_3 (out, in, mask, 11, N);				\
+    for (int i = 0; i < N * 3; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 3] + 11);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 3] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_3.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_3.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,75 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, INTYPE bias, int n)	\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	INTYPE value = src[i] + bias;				\
+	if (cond[i])						\
+	  {							\
+	    dest[i * 4] = value;				\
+	    dest[i * 4 + 1] = value;				\
+	    dest[i * 4 + 2] = value;				\
+	    dest[i * 4 + 3] = value;				\
+	  }							\
+      }								\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_3_run.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_3_run.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,38 @@
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#include "mask_struct_store_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 4];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 4; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_4 (out, in, mask, 42, N);				\
+    for (int i = 0; i < N * 4; ++i)				\
+      {								\
+	OUTTYPE if_true = (INTYPE) (in[i / 4] + 42);		\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 4] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_4.c
===================================================================
--- /dev/null	2018-01-12 06:40:27.684409621 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve/mask_struct_store_4.c	2018-01-12 16:19:47.088865859 +0000
@@ -0,0 +1,44 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	if (cond[i] < 8)					\
+	  dest[i * 2] = src[i];					\
+	if (cond[i] > 2)					\
+	  dest[i * 2 + 1] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tst2b\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2h\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2w\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2d\t.z[0-9]} } } */
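
For readers skimming the hunks above, here is a rough sketch of the GIMPLE the
masked lane forms are intended to produce.  It is pieced together from the
"Emit:" comments in the vectorizable_store/vectorizable_load changes rather
than taken from an actual vectorizer dump, and the variable names are purely
illustrative:

      /* Masked grouped load: the whole interleaved group is loaded under
	 one mask and then split into per-vector results.  */
      vect_array = MASK_LOAD_LANES (dataref_ptr, alias_ptr, vec_mask);
      vect_a = vect_array[0];
      vect_b = vect_array[1];

      /* Masked grouped store: the per-vector operands are collected into
	 an array and stored under the same mask.  */
      vect_array[0] = vect_a;
      vect_array[1] = vect_b;
      MASK_STORE_LANES (dataref_ptr, alias_ptr, vec_mask, vect_array);

Here alias_ptr carries the alignment information (built via
build_int_cst (ref_type, align) in the patch) and vec_mask is the vectorized
condition, mirroring the operand layout of the existing IFN_MASK_LOAD and
IFN_MASK_STORE calls.
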
Jeff Law Jan. 13, 2018, 3:50 p.m. UTC | #5
On 01/12/2018 09:28 AM, Richard Sandiford wrote:
> 
> Here's the patch with the updated docs.  Does this version look OK?
> 
> Thanks,
> Richard
> 
> 
> 2018-01-12  Richard Sandiford  <richard.sandiford@linaro.org>
> 	    Alan Hayward  <alan.hayward@arm.com>
> 	    David Sherwood  <david.sherwood@arm.com>
> 
> gcc/
> 	* doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
> 	(vec_mask_store_lanes@var{m}@var{n}): Likewise.
> 	* optabs.def (vec_mask_load_lanes_optab): New optab.
> 	(vec_mask_store_lanes_optab): Likewise.
> 	* internal-fn.def (MASK_LOAD_LANES): New internal function.
> 	(MASK_STORE_LANES): Likewise.
> 	* internal-fn.c (mask_load_lanes_direct): New macro.
> 	(mask_store_lanes_direct): Likewise.
> 	(expand_mask_load_optab_fn): Handle masked operations.
> 	(expand_mask_load_lanes_optab_fn): New macro.
> 	(expand_mask_store_optab_fn): Handle masked operations.
> 	(expand_mask_store_lanes_optab_fn): New macro.
> 	(direct_mask_load_lanes_optab_supported_p): Likewise.
> 	(direct_mask_store_lanes_optab_supported_p): Likewise.
> 	* tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
> 	parameter.
> 	(vect_load_lanes_supported): Likewise.
> 	* tree-vect-data-refs.c (strip_conversion): New function.
> 	(can_group_stmts_p): Likewise.
> 	(vect_analyze_data_ref_accesses): Use it instead of checking
> 	for a pair of assignments.
> 	(vect_store_lanes_supported): Take a masked_p parameter.
> 	(vect_load_lanes_supported): Likewise.
> 	* tree-vect-loop.c (vect_analyze_loop_2): Update calls to
> 	vect_store_lanes_supported and vect_load_lanes_supported.
> 	* tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
> 	* tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
> 	parameter.  Don't allow gaps for masked accesses.
> 	Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
> 	and vect_load_lanes_supported.
> 	(get_load_store_type): Take a masked_p parameter and update
> 	call to get_group_load_store_type.
> 	(vectorizable_store): Update call to get_load_store_type.
> 	Handle IFN_MASK_STORE_LANES.
> 	(vectorizable_load): Update call to get_load_store_type.
> 	Handle IFN_MASK_LOAD_LANES.
> 
> gcc/testsuite/
> 	* gcc.dg/vect/vect-ooo-group-1.c: New test.
> 	* gcc.target/aarch64/sve/mask_struct_load_1.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_1_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_2.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_2_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_3.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_3_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_4.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_5.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_6.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_7.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_load_8.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_1.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_1_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_2.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_2_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_3.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_3_run.c: Likewise.
> 	* gcc.target/aarch64/sve/mask_struct_store_4.c: Likewise.
OK.  I guess in retrospect I should have made the assumption that the
docs were slightly off and reviewed the rest in that light.

Sorry for making this wait.


Jeff
Christophe Lyon Jan. 15, 2018, 9:40 a.m. UTC | #6
On 13 January 2018 at 16:50, Jeff Law <law@redhat.com> wrote:
> On 01/12/2018 09:28 AM, Richard Sandiford wrote:
>>
>> Here's the patch with the updated docs.  Does this version look OK?
>>
>> Thanks,
>> Richard
>>
>>
>> 2018-01-12  Richard Sandiford  <richard.sandiford@linaro.org>
>>           Alan Hayward  <alan.hayward@arm.com>
>>           David Sherwood  <david.sherwood@arm.com>
>>
>> gcc/
>>       * doc/md.texi (vec_mask_load_lanes@var{m}@var{n}): Document.
>>       (vec_mask_store_lanes@var{m}@var{n}): Likewise.
>>       * optabs.def (vec_mask_load_lanes_optab): New optab.
>>       (vec_mask_store_lanes_optab): Likewise.
>>       * internal-fn.def (MASK_LOAD_LANES): New internal function.
>>       (MASK_STORE_LANES): Likewise.
>>       * internal-fn.c (mask_load_lanes_direct): New macro.
>>       (mask_store_lanes_direct): Likewise.
>>       (expand_mask_load_optab_fn): Handle masked operations.
>>       (expand_mask_load_lanes_optab_fn): New macro.
>>       (expand_mask_store_optab_fn): Handle masked operations.
>>       (expand_mask_store_lanes_optab_fn): New macro.
>>       (direct_mask_load_lanes_optab_supported_p): Likewise.
>>       (direct_mask_store_lanes_optab_supported_p): Likewise.
>>       * tree-vectorizer.h (vect_store_lanes_supported): Take a masked_p
>>       parameter.
>>       (vect_load_lanes_supported): Likewise.
>>       * tree-vect-data-refs.c (strip_conversion): New function.
>>       (can_group_stmts_p): Likewise.
>>       (vect_analyze_data_ref_accesses): Use it instead of checking
>>       for a pair of assignments.
>>       (vect_store_lanes_supported): Take a masked_p parameter.
>>       (vect_load_lanes_supported): Likewise.
>>       * tree-vect-loop.c (vect_analyze_loop_2): Update calls to
>>       vect_store_lanes_supported and vect_load_lanes_supported.
>>       * tree-vect-slp.c (vect_analyze_slp_instance): Likewise.
>>       * tree-vect-stmts.c (get_group_load_store_type): Take a masked_p
>>       parameter.  Don't allow gaps for masked accesses.
>>       Use vect_get_store_rhs.  Update calls to vect_store_lanes_supported
>>       and vect_load_lanes_supported.
>>       (get_load_store_type): Take a masked_p parameter and update
>>       call to get_group_load_store_type.
>>       (vectorizable_store): Update call to get_load_store_type.
>>       Handle IFN_MASK_STORE_LANES.
>>       (vectorizable_load): Update call to get_load_store_type.
>>       Handle IFN_MASK_LOAD_LANES.
>>
>> gcc/testsuite/
>>       * gcc.dg/vect/vect-ooo-group-1.c: New test.
>>       * gcc.target/aarch64/sve/mask_struct_load_1.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_1_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_2.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_2_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_3.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_3_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_4.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_5.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_6.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_7.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_load_8.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_1.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_1_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_2.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_2_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_3.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_3_run.c: Likewise.
>>       * gcc.target/aarch64/sve/mask_struct_store_4.c: Likewise.
> OK.  I guess in retrospect I should have made the assumption that the
> docs were slightly off and reviewed the rest in that light.
>
> Sorry for making this wait.
>
>
Hi Richard,

I've noticed that this commit (r256620) causes new failures, see:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83845


> Jeff
>
>
diff mbox series

Patch

Index: gcc/optabs.def
===================================================================
--- gcc/optabs.def	2017-11-08 15:05:55.697852337 +0000
+++ gcc/optabs.def	2017-11-08 16:35:04.763816035 +0000
@@ -80,6 +80,8 @@  OPTAB_CD(ssmsub_widen_optab, "ssmsub$b$a
 OPTAB_CD(usmsub_widen_optab, "usmsub$a$b4")
 OPTAB_CD(vec_load_lanes_optab, "vec_load_lanes$a$b")
 OPTAB_CD(vec_store_lanes_optab, "vec_store_lanes$a$b")
+OPTAB_CD(vec_mask_load_lanes_optab, "vec_mask_load_lanes$a$b")
+OPTAB_CD(vec_mask_store_lanes_optab, "vec_mask_store_lanes$a$b")
 OPTAB_CD(vcond_optab, "vcond$a$b")
 OPTAB_CD(vcondu_optab, "vcondu$a$b")
 OPTAB_CD(vcondeq_optab, "vcondeq$a$b")
Index: gcc/internal-fn.def
===================================================================
--- gcc/internal-fn.def	2017-11-01 08:07:13.340797708 +0000
+++ gcc/internal-fn.def	2017-11-08 16:35:04.763816035 +0000
@@ -45,9 +45,11 @@  along with GCC; see the file COPYING3.
 
    - mask_load: currently just maskload
    - load_lanes: currently just vec_load_lanes
+   - mask_load_lanes: currently just vec_mask_load_lanes
 
    - mask_store: currently just maskstore
    - store_lanes: currently just vec_store_lanes
+   - mask_store_lanes: currently just vec_mask_store_lanes
 
    DEF_INTERNAL_FLT_FN is like DEF_INTERNAL_OPTAB_FN, but in addition,
    the function implements the computational part of a built-in math
@@ -92,9 +94,13 @@  along with GCC; see the file COPYING3.
 
 DEF_INTERNAL_OPTAB_FN (MASK_LOAD, ECF_PURE, maskload, mask_load)
 DEF_INTERNAL_OPTAB_FN (LOAD_LANES, ECF_CONST, vec_load_lanes, load_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_LOAD_LANES, ECF_PURE,
+		       vec_mask_load_lanes, mask_load_lanes)
 
 DEF_INTERNAL_OPTAB_FN (MASK_STORE, 0, maskstore, mask_store)
 DEF_INTERNAL_OPTAB_FN (STORE_LANES, ECF_CONST, vec_store_lanes, store_lanes)
+DEF_INTERNAL_OPTAB_FN (MASK_STORE_LANES, 0,
+		       vec_mask_store_lanes, mask_store_lanes)
 
 DEF_INTERNAL_OPTAB_FN (RSQRT, ECF_CONST, rsqrt, unary)
 
Index: gcc/internal-fn.c
===================================================================
--- gcc/internal-fn.c	2017-11-08 15:05:55.618852345 +0000
+++ gcc/internal-fn.c	2017-11-08 16:35:04.763816035 +0000
@@ -79,8 +79,10 @@  #define DEF_INTERNAL_FN(CODE, FLAGS, FNS
 #define not_direct { -2, -2, false }
 #define mask_load_direct { -1, 2, false }
 #define load_lanes_direct { -1, -1, false }
+#define mask_load_lanes_direct { -1, -1, false }
 #define mask_store_direct { 3, 2, false }
 #define store_lanes_direct { 0, 0, false }
+#define mask_store_lanes_direct { 0, 0, false }
 #define unary_direct { 0, 0, true }
 #define binary_direct { 0, 0, true }
 
@@ -2277,7 +2279,7 @@  expand_LOOP_DIST_ALIAS (internal_fn, gca
   gcc_unreachable ();
 }
 
-/* Expand MASK_LOAD call STMT using optab OPTAB.  */
+/* Expand MASK_LOAD{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_load_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2286,6 +2288,7 @@  expand_mask_load_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, target, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   lhs = gimple_call_lhs (stmt);
@@ -2298,6 +2301,12 @@  expand_mask_load_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   rhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_load_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (rhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2305,12 +2314,12 @@  expand_mask_load_optab_fn (internal_fn,
   create_output_operand (&ops[0], target, TYPE_MODE (type));
   create_fixed_operand (&ops[1], mem);
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
-/* Expand MASK_STORE call STMT using optab OPTAB.  */
+#define expand_mask_load_lanes_optab_fn expand_mask_load_optab_fn
+
+/* Expand MASK_STORE{,_LANES} call STMT using optab OPTAB.  */
 
 static void
 expand_mask_store_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
@@ -2319,6 +2328,7 @@  expand_mask_store_optab_fn (internal_fn,
   tree type, lhs, rhs, maskt, ptr;
   rtx mem, reg, mask;
   unsigned align;
+  insn_code icode;
 
   maskt = gimple_call_arg (stmt, 2);
   rhs = gimple_call_arg (stmt, 3);
@@ -2329,6 +2339,12 @@  expand_mask_store_optab_fn (internal_fn,
     type = build_aligned_type (type, align);
   lhs = fold_build2 (MEM_REF, type, gimple_call_arg (stmt, 0), ptr);
 
+  if (optab == vec_mask_store_lanes_optab)
+    icode = get_multi_vector_move (type, optab);
+  else
+    icode = convert_optab_handler (optab, TYPE_MODE (type),
+				   TYPE_MODE (TREE_TYPE (maskt)));
+
   mem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
   gcc_assert (MEM_P (mem));
   mask = expand_normal (maskt);
@@ -2336,11 +2352,11 @@  expand_mask_store_optab_fn (internal_fn,
   create_fixed_operand (&ops[0], mem);
   create_input_operand (&ops[1], reg, TYPE_MODE (type));
   create_input_operand (&ops[2], mask, TYPE_MODE (TREE_TYPE (maskt)));
-  expand_insn (convert_optab_handler (optab, TYPE_MODE (type),
-				      TYPE_MODE (TREE_TYPE (maskt))),
-	       3, ops);
+  expand_insn (icode, 3, ops);
 }
 
+#define expand_mask_store_lanes_optab_fn expand_mask_store_optab_fn
+
 static void
 expand_ABNORMAL_DISPATCHER (internal_fn, gcall *)
 {
@@ -2732,8 +2748,10 @@  #define direct_unary_optab_supported_p d
 #define direct_binary_optab_supported_p direct_optab_supported_p
 #define direct_mask_load_optab_supported_p direct_optab_supported_p
 #define direct_load_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_load_lanes_optab_supported_p multi_vector_optab_supported_p
 #define direct_mask_store_optab_supported_p direct_optab_supported_p
 #define direct_store_lanes_optab_supported_p multi_vector_optab_supported_p
+#define direct_mask_store_lanes_optab_supported_p multi_vector_optab_supported_p
 
 /* Return true if FN is supported for the types in TYPES when the
    optimization type is OPT_TYPE.  The types are those associated with
Index: gcc/tree-vectorizer.h
===================================================================
--- gcc/tree-vectorizer.h	2017-11-08 15:05:33.791822333 +0000
+++ gcc/tree-vectorizer.h	2017-11-08 16:35:04.771159765 +0000
@@ -1284,9 +1284,9 @@  extern tree bump_vector_ptr (tree, gimpl
 			     tree);
 extern tree vect_create_destination_var (tree, tree);
 extern bool vect_grouped_store_supported (tree, unsigned HOST_WIDE_INT);
-extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_store_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern bool vect_grouped_load_supported (tree, bool, unsigned HOST_WIDE_INT);
-extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT);
+extern bool vect_load_lanes_supported (tree, unsigned HOST_WIDE_INT, bool);
 extern void vect_permute_store_chain (vec<tree> ,unsigned int, gimple *,
                                     gimple_stmt_iterator *, vec<tree> *);
 extern tree vect_setup_realignment (gimple *, gimple_stmt_iterator *, tree *,
Index: gcc/tree-vect-data-refs.c
===================================================================
--- gcc/tree-vect-data-refs.c	2017-11-08 15:06:16.087850270 +0000
+++ gcc/tree-vect-data-refs.c	2017-11-08 16:35:04.768405866 +0000
@@ -2791,6 +2791,62 @@  dr_group_sort_cmp (const void *dra_, con
   return cmp;
 }
 
+/* If OP is the result of a conversion, return the unconverted value,
+   otherwise return null.  */
+
+static tree
+strip_conversion (tree op)
+{
+  if (TREE_CODE (op) != SSA_NAME)
+    return NULL_TREE;
+  gimple *stmt = SSA_NAME_DEF_STMT (op);
+  if (!is_gimple_assign (stmt)
+      || !CONVERT_EXPR_CODE_P (gimple_assign_rhs_code (stmt)))
+    return NULL_TREE;
+  return gimple_assign_rhs1 (stmt);
+}
+
+/* Return true if vectorizable_* routines can handle statements STMT1
+   and STMT2 being in a single group.  */
+
+static bool
+can_group_stmts_p (gimple *stmt1, gimple *stmt2)
+{
+  if (gimple_assign_single_p (stmt1))
+    return gimple_assign_single_p (stmt2);
+
+  if (is_gimple_call (stmt1) && gimple_call_internal_p (stmt1))
+    {
+      /* Check for two masked loads or two masked stores.  */
+      if (!is_gimple_call (stmt2) || !gimple_call_internal_p (stmt2))
+	return false;
+      internal_fn ifn = gimple_call_internal_fn (stmt1);
+      if (ifn != IFN_MASK_LOAD && ifn != IFN_MASK_STORE)
+	return false;
+      if (ifn != gimple_call_internal_fn (stmt2))
+	return false;
+
+      /* Check that the masks are the same.  Cope with casts of masks,
+	 like those created by build_mask_conversion.  */
+      tree mask1 = gimple_call_arg (stmt1, 2);
+      tree mask2 = gimple_call_arg (stmt2, 2);
+      if (!operand_equal_p (mask1, mask2, 0))
+	{
+	  mask1 = strip_conversion (mask1);
+	  if (!mask1)
+	    return false;
+	  mask2 = strip_conversion (mask2);
+	  if (!mask2)
+	    return false;
+	  if (!operand_equal_p (mask1, mask2, 0))
+	    return false;
+	}
+      return true;
+    }
+
+  return false;
+}
+
 /* Function vect_analyze_data_ref_accesses.
 
    Analyze the access pattern of all the data references in the loop.
@@ -2857,8 +2913,7 @@  vect_analyze_data_ref_accesses (vec_info
 	      || data_ref_compare_tree (DR_BASE_ADDRESS (dra),
 					DR_BASE_ADDRESS (drb)) != 0
 	      || data_ref_compare_tree (DR_OFFSET (dra), DR_OFFSET (drb)) != 0
-	      || !gimple_assign_single_p (DR_STMT (dra))
-	      || !gimple_assign_single_p (DR_STMT (drb)))
+	      || !can_group_stmts_p (DR_STMT (dra), DR_STMT (drb)))
 	    break;
 
 	  /* Check that the data-refs have the same constant size.  */
@@ -4662,15 +4717,21 @@  vect_grouped_store_supported (tree vecty
 }
 
 
-/* Return TRUE if vec_store_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{mask_}store_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_store_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			    bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_store_lanes",
-				       vec_store_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_store_lanes",
+					 vec_mask_store_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_store_lanes",
+					 vec_store_lanes_optab,
+					 vectype, count);
 }
 
 
@@ -5238,15 +5299,21 @@  vect_grouped_load_supported (tree vectyp
   return false;
 }
 
-/* Return TRUE if vec_load_lanes is available for COUNT vectors of
-   type VECTYPE.  */
+/* Return TRUE if vec_{mask_}load_lanes is available for COUNT vectors of
+   type VECTYPE.  MASKED_P says whether the masked form is needed.  */
 
 bool
-vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count)
+vect_load_lanes_supported (tree vectype, unsigned HOST_WIDE_INT count,
+			   bool masked_p)
 {
-  return vect_lanes_optab_supported_p ("vec_load_lanes",
-				       vec_load_lanes_optab,
-				       vectype, count);
+  if (masked_p)
+    return vect_lanes_optab_supported_p ("vec_mask_load_lanes",
+					 vec_mask_load_lanes_optab,
+					 vectype, count);
+  else
+    return vect_lanes_optab_supported_p ("vec_load_lanes",
+					 vec_load_lanes_optab,
+					 vectype, count);
 }
 
 /* Function vect_permute_load_chain.
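
(Illustration, not part of the patch: the kind of source loop that
can_group_stmts_p now allows to be grouped.  After if-conversion, both stores
below become IFN_MASK_STORE calls guarded by the same mask, possibly through
a build_mask_conversion cast, and the new check lets them form a single
interleaving group; this mirrors the sve_mask_struct_store_1.c test added
later in the patch.)

    void
    f (int *restrict dest, int *restrict src, int *restrict cond, int n)
    {
      for (int i = 0; i < n; ++i)
        if (cond[i])
          {
            dest[i * 2] = src[i];      /* if-converted to IFN_MASK_STORE */
            dest[i * 2 + 1] = src[i];  /* IFN_MASK_STORE with the same mask */
          }
    }
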
Index: gcc/tree-vect-loop.c
===================================================================
--- gcc/tree-vect-loop.c	2017-11-08 15:05:36.349044117 +0000
+++ gcc/tree-vect-loop.c	2017-11-08 16:35:04.770241799 +0000
@@ -2247,7 +2247,7 @@  vect_analyze_loop_2 (loop_vec_info loop_
       vinfo = vinfo_for_stmt (STMT_VINFO_GROUP_FIRST_ELEMENT (vinfo));
       unsigned int size = STMT_VINFO_GROUP_SIZE (vinfo);
       tree vectype = STMT_VINFO_VECTYPE (vinfo);
-      if (! vect_store_lanes_supported (vectype, size)
+      if (! vect_store_lanes_supported (vectype, size, false)
 	  && ! vect_grouped_store_supported (vectype, size))
 	return false;
       FOR_EACH_VEC_ELT (SLP_INSTANCE_LOADS (instance), j, node)
@@ -2257,7 +2257,7 @@  vect_analyze_loop_2 (loop_vec_info loop_
 	  bool single_element_p = !STMT_VINFO_GROUP_NEXT_ELEMENT (vinfo);
 	  size = STMT_VINFO_GROUP_SIZE (vinfo);
 	  vectype = STMT_VINFO_VECTYPE (vinfo);
-	  if (! vect_load_lanes_supported (vectype, size)
+	  if (! vect_load_lanes_supported (vectype, size, false)
 	      && ! vect_grouped_load_supported (vectype, single_element_p,
 						size))
 	    return false;
Index: gcc/tree-vect-slp.c
===================================================================
--- gcc/tree-vect-slp.c	2017-11-08 15:05:34.296308263 +0000
+++ gcc/tree-vect-slp.c	2017-11-08 16:35:04.770241799 +0000
@@ -2175,7 +2175,7 @@  vect_analyze_slp_instance (vec_info *vin
 	 instructions do not generate this SLP instance.  */
       if (is_a <loop_vec_info> (vinfo)
 	  && loads_permuted
-	  && dr && vect_store_lanes_supported (vectype, group_size))
+	  && dr && vect_store_lanes_supported (vectype, group_size, false))
 	{
 	  slp_tree load_node;
 	  FOR_EACH_VEC_ELT (loads, i, load_node)
@@ -2188,7 +2188,7 @@  vect_analyze_slp_instance (vec_info *vin
 	      if (STMT_VINFO_STRIDED_P (stmt_vinfo)
 		  || ! vect_load_lanes_supported
 			(STMT_VINFO_VECTYPE (stmt_vinfo),
-			 GROUP_SIZE (stmt_vinfo)))
+			 GROUP_SIZE (stmt_vinfo), false))
 		break;
 	    }
 	  if (i == loads.length ())
Index: gcc/tree-vect-stmts.c
===================================================================
--- gcc/tree-vect-stmts.c	2017-11-08 15:05:36.350875282 +0000
+++ gcc/tree-vect-stmts.c	2017-11-08 16:35:04.771159765 +0000
@@ -1700,6 +1700,69 @@  vectorizable_internal_function (combined
 static tree permute_vec_elements (tree, tree, tree, gimple *,
 				  gimple_stmt_iterator *);
 
+/* Replace IFN_MASK_LOAD statement STMT with a dummy assignment, to ensure
+   that it won't be expanded even when there's no following DCE pass.  */
+
+static void
+replace_mask_load (gimple *stmt, gimple_stmt_iterator *gsi)
+{
+  /* If this statement is part of a pattern created by the vectorizer,
+     get the original statement.  */
+  stmt_vec_info stmt_info = vinfo_for_stmt (stmt);
+  if (STMT_VINFO_RELATED_STMT (stmt_info))
+    {
+      stmt = STMT_VINFO_RELATED_STMT (stmt_info);
+      stmt_info = vinfo_for_stmt (stmt);
+    }
+
+  gcc_assert (gsi_stmt (*gsi) == stmt);
+  tree lhs = gimple_call_lhs (stmt);
+  tree zero = build_zero_cst (TREE_TYPE (lhs));
+  gimple *new_stmt = gimple_build_assign (lhs, zero);
+  set_vinfo_for_stmt (new_stmt, stmt_info);
+  set_vinfo_for_stmt (stmt, NULL);
+  STMT_VINFO_STMT (stmt_info) = new_stmt;
+
+  /* If STMT was the first statement in a group, redirect all
+     GROUP_FIRST_ELEMENT pointers to the new statement (which has the
+     same stmt_info as the old statement).  */
+  if (GROUP_FIRST_ELEMENT (stmt_info) == stmt)
+    {
+      gimple *group_stmt = new_stmt;
+      do
+	{
+	  GROUP_FIRST_ELEMENT (vinfo_for_stmt (group_stmt)) = new_stmt;
+	  group_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (group_stmt));
+	}
+      while (group_stmt);
+    }
+  else if (GROUP_FIRST_ELEMENT (stmt_info))
+    {
+      /* Otherwise redirect the GROUP_NEXT_ELEMENT.  It would be more
+	 efficient if these pointers were to the stmt_vec_info rather
+	 than the gimple statements themselves, but this is by no means
+	 the only quadratic loop for groups.  */
+      gimple *group_stmt = GROUP_FIRST_ELEMENT (stmt_info);
+      while (GROUP_NEXT_ELEMENT (vinfo_for_stmt (group_stmt)) != stmt)
+	group_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (group_stmt));
+      GROUP_NEXT_ELEMENT (vinfo_for_stmt (group_stmt)) = new_stmt;
+    }
+  gsi_replace (gsi, new_stmt, true);
+}
+
+/* STMT is either a masked or unconditional store.  Return the value
+   being stored.  */
+
+static tree
+get_store_op (gimple *stmt)
+{
+  if (gimple_assign_single_p (stmt))
+    return gimple_assign_rhs1 (stmt);
+  if (gimple_call_internal_p (stmt, IFN_MASK_STORE))
+    return gimple_call_arg (stmt, 3);
+  gcc_unreachable ();
+}
+
 /* STMT is a non-strided load or store, meaning that it accesses
    elements with a known constant step.  Return -1 if that step
    is negative, 0 if it is zero, and 1 if it is greater than zero.  */
@@ -1744,7 +1807,7 @@  perm_mask_for_reverse (tree vectype)
 
 static bool
 get_group_load_store_type (gimple *stmt, tree vectype, bool slp,
-			   vec_load_store_type vls_type,
+			   bool masked_p, vec_load_store_type vls_type,
 			   vect_memory_access_type *memory_access_type)
 {
   stmt_vec_info stmt_info = vinfo_for_stmt (stmt);
@@ -1765,7 +1828,10 @@  get_group_load_store_type (gimple *stmt,
 
   /* True if we can cope with such overrun by peeling for gaps, so that
      there is at least one final scalar iteration after the vector loop.  */
-  bool can_overrun_p = (vls_type == VLS_LOAD && loop_vinfo && !loop->inner);
+  bool can_overrun_p = (!masked_p
+			&& vls_type == VLS_LOAD
+			&& loop_vinfo
+			&& !loop->inner);
 
   /* There can only be a gap at the end of the group if the stride is
      known at compile time.  */
@@ -1828,6 +1894,7 @@  get_group_load_store_type (gimple *stmt,
 	 and so we are guaranteed to access a non-gap element in the
 	 same B-sized block.  */
       if (would_overrun_p
+	  && !masked_p
 	  && gap < (vect_known_alignment_in_bytes (first_dr)
 		    / vect_get_scalar_dr_size (first_dr)))
 	would_overrun_p = false;
@@ -1838,8 +1905,8 @@  get_group_load_store_type (gimple *stmt,
 	{
 	  /* First try using LOAD/STORE_LANES.  */
 	  if (vls_type == VLS_LOAD
-	      ? vect_load_lanes_supported (vectype, group_size)
-	      : vect_store_lanes_supported (vectype, group_size))
+	      ? vect_load_lanes_supported (vectype, group_size, masked_p)
+	      : vect_store_lanes_supported (vectype, group_size, masked_p))
 	    {
 	      *memory_access_type = VMAT_LOAD_STORE_LANES;
 	      overrun_p = would_overrun_p;
@@ -1865,8 +1932,7 @@  get_group_load_store_type (gimple *stmt,
       gimple *next_stmt = GROUP_NEXT_ELEMENT (stmt_info);
       while (next_stmt)
 	{
-	  gcc_assert (gimple_assign_single_p (next_stmt));
-	  tree op = gimple_assign_rhs1 (next_stmt);
+	  tree op = get_store_op (next_stmt);
 	  gimple *def_stmt;
 	  enum vect_def_type dt;
 	  if (!vect_is_simple_use (op, vinfo, &def_stmt, &dt))
@@ -1950,11 +2016,12 @@  get_negative_load_store_type (gimple *st
    or scatters, fill in GS_INFO accordingly.
 
    SLP says whether we're performing SLP rather than loop vectorization.
+   MASKED_P is true if the statement is conditional on a vectorized mask.
    VECTYPE is the vector type that the vectorized statements will use.
    NCOPIES is the number of vector statements that will be needed.  */
 
 static bool
-get_load_store_type (gimple *stmt, tree vectype, bool slp,
+get_load_store_type (gimple *stmt, tree vectype, bool slp, bool masked_p,
 		     vec_load_store_type vls_type, unsigned int ncopies,
 		     vect_memory_access_type *memory_access_type,
 		     gather_scatter_info *gs_info)
@@ -1982,7 +2049,7 @@  get_load_store_type (gimple *stmt, tree
     }
   else if (STMT_VINFO_GROUPED_ACCESS (stmt_info))
     {
-      if (!get_group_load_store_type (stmt, vectype, slp, vls_type,
+      if (!get_group_load_store_type (stmt, vectype, slp, masked_p, vls_type,
 				      memory_access_type))
 	return false;
     }
@@ -2031,6 +2098,174 @@  get_load_store_type (gimple *stmt, tree
   return true;
 }
 
+/* Set up the stored values for the first copy of a vectorized store.
+   GROUP_SIZE is the number of stores in the group (which is 1 for
+   ungrouped stores).  FIRST_STMT is the first statement in the group.
+
+   On return, initialize OPERANDS to a new vector in which element I
+   is the value that the first copy of group member I should store.
+   The caller should free OPERANDS after use.  */
+
+static void
+init_stored_values (unsigned int group_size, gimple *first_stmt,
+		    vec<tree> *operands)
+{
+  operands->create (group_size);
+  gimple *next_stmt = first_stmt;
+  for (unsigned int i = 0; i < group_size; i++)
+    {
+      /* Since gaps are not supported for interleaved stores,
+	 GROUP_SIZE is the exact number of stmts in the chain.
+	 Therefore, NEXT_STMT can't be NULL_TREE.  In case that
+	 there is no interleaving, GROUP_SIZE is 1, and only one
+	 iteration of the loop will be executed.  */
+      gcc_assert (next_stmt);
+      tree op = get_store_op (next_stmt);
+      tree vec_op = vect_get_vec_def_for_operand (op, next_stmt);
+      operands->quick_push (vec_op);
+      next_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (next_stmt));
+    }
+}
+
+/* OPERANDS is a vector set up by init_stored_values.  Update each element
+   for the next copy of each statement.  GROUP_SIZE and FIRST_STMT are
+   as for init_stored_values.  */
+
+static void
+advance_stored_values (unsigned int group_size, gimple *first_stmt,
+		       vec<tree> operands)
+{
+  vec_info *vinfo = vinfo_for_stmt (first_stmt)->vinfo;
+  for (unsigned int i = 0; i < group_size; i++)
+    {
+      tree op = operands[i];
+      enum vect_def_type dt;
+      gimple *def_stmt;
+      vect_is_simple_use (op, vinfo, &def_stmt, &dt);
+      operands[i] = vect_get_vec_def_for_stmt_copy (dt, op);
+    }
+}
+
+/* Emit one copy of a vectorized LOAD_LANES for STMT.  GROUP_SIZE is
+   the number of vectors being loaded and VECTYPE is the type of each
+   vector.  AGGR_TYPE is the type that should be used to refer to the
+   memory source (which contains the same number of elements as
+   GROUP_SIZE copies of VECTYPE, but in a different order).
+   DATAREF_PTR points to the first element that should be loaded.
+   ALIAS_PTR_TYPE is the type of the accessed elements for aliasing
+   purposes.  MASK, if nonnull, is a mask in which element I is true
+   if element I of each destination vector should be loaded.  */
+
+static void
+do_load_lanes (gimple *stmt, gimple_stmt_iterator *gsi,
+	       unsigned int group_size, tree vectype, tree aggr_type,
+	       tree dataref_ptr, tree alias_ptr_type, tree mask)
+{
+  tree scalar_dest = gimple_get_lhs (stmt);
+  tree vec_array = create_vector_array (vectype, group_size);
+
+  gcall *new_stmt;
+  if (mask)
+    {
+      /* Emit: VEC_ARRAY = MASK_LOAD_LANES (DATAREF_PTR, ALIAS_PTR, MASK).  */
+      tree alias_ptr = build_int_cst (alias_ptr_type,
+				      TYPE_ALIGN_UNIT (TREE_TYPE (vectype)));
+      new_stmt = gimple_build_call_internal (IFN_MASK_LOAD_LANES, 3,
+					     dataref_ptr, alias_ptr, mask);
+    }
+  else
+    {
+      /* Emit: VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
+      tree data_ref = create_array_ref (aggr_type, dataref_ptr,
+					alias_ptr_type);
+      new_stmt = gimple_build_call_internal (IFN_LOAD_LANES, 1, data_ref);
+    }
+  gimple_call_set_lhs (new_stmt, vec_array);
+  gimple_call_set_nothrow (new_stmt, true);
+  vect_finish_stmt_generation (stmt, new_stmt, gsi);
+
+  /* Extract each vector into an SSA_NAME.  */
+  auto_vec<tree, 16> dr_chain;
+  dr_chain.reserve (group_size);
+  for (unsigned int i = 0; i < group_size; i++)
+    {
+      tree new_temp = read_vector_array (stmt, gsi, scalar_dest, vec_array, i);
+      dr_chain.quick_push (new_temp);
+    }
+
+  /* Record the mapping between SSA_NAMEs and statements.  */
+  vect_record_grouped_load_vectors (stmt, dr_chain);
+}
+
+/* Emit one copy of a vectorized STORE_LANES for STMT.  GROUP_SIZE is
+   the number of vectors being stored and OPERANDS[I] is the value
+   that group member I should store.  AGGR_TYPE is the type that should
+   be used to refer to the memory destination (which contains the same
+   number of elements as the source vectors, but in a different order).
+   DATAREF_PTR points to the first store location.  ALIAS_PTR_TYPE is
+   the type of the accessed elements for aliasing purposes.  MASK,
+   if nonnull, is a mask in which element I is true if element I of
+   each source vector should be stored.  */
+
+static gimple *
+do_store_lanes (gimple *stmt, gimple_stmt_iterator *gsi,
+		unsigned int group_size, tree aggr_type, tree dataref_ptr,
+		tree alias_ptr_type, vec<tree> operands, tree mask)
+{
+  /* Combine all the vectors into an array.  */
+  tree vectype = TREE_TYPE (operands[0]);
+  tree vec_array = create_vector_array (vectype, group_size);
+  for (unsigned int i = 0; i < group_size; i++)
+    write_vector_array (stmt, gsi, operands[i], vec_array, i);
+
+  gcall *new_stmt;
+  if (mask)
+    {
+      /* Emit: MASK_STORE_LANES (DATAREF_PTR, ALIAS_PTR, MASK, VEC_ARRAY).  */
+      tree alias_ptr = build_int_cst (alias_ptr_type,
+				      TYPE_ALIGN_UNIT (TREE_TYPE (vectype)));
+      new_stmt = gimple_build_call_internal (IFN_MASK_STORE_LANES, 4,
+					     dataref_ptr, alias_ptr,
+					     mask, vec_array);
+    }
+  else
+    {
+      /* Emit: MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
+      tree data_ref = create_array_ref (aggr_type, dataref_ptr, alias_ptr_type);
+      new_stmt = gimple_build_call_internal (IFN_STORE_LANES, 1, vec_array);
+      gimple_call_set_lhs (new_stmt, data_ref);
+    }
+  gimple_call_set_nothrow (new_stmt, true);
+  vect_finish_stmt_generation (stmt, new_stmt, gsi);
+  return new_stmt;
+}
+
+/* Return the alias pointer type for the group of masked loads or
+   stores starting at FIRST_STMT.  */
+
+static tree
+get_masked_group_alias_ptr_type (gimple *first_stmt)
+{
+  tree type, next_type;
+  gimple *next_stmt;
+
+  type = TREE_TYPE (gimple_call_arg (first_stmt, 1));
+  next_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (first_stmt));
+  while (next_stmt)
+    {
+      next_type = TREE_TYPE (gimple_call_arg (next_stmt, 1));
+      if (get_alias_set (type) != get_alias_set (next_type))
+	{
+	  if (dump_enabled_p ())
+	    dump_printf_loc (MSG_NOTE, vect_location,
+			     "conflicting alias set types.\n");
+	  return ptr_type_node;
+	}
+      next_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (next_stmt));
+    }
+  return type;
+}
+
 /* Function vectorizable_mask_load_store.
 
    Check if STMT performs a conditional load or store that can be vectorized.
@@ -2053,6 +2288,7 @@  vectorizable_mask_load_store (gimple *st
   tree rhs_vectype = NULL_TREE;
   tree mask_vectype;
   tree elem_type;
+  tree aggr_type;
   gimple *new_stmt;
   tree dummy;
   tree dataref_ptr = NULL_TREE;
@@ -2066,6 +2302,8 @@  vectorizable_mask_load_store (gimple *st
   tree mask;
   gimple *def_stmt;
   enum vect_def_type dt;
+  gimple *first_stmt = stmt;
+  unsigned int group_size = 1;
 
   if (slp_node != NULL)
     return false;
@@ -2127,7 +2365,7 @@  vectorizable_mask_load_store (gimple *st
     vls_type = VLS_LOAD;
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, false, vls_type, ncopies,
+  if (!get_load_store_type (stmt, vectype, false, true, vls_type, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
@@ -2144,7 +2382,18 @@  vectorizable_mask_load_store (gimple *st
 	  return false;
 	}
     }
-  else if (memory_access_type != VMAT_CONTIGUOUS)
+  else if (rhs_vectype
+	   && !useless_type_conversion_p (vectype, rhs_vectype))
+    return false;
+  else if (memory_access_type == VMAT_CONTIGUOUS)
+    {
+      if (!VECTOR_MODE_P (TYPE_MODE (vectype))
+	  || !can_vec_mask_load_store_p (TYPE_MODE (vectype),
+					 TYPE_MODE (mask_vectype),
+					 vls_type == VLS_LOAD))
+	return false;
+    }
+  else if (memory_access_type != VMAT_LOAD_STORE_LANES)
     {
       if (dump_enabled_p ())
 	dump_printf_loc (MSG_MISSED_OPTIMIZATION, vect_location,
@@ -2152,13 +2401,6 @@  vectorizable_mask_load_store (gimple *st
 			 vls_type == VLS_LOAD ? "load" : "store");
       return false;
     }
-  else if (!VECTOR_MODE_P (TYPE_MODE (vectype))
-	   || !can_vec_mask_load_store_p (TYPE_MODE (vectype),
-					  TYPE_MODE (mask_vectype),
-					  vls_type == VLS_LOAD)
-	   || (rhs_vectype
-	       && !useless_type_conversion_p (vectype, rhs_vectype)))
-    return false;
 
   if (!vec_stmt) /* transformation not required.  */
     {
@@ -2176,6 +2418,14 @@  vectorizable_mask_load_store (gimple *st
 
   /* Transform.  */
 
+  if (STMT_VINFO_GROUPED_ACCESS (stmt_info))
+    {
+      first_stmt = GROUP_FIRST_ELEMENT (stmt_info);
+      group_size = GROUP_SIZE (vinfo_for_stmt (first_stmt));
+      if (vls_type != VLS_LOAD)
+	GROUP_STORE_COUNT (vinfo_for_stmt (first_stmt))++;
+    }
+
   if (memory_access_type == VMAT_GATHER_SCATTER)
     {
       tree vec_oprnd0 = NULL_TREE, op;
@@ -2343,23 +2593,28 @@  vectorizable_mask_load_store (gimple *st
 	  prev_stmt_info = vinfo_for_stmt (new_stmt);
 	}
 
-      /* Ensure that even with -fno-tree-dce the scalar MASK_LOAD is removed
-	 from the IL.  */
-      if (STMT_VINFO_RELATED_STMT (stmt_info))
-	{
-	  stmt = STMT_VINFO_RELATED_STMT (stmt_info);
-	  stmt_info = vinfo_for_stmt (stmt);
-	}
-      tree lhs = gimple_call_lhs (stmt);
-      new_stmt = gimple_build_assign (lhs, build_zero_cst (TREE_TYPE (lhs)));
-      set_vinfo_for_stmt (new_stmt, stmt_info);
-      set_vinfo_for_stmt (stmt, NULL);
-      STMT_VINFO_STMT (stmt_info) = new_stmt;
-      gsi_replace (gsi, new_stmt, true);
+      replace_mask_load (stmt, gsi);
       return true;
     }
-  else if (vls_type != VLS_LOAD)
+
+  if (memory_access_type == VMAT_LOAD_STORE_LANES)
+    aggr_type = build_array_type_nelts (elem_type, group_size * nunits);
+  else
+    aggr_type = vectype;
+
+  if (vls_type != VLS_LOAD)
     {
+      /* Vectorize the whole group when we reach the final statement.
+	 Replace all other statements with an empty sequence.  */
+      if (STMT_VINFO_GROUPED_ACCESS (stmt_info)
+	  && (GROUP_STORE_COUNT (vinfo_for_stmt (first_stmt))
+	      < GROUP_SIZE (vinfo_for_stmt (first_stmt))))
+	{
+	  *vec_stmt = NULL;
+	  return true;
+	}
+
+      auto_vec<tree, 16> operands;
       tree vec_rhs = NULL_TREE, vec_mask = NULL_TREE;
       prev_stmt_info = NULL;
       LOOP_VINFO_HAS_MASK_STORE (loop_vinfo) = true;
@@ -2369,48 +2624,62 @@  vectorizable_mask_load_store (gimple *st
 
 	  if (i == 0)
 	    {
-	      tree rhs = gimple_call_arg (stmt, 3);
-	      vec_rhs = vect_get_vec_def_for_operand (rhs, stmt);
+	      init_stored_values (group_size, first_stmt, &operands);
+	      vec_rhs = operands[0];
 	      vec_mask = vect_get_vec_def_for_operand (mask, stmt,
 						       mask_vectype);
-	      /* We should have catched mismatched types earlier.  */
+	      /* We should have caught mismatched types earlier.  */
 	      gcc_assert (useless_type_conversion_p (vectype,
 						     TREE_TYPE (vec_rhs)));
-	      dataref_ptr = vect_create_data_ref_ptr (stmt, vectype, NULL,
-						      NULL_TREE, &dummy, gsi,
-						      &ptr_incr, false, &inv_p);
+	      dataref_ptr = vect_create_data_ref_ptr (first_stmt, aggr_type,
+						      NULL, NULL_TREE, &dummy,
+						      gsi, &ptr_incr, false,
+						      &inv_p);
 	      gcc_assert (!inv_p);
 	    }
 	  else
 	    {
-	      vect_is_simple_use (vec_rhs, loop_vinfo, &def_stmt, &dt);
-	      vec_rhs = vect_get_vec_def_for_stmt_copy (dt, vec_rhs);
+	      advance_stored_values (group_size, first_stmt, operands);
+	      vec_rhs = operands[0];
 	      vect_is_simple_use (vec_mask, loop_vinfo, &def_stmt, &dt);
 	      vec_mask = vect_get_vec_def_for_stmt_copy (dt, vec_mask);
-	      dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr, gsi, stmt,
-					     TYPE_SIZE_UNIT (vectype));
+	      dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr,
+					     gsi, first_stmt,
+					     TYPE_SIZE_UNIT (aggr_type));
 	    }
 
-	  align = DR_TARGET_ALIGNMENT (dr);
-	  if (aligned_access_p (dr))
-	    misalign = 0;
-	  else if (DR_MISALIGNMENT (dr) == -1)
+	  if (memory_access_type == VMAT_LOAD_STORE_LANES)
 	    {
-	      align = TYPE_ALIGN_UNIT (elem_type);
-	      misalign = 0;
+	      tree ref_type = get_masked_group_alias_ptr_type (first_stmt);
+	      new_stmt = do_store_lanes (stmt, gsi, group_size, aggr_type,
+					 dataref_ptr, ref_type, operands,
+					 vec_mask);
 	    }
 	  else
-	    misalign = DR_MISALIGNMENT (dr);
-	  set_ptr_info_alignment (get_ptr_info (dataref_ptr), align,
-				  misalign);
-	  tree ptr = build_int_cst (TREE_TYPE (gimple_call_arg (stmt, 1)),
-				    misalign ? least_bit_hwi (misalign) : align);
-	  gcall *call
-	    = gimple_build_call_internal (IFN_MASK_STORE, 4, dataref_ptr,
-					  ptr, vec_mask, vec_rhs);
-	  gimple_call_set_nothrow (call, true);
-	  new_stmt = call;
-	  vect_finish_stmt_generation (stmt, new_stmt, gsi);
+	    {
+	      align = DR_TARGET_ALIGNMENT (dr);
+	      if (aligned_access_p (dr))
+		misalign = 0;
+	      else if (DR_MISALIGNMENT (dr) == -1)
+		{
+		  align = TYPE_ALIGN_UNIT (elem_type);
+		  misalign = 0;
+		}
+	      else
+		misalign = DR_MISALIGNMENT (dr);
+	      set_ptr_info_alignment (get_ptr_info (dataref_ptr), align,
+				      misalign);
+	      tree ptr = build_int_cst (TREE_TYPE (gimple_call_arg (stmt, 1)),
+					misalign
+					? least_bit_hwi (misalign)
+					: align);
+	      gcall *call
+		= gimple_build_call_internal (IFN_MASK_STORE, 4, dataref_ptr,
+					      ptr, vec_mask, vec_rhs);
+	      gimple_call_set_nothrow (call, true);
+	      new_stmt = call;
+	      vect_finish_stmt_generation (stmt, new_stmt, gsi);
+	    }
 	  if (i == 0)
 	    STMT_VINFO_VEC_STMT (stmt_info) = *vec_stmt = new_stmt;
 	  else
@@ -2420,73 +2689,88 @@  vectorizable_mask_load_store (gimple *st
     }
   else
     {
+      /* Vectorize the whole group when we reach the first statement.
+	 For later statements we just need to return the cached
+	 replacement.  */
+      if (group_size > 1
+	  && STMT_VINFO_VEC_STMT (vinfo_for_stmt (first_stmt)))
+	{
+	  *vec_stmt = STMT_VINFO_VEC_STMT (stmt_info);
+	  replace_mask_load (stmt, gsi);
+	  return true;
+	}
+
       tree vec_mask = NULL_TREE;
       prev_stmt_info = NULL;
-      vec_dest = vect_create_destination_var (gimple_call_lhs (stmt), vectype);
+      if (memory_access_type == VMAT_LOAD_STORE_LANES)
+	vec_dest = NULL_TREE;
+      else
+	vec_dest = vect_create_destination_var (gimple_call_lhs (stmt),
+						vectype);
       for (i = 0; i < ncopies; i++)
 	{
 	  unsigned align, misalign;
 
 	  if (i == 0)
 	    {
+	      gcc_assert (mask == gimple_call_arg (first_stmt, 2));
 	      vec_mask = vect_get_vec_def_for_operand (mask, stmt,
 						       mask_vectype);
-	      dataref_ptr = vect_create_data_ref_ptr (stmt, vectype, NULL,
-						      NULL_TREE, &dummy, gsi,
-						      &ptr_incr, false, &inv_p);
+	      dataref_ptr = vect_create_data_ref_ptr (first_stmt, aggr_type,
+						      NULL, NULL_TREE, &dummy,
+						      gsi, &ptr_incr, false,
+						      &inv_p);
 	      gcc_assert (!inv_p);
 	    }
 	  else
 	    {
 	      vect_is_simple_use (vec_mask, loop_vinfo, &def_stmt, &dt);
 	      vec_mask = vect_get_vec_def_for_stmt_copy (dt, vec_mask);
-	      dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr, gsi, stmt,
-					     TYPE_SIZE_UNIT (vectype));
+	      dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr,
+					     gsi, first_stmt,
+					     TYPE_SIZE_UNIT (aggr_type));
 	    }
 
-	  align = DR_TARGET_ALIGNMENT (dr);
-	  if (aligned_access_p (dr))
-	    misalign = 0;
-	  else if (DR_MISALIGNMENT (dr) == -1)
+	  if (memory_access_type == VMAT_LOAD_STORE_LANES)
 	    {
-	      align = TYPE_ALIGN_UNIT (elem_type);
-	      misalign = 0;
+	      tree ref_type = get_masked_group_alias_ptr_type (first_stmt);
+	      do_load_lanes (stmt, gsi, group_size, vectype,
+			     aggr_type, dataref_ptr, ref_type, vec_mask);
+	      *vec_stmt = STMT_VINFO_VEC_STMT (stmt_info);
 	    }
 	  else
-	    misalign = DR_MISALIGNMENT (dr);
-	  set_ptr_info_alignment (get_ptr_info (dataref_ptr), align,
-				  misalign);
-	  tree ptr = build_int_cst (TREE_TYPE (gimple_call_arg (stmt, 1)),
-				    misalign ? least_bit_hwi (misalign) : align);
-	  gcall *call
-	    = gimple_build_call_internal (IFN_MASK_LOAD, 3, dataref_ptr,
-					  ptr, vec_mask);
-	  gimple_call_set_lhs (call, make_ssa_name (vec_dest));
-	  gimple_call_set_nothrow (call, true);
-	  vect_finish_stmt_generation (stmt, call, gsi);
-	  if (i == 0)
-	    STMT_VINFO_VEC_STMT (stmt_info) = *vec_stmt = call;
-	  else
-	    STMT_VINFO_RELATED_STMT (prev_stmt_info) = call;
-	  prev_stmt_info = vinfo_for_stmt (call);
+	    {
+	      align = DR_TARGET_ALIGNMENT (dr);
+	      if (aligned_access_p (dr))
+		misalign = 0;
+	      else if (DR_MISALIGNMENT (dr) == -1)
+		{
+		  align = TYPE_ALIGN_UNIT (elem_type);
+		  misalign = 0;
+		}
+	      else
+		misalign = DR_MISALIGNMENT (dr);
+	      set_ptr_info_alignment (get_ptr_info (dataref_ptr), align,
+				      misalign);
+	      tree ptr = build_int_cst (TREE_TYPE (gimple_call_arg (stmt, 1)),
+					misalign
+					? least_bit_hwi (misalign)
+					: align);
+	      gcall *call
+		= gimple_build_call_internal (IFN_MASK_LOAD, 3, dataref_ptr,
+					      ptr, vec_mask);
+	      gimple_call_set_lhs (call, make_ssa_name (vec_dest));
+	      gimple_call_set_nothrow (call, true);
+	      vect_finish_stmt_generation (stmt, call, gsi);
+	      if (i == 0)
+		STMT_VINFO_VEC_STMT (stmt_info) = *vec_stmt = call;
+	      else
+		STMT_VINFO_RELATED_STMT (prev_stmt_info) = call;
+	      prev_stmt_info = vinfo_for_stmt (call);
+	    }
 	}
-    }
 
-  if (vls_type == VLS_LOAD)
-    {
-      /* Ensure that even with -fno-tree-dce the scalar MASK_LOAD is removed
-	 from the IL.  */
-      if (STMT_VINFO_RELATED_STMT (stmt_info))
-	{
-	  stmt = STMT_VINFO_RELATED_STMT (stmt_info);
-	  stmt_info = vinfo_for_stmt (stmt);
-	}
-      tree lhs = gimple_call_lhs (stmt);
-      new_stmt = gimple_build_assign (lhs, build_zero_cst (TREE_TYPE (lhs)));
-      set_vinfo_for_stmt (new_stmt, stmt_info);
-      set_vinfo_for_stmt (stmt, NULL);
-      STMT_VINFO_STMT (stmt_info) = new_stmt;
-      gsi_replace (gsi, new_stmt, true);
+      replace_mask_load (stmt, gsi);
     }
 
   return true;
@@ -5818,7 +6102,7 @@  vectorizable_store (gimple *stmt, gimple
     return false;
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, vls_type, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, false, vls_type, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
@@ -6353,34 +6637,21 @@  vectorizable_store (gimple *stmt, gimple
               vec_oprnd = vec_oprnds[0];
             }
           else
-            {
-	      /* For interleaved stores we collect vectorized defs for all the
-		 stores in the group in DR_CHAIN and OPRNDS. DR_CHAIN is then
-		 used as an input to vect_permute_store_chain(), and OPRNDS as
-		 an input to vect_get_vec_def_for_stmt_copy() for the next copy.
-
-		 If the store is not grouped, GROUP_SIZE is 1, and DR_CHAIN and
-		 OPRNDS are of size 1.  */
-	      next_stmt = first_stmt;
-	      for (i = 0; i < group_size; i++)
-		{
-		  /* Since gaps are not supported for interleaved stores,
-		     GROUP_SIZE is the exact number of stmts in the chain.
-		     Therefore, NEXT_STMT can't be NULL_TREE.  In case that
-		     there is no interleaving, GROUP_SIZE is 1, and only one
-		     iteration of the loop will be executed.  */
-		  gcc_assert (next_stmt
-			      && gimple_assign_single_p (next_stmt));
-		  op = gimple_assign_rhs1 (next_stmt);
-
-		  vec_oprnd = vect_get_vec_def_for_operand (op, next_stmt);
-		  dr_chain.quick_push (vec_oprnd);
-		  oprnds.quick_push (vec_oprnd);
-		  next_stmt = GROUP_NEXT_ELEMENT (vinfo_for_stmt (next_stmt));
-		}
+	    {
+	      /* For interleaved stores we collect vectorized defs
+		 for all the stores in the group in DR_CHAIN and OPRNDS.
+		 DR_CHAIN is then used as an input to
+		 vect_permute_store_chain(), and OPRNDS as an input to
+		 vect_get_vec_def_for_stmt_copy() for the next copy.
+
+		 If the store is not grouped, GROUP_SIZE is 1, and DR_CHAIN
+		 and OPRNDS are of size 1.  */
+	      init_stored_values (group_size, first_stmt, &oprnds);
+	      dr_chain.safe_splice (oprnds);
+	      vec_oprnd = oprnds[0];
 	    }
 
-	  /* We should have catched mismatched types earlier.  */
+	  /* We should have caught mismatched types earlier.  */
 	  gcc_assert (useless_type_conversion_p (vectype,
 						 TREE_TYPE (vec_oprnd)));
 	  bool simd_lane_access_p
@@ -6414,14 +6685,10 @@  vectorizable_store (gimple *stmt, gimple
 	     next copy.
 	     If the store is not grouped, GROUP_SIZE is 1, and DR_CHAIN and
 	     OPRNDS are of size 1.  */
-	  for (i = 0; i < group_size; i++)
-	    {
-	      op = oprnds[i];
-	      vect_is_simple_use (op, vinfo, &def_stmt, &dt);
-	      vec_oprnd = vect_get_vec_def_for_stmt_copy (dt, op);
-	      dr_chain[i] = vec_oprnd;
-	      oprnds[i] = vec_oprnd;
-	    }
+	  advance_stored_values (group_size, first_stmt, oprnds);
+	  dr_chain.truncate (0);
+	  dr_chain.splice (oprnds);
+	  vec_oprnd = oprnds[0];
 	  if (dataref_offset)
 	    dataref_offset
 	      = int_const_binop (PLUS_EXPR, dataref_offset,
@@ -6432,27 +6699,8 @@  vectorizable_store (gimple *stmt, gimple
 	}
 
       if (memory_access_type == VMAT_LOAD_STORE_LANES)
-	{
-	  tree vec_array;
-
-	  /* Combine all the vectors into an array.  */
-	  vec_array = create_vector_array (vectype, vec_num);
-	  for (i = 0; i < vec_num; i++)
-	    {
-	      vec_oprnd = dr_chain[i];
-	      write_vector_array (stmt, gsi, vec_oprnd, vec_array, i);
-	    }
-
-	  /* Emit:
-	       MEM_REF[...all elements...] = STORE_LANES (VEC_ARRAY).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_STORE_LANES, 1,
-						    vec_array);
-	  gimple_call_set_lhs (call, data_ref);
-	  gimple_call_set_nothrow (call, true);
-	  new_stmt = call;
-	  vect_finish_stmt_generation (stmt, new_stmt, gsi);
-	}
+	new_stmt = do_store_lanes (stmt, gsi, vec_num, aggr_type,
+				   dataref_ptr, ref_type, dr_chain, NULL_TREE);
       else
 	{
 	  new_stmt = NULL;
@@ -6859,7 +7107,7 @@  vectorizable_load (gimple *stmt, gimple_
     }
 
   vect_memory_access_type memory_access_type;
-  if (!get_load_store_type (stmt, vectype, slp, VLS_LOAD, ncopies,
+  if (!get_load_store_type (stmt, vectype, slp, false, VLS_LOAD, ncopies,
 			    &memory_access_type, &gs_info))
     return false;
 
@@ -7553,32 +7801,8 @@  vectorizable_load (gimple *stmt, gimple_
 	dr_chain.create (vec_num);
 
       if (memory_access_type == VMAT_LOAD_STORE_LANES)
-	{
-	  tree vec_array;
-
-	  vec_array = create_vector_array (vectype, vec_num);
-
-	  /* Emit:
-	       VEC_ARRAY = LOAD_LANES (MEM_REF[...all elements...]).  */
-	  data_ref = create_array_ref (aggr_type, dataref_ptr, ref_type);
-	  gcall *call = gimple_build_call_internal (IFN_LOAD_LANES, 1,
-						    data_ref);
-	  gimple_call_set_lhs (call, vec_array);
-	  gimple_call_set_nothrow (call, true);
-	  new_stmt = call;
-	  vect_finish_stmt_generation (stmt, new_stmt, gsi);
-
-	  /* Extract each vector into an SSA_NAME.  */
-	  for (i = 0; i < vec_num; i++)
-	    {
-	      new_temp = read_vector_array (stmt, gsi, scalar_dest,
-					    vec_array, i);
-	      dr_chain.quick_push (new_temp);
-	    }
-
-	  /* Record the mapping between SSA_NAMEs and statements.  */
-	  vect_record_grouped_load_vectors (stmt, dr_chain);
-	}
+	do_load_lanes (stmt, gsi, group_size, vectype, aggr_type,
+		       dataref_ptr, ref_type, NULL_TREE);
       else
 	{
 	  for (i = 0; i < vec_num; i++)
@@ -8907,7 +9131,16 @@  vect_transform_stmt (gimple *stmt, gimpl
       done = vectorizable_call (stmt, gsi, &vec_stmt, slp_node);
       stmt = gsi_stmt (*gsi);
       if (gimple_call_internal_p (stmt, IFN_MASK_STORE))
-	is_store = true;
+	{
+	  gcc_assert (!slp_node);
+	  /* As with normal stores, we vectorize the whole group when
+	     we reach the last call in the group.  The other calls in
+	     the group are left with a null VEC_STMT.  */
+	  if (STMT_VINFO_GROUPED_ACCESS (stmt_info))
+	    *grouped_store = true;
+	  if (STMT_VINFO_VEC_STMT (stmt_info))
+	    is_store = true;
+	}
       break;
 
     case call_simd_clone_vec_info_type:
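
(Illustration, not part of the patch: how a two-element group of masked
stores flows through the new code.  The first IFN_MASK_STORE in the group
only records its vectorized operands and leaves VEC_STMT null; the whole
group is emitted once vectorizable_mask_load_store reaches the last member.
All names below are invented for the example.)

    /* If-converted scalar statements (group of two masked stores).  */
    .MASK_STORE (&dest[i * 2], 4B, mask, val0);      /* counted, no vector stmt yet */
    .MASK_STORE (&dest[i * 2 + 1], 4B, mask, val1);  /* last member: vectorize group */

    /* Vectorized replacement emitted at the last member.  */
    vect_array.5[0] = vect_val0;
    vect_array.5[1] = vect_val1;
    .MASK_STORE_LANES (vectp_dest, 4B, vect_mask, vect_array.5);
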
Index: gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.dg/vect/vect-ooo-group-1.c	2017-11-08 16:35:04.763816035 +0000
@@ -0,0 +1,12 @@ 
+/* { dg-do compile } */
+
+void
+f (int *restrict a, int *restrict b, int *restrict c)
+{
+  for (int i = 0; i < 100; ++i)
+    if (c[i])
+      {
+	a[i * 2] = b[i * 5 + 2];
+	a[i * 2 + 1] = b[i * 5];
+      }
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1.c	2017-11-08 16:35:04.763816035 +0000
@@ -0,0 +1,67 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2] + src[i * 2 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_1_run.c	2017-11-08 16:35:04.763816035 +0000
@@ -0,0 +1,38 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 2];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 2; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_2 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = in[i * 2] + in[i * 2 + 1];	\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,69 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 3]					\
+		   + src[i * 3 + 1]				\
+		   + src[i * 3 + 2]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_2_run.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,40 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 3];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 3; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_3 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 3]			\
+			   + in[i * 3 + 1]		\
+			   + in[i * 3 + 2]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,70 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = (src[i * 4]					\
+		   + src[i * 4 + 1]				\
+		   + src[i * 4 + 2]				\
+		   + src[i * 4 + 3]);				\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_3_run.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,41 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_load_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)	\
+  {							\
+    OUTTYPE out[N];					\
+    INTYPE in[N * 4];					\
+    MASKTYPE mask[N];					\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	out[i] = i * 7 / 2;				\
+	mask[i] = i % 5 <= i % 3;			\
+	asm volatile ("" ::: "memory");			\
+      }							\
+    for (int i = 0; i < N * 4; ++i)			\
+      in[i] = i * 9 / 2;				\
+    NAME##_4 (out, in, mask, N);			\
+    for (int i = 0; i < N; ++i)				\
+      {							\
+	OUTTYPE if_true = (in[i * 4]			\
+			   + in[i * 4 + 1]		\
+			   + in[i * 4 + 2]		\
+			   + in[i * 4 + 3]);		\
+	OUTTYPE if_false = i * 7 / 2;			\
+	if (out[i] != (mask[i] ? if_true : if_false))	\
+	  __builtin_abort ();				\
+	asm volatile ("" ::: "memory");			\
+      }							\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_4.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_4.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,67 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_5.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_5.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,67 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 3];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tld4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    Out  8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tld4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_6.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_6.c	2017-11-08 16:35:04.766569934 +0000
@@ -0,0 +1,40 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 2];					\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
+/* { dg-final { scan-assembler-not {\tld2b\t} } } */
+/* { dg-final { scan-assembler-not {\tld2h\t} } } */
+/* { dg-final { scan-assembler-not {\tld2w\t} } } */
+/* { dg-final { scan-assembler-not {\tld2d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_7.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_7.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,40 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 3] + src[i * 3 + 1];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
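+/* Element 2 of each three-element group is never read, so the group
+   has a gap and no ld3 is expected.  */
+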
+/* { dg-final { scan-assembler-not {\tld3b\t} } } */
+/* { dg-final { scan-assembler-not {\tld3h\t} } } */
+/* { dg-final { scan-assembler-not {\tld3w\t} } } */
+/* { dg-final { scan-assembler-not {\tld3d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_8.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_load_8.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,40 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	dest[i] = src[i * 4] + src[i * 4 + 2];			\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
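+/* Elements 1 and 3 of each four-element group are never read, so the
+   group has gaps and no ld4 is expected.  */
+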
+/* { dg-final { scan-assembler-not {\tld4b\t} } } */
+/* { dg-final { scan-assembler-not {\tld4h\t} } } */
+/* { dg-final { scan-assembler-not {\tld4w\t} } } */
+/* { dg-final { scan-assembler-not {\tld4d\t} } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,70 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	{							\
+	  dest[i * 2] = src[i];					\
+	  dest[i * 2 + 1] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
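+/* Each expected st2 count below is the sum of the corresponding table
+   entries, with rows marked "x2" counted twice (once for each element
+   type of that size).  */
+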
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst2w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst2d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_1_run.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,38 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_1.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 2];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 2; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_2 (out, in, mask, N);				\
+    for (int i = 0; i < N * 2; ++i)				\
+      {								\
+	OUTTYPE if_true = in[i / 2];				\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 2] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,71 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_3 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	{							\
+	  dest[i * 3] = src[i];					\
+	  dest[i * 3 + 1] = src[i];				\
+	  dest[i * 3 + 2] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
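+/* Each expected st3 count below is the sum of the corresponding table
+   entries; "x2" rows are counted twice, once per element type of that
+   size.  */
+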
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for _Float16)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst3w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst3d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_2_run.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,38 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_2.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 3];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 3; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_3 (out, in, mask, N);				\
+    for (int i = 0; i < N * 3; ++i)				\
+      {								\
+	OUTTYPE if_true = in[i / 3];				\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 3] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,72 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -fno-vect-cost-model -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_4 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      if (cond[i])						\
+	{							\
+	  dest[i * 4] = src[i];					\
+	  dest[i * 4 + 1] = src[i];				\
+	  dest[i * 4 + 2] = src[i];				\
+	  dest[i * 4 + 3] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
+
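+/* Each expected st4 count below is the sum of the corresponding table
+   entries, with "x2" rows counted twice (one copy per element type of
+   that size).  */
+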
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  1  1  1  1
+        16 |  1  1  1  1
+        32 |  1  1  1  1
+        64 |  1  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4b\t.z[0-9]} 16 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  2  2  2  2
+        16 |  2  1  1  1 x2 (for half float)
+        32 |  2  1  1  1
+        64 |  2  1  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4h\t.z[0-9]} 28 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  4  4  4  4
+        16 |  4  2  2  2
+        32 |  4  2  1  1 x2 (for float)
+        64 |  4  2  1  1.  */
+/* { dg-final { scan-assembler-times {\tst4w\t.z[0-9]} 50 } } */
+
+/*    Mask |  8 16 32 64
+    -------+------------
+    In   8 |  8  8  8  8
+        16 |  8  4  4  4
+        32 |  8  4  2  2
+        64 |  8  4  2  1 x2 (for double).  */
+/* { dg-final { scan-assembler-times {\tst4d\t.z[0-9]} 98 } } */
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3_run.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_3_run.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,38 @@ 
+/* { dg-do run { target aarch64_sve_hw } } */
+/* { dg-options "-O2 -ftree-vectorize -fno-tree-dce -ffast-math -march=armv8-a+sve" } */
+
+#include "sve_mask_struct_store_3.c"
+
+#define N 100
+
+#undef TEST_LOOP
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  {								\
+    OUTTYPE out[N * 4];						\
+    INTYPE in[N];						\
+    MASKTYPE mask[N];						\
+    for (int i = 0; i < N; ++i)					\
+      {								\
+	in[i] = i * 7 / 2;					\
+	mask[i] = i % 5 <= i % 3;				\
+	asm volatile ("" ::: "memory");				\
+      }								\
+    for (int i = 0; i < N * 4; ++i)				\
+      out[i] = i * 9 / 2;					\
+    NAME##_4 (out, in, mask, N);				\
+    for (int i = 0; i < N * 4; ++i)				\
+      {								\
+	OUTTYPE if_true = in[i / 4];				\
+	OUTTYPE if_false = i * 9 / 2;				\
+	if (out[i] != (mask[i / 4] ? if_true : if_false))	\
+	  __builtin_abort ();					\
+	asm volatile ("" ::: "memory");				\
+      }								\
+  }
+
+int __attribute__ ((optimize (1)))
+main (void)
+{
+  TEST (test);
+  return 0;
+}
Index: gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_4.c
===================================================================
--- /dev/null	2017-11-08 11:04:45.353113300 +0000
+++ gcc/testsuite/gcc.target/aarch64/sve_mask_struct_store_4.c	2017-11-08 16:35:04.767487900 +0000
@@ -0,0 +1,44 @@ 
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffast-math -march=armv8-a+sve" } */
+
+#define TEST_LOOP(NAME, OUTTYPE, INTYPE, MASKTYPE)		\
+  void __attribute__ ((noinline, noclone))			\
+  NAME##_2 (OUTTYPE *__restrict dest, INTYPE *__restrict src,	\
+	    MASKTYPE *__restrict cond, int n)			\
+  {								\
+    for (int i = 0; i < n; ++i)					\
+      {								\
+	if (cond[i] < 8)					\
+	  dest[i * 2] = src[i];					\
+	if (cond[i] > 2)					\
+	  dest[i * 2 + 1] = src[i];				\
+	}							\
+  }
+
+#define TEST2(NAME, OUTTYPE, INTYPE) \
+  TEST_LOOP (NAME##_i8, OUTTYPE, INTYPE, signed char) \
+  TEST_LOOP (NAME##_i16, OUTTYPE, INTYPE, unsigned short) \
+  TEST_LOOP (NAME##_f32, OUTTYPE, INTYPE, float) \
+  TEST_LOOP (NAME##_f64, OUTTYPE, INTYPE, double)
+
+#define TEST1(NAME, OUTTYPE) \
+  TEST2 (NAME##_i8, OUTTYPE, signed char) \
+  TEST2 (NAME##_i16, OUTTYPE, unsigned short) \
+  TEST2 (NAME##_i32, OUTTYPE, int) \
+  TEST2 (NAME##_i64, OUTTYPE, unsigned long)
+
+#define TEST(NAME) \
+  TEST1 (NAME##_i8, signed char) \
+  TEST1 (NAME##_i16, unsigned short) \
+  TEST1 (NAME##_i32, int) \
+  TEST1 (NAME##_i64, unsigned long) \
+  TEST2 (NAME##_f16_f16, _Float16, _Float16) \
+  TEST2 (NAME##_f32_f32, float, float) \
+  TEST2 (NAME##_f64_f64, double, double)
+
+TEST (test)
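+/* The two stores in each group are controlled by different conditions,
+   so they cannot share one mask and no st2 is expected.  */
+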
+
+/* { dg-final { scan-assembler-not {\tst2b\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2h\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2w\t.z[0-9]} } } */
+/* { dg-final { scan-assembler-not {\tst2d\t.z[0-9]} } } */