From patchwork Wed Jun 21 06:25:52 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 1797661 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=8.43.85.97; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: legolas.ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=AHR6UQUE; dkim-atps=neutral Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4QmD6N6y1Zz20Wk for ; Wed, 21 Jun 2023 16:26:20 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id F09913858D1E for ; Wed, 21 Jun 2023 06:26:18 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org F09913858D1E DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687328779; bh=tBTQyBHDfdKurtv2xLjhbYT6yt6F4IC9iBA2co03Dec=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=AHR6UQUE9cPqI9Tw42DNF4Vpl4B8nHj3LuNFF+EN/bm8QGaVl31z6rRvP6IgLtdql y8Ff+5XQIBEnPyHbkLN4TeH+dr/5iu4rgQWkTKPELjIJsgITZvpFZZ+6lqsFtK7ttv p9tKzbz3t4ApSwebvtpCddxkrVZLHqpU+r46bGn8= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from EUR03-AM7-obe.outbound.protection.outlook.com (mail-am7eur03on2087.outbound.protection.outlook.com [40.107.105.87]) by sourceware.org (Postfix) with ESMTPS id 7E5673858D1E for ; Wed, 21 Jun 2023 06:25:57 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 7E5673858D1E ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=apDheHSJm1tgsmyZ+7qxz0QiXHLAkhwUvlYW3V2mYIaDAE7GqO6K1aBvXoMlKcoJHFdD3OMpS0z/7FneIT5P9dHUGKf3VwR7XDjrMnKp6Xu9iT9wFixUOnI+puskmL2hXfhViJgiaVHLPsSKHRffwBZNqtlY4jj5aZVG4DAvfIyDRREWObk9H1QZJNRnKMscbwJwvaEPQdT8H7YOMPg+KUOqt6w/EwsRGxLKLtEEXsva3kOmf/TgOV7DD89b7Y8PBQNRiuDK9jgFQjIqXX/w1tB0e16mIXLTKS7S8UwG6HkYumlV3eK2tQBinh0eRgt5feKa1NAgUGMqOSuc8J4aDg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=tBTQyBHDfdKurtv2xLjhbYT6yt6F4IC9iBA2co03Dec=; b=h1kta/2zehVmV4/uU0Zd6LZt+XRBRtz3tpW/81DGZEHNF+LAK6VrRMkgfCx8f1OJdJPwrUl7+sApjxuIgugk7aD0twIR2kOHu/1asuB0VM22PdkuJnskRVs7EndWDzluAGPT+0iRHRZPi/ApAS6cdb4OV+lJTFTS38Jyt5QuTQAZ+nMYF4fLKor+FUABhztMHNN/PSb86WSKS5Rc++UJ/vqzUw4jCD46U1XhumTCM4sV6dDX/5/ktjmE0ab6sb0p/xrTJIW07zLoHpFInzDHkp8XVDCYtAD9jPYkLqOE1kPIYEXBb6dSQ3YZVrxWQFJIBD9oOakIVjgUkZBuRsKdxQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DUZPR04MB9982.eurprd04.prod.outlook.com (2603:10a6:10:4db::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6521.23; Wed, 21 Jun 2023 06:25:55 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.036; Wed, 21 Jun 2023 06:25:54 +0000 Message-ID: <457ffad0-9ecd-3e19-f5ab-6153ce4b8bad@suse.com> Date: Wed, 21 Jun 2023 08:25:52 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 1/5] x86: use VPTERNLOG for further bitwise two-vector operations Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Hongtao Liu , Kirill Yukhin References: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> In-Reply-To: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> X-ClientProxiedBy: FR3P281CA0058.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:4b::19) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DUZPR04MB9982:EE_ X-MS-Office365-Filtering-Correlation-Id: 097b78ef-a884-4336-5d27-08db72205e28 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: ez6m3mzG+ruWG9rhSKA4E2K8wvAR67hxGF2vGaQE/ywVeOz40NhDtZpnuXTFyyG9rEIjfhZVZkrNcqm0QrFgjZmbhiZ0uk5DHXCXeMq3RP9XoWcT6TvAFDzp4t0IwfeYnvv44rgM/7hDoIHgJjse8kLaZG3uB8YaoiXYOiD3yP46obnvLuGWxnZRRtZmtILjxexnZqxJoalTPSkHJa9nDcmMNjXaCmUGwuI67q3BoPZzDIlwNfFCVKVgP4C5lSjRSV/OsnTLBL3ANvRfAjRuWY54Co9FmxmsFma14yL6nXMk8ekh9cwYhSDSwYfa8LNtNhCOYXK9T6djCy9Nmi+urvznEOblOro39E84K9e3/u780NcI/4vW3czr6BmHq/MglY8UKjWYaQeTjqood8eYrl2OlUbRYkSNkcQToY3EbxVBWF050KeQNkpAy63r8rh1f4pmMdYWXZ6u2zITpgWAOwTYiDzJBGZzRD5Pl8Zez6LU5U7YzLdBVDFRuX8OEQ3o44guyxVb5fCFLKbxrjJBN4mdRLOvmDN5an1E4+Raf25Ycyh0nlVV/c3XinU85+0C9t55xuSV06AvnmZzgWc7E/bfiFfYGneDw4Texk4aeHxIVyyp1hgklknjV/04pKzNgyKtS6c5yQ8GMNvKL8FfKg== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(366004)(39860400002)(376002)(136003)(346002)(396003)(451199021)(2616005)(84970400001)(6506007)(26005)(186003)(6512007)(38100700002)(6486002)(478600001)(54906003)(31686004)(36756003)(31696002)(86362001)(66556008)(66946007)(66476007)(8936002)(8676002)(4326008)(316002)(6916009)(2906002)(5660300002)(41300700001)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?M1XbY6BsExRtWwE2rZWxVN2c/Jhm?= =?utf-8?q?/MT0ECFL1pRrZwz0xc7Ef90qK39gXrBWZ3dZlzxzZSap4NsCRPsqfzbnSqqs1bt2k?= =?utf-8?q?Wvl2hGUTGCkfQuXml4p0/sAD/qA4SGwAC0iTmIwEpwGAOWfltoqfRxJ1OtZPZJisv?= =?utf-8?q?hh9DAZfBjdkBwATRqhZtkQu2vCdAv6hO3ABoK5A7TtfKv3ahXcRwd4sejLtGvGCzx?= =?utf-8?q?JuVqW7JV9Hxtlzx7npvKajyP2/ZDzNXDmWZ3DJt58af+jUqjKM6hDaQEGVPtf0PJU?= =?utf-8?q?hpkKMh3olMR2tMNKve56yHDeh/PtPqJNXp1hRkhW3Oa7natd6xRYDaZaak3nVQzfY?= =?utf-8?q?eCUDfMqCkbQoUiro3LBx/h/rhOcwDkfEYDZnqgwKXhz/XQI+thZf2jyXTEz4t6niT?= =?utf-8?q?n5VOZzp6xYOOVJKyNTx09RJ/E6GII5/g6XnOiglXINkGuT88SfpIK/6BFcBoQAZkL?= =?utf-8?q?8jPHUki+mItu1KoCOZ+iV9YiQkW8XUzhlDfXD7A0Zj24QYE+rMwG/qEjnmq2xofLh?= =?utf-8?q?DgzB61C8BtMAF5m66WhlN1S3ZbAMSuZ/LhrOywwis+RU+OOXEmkmgySqFwclfyEX5?= =?utf-8?q?UY4M/5C1r+5Ttv2N/ewIUa2n6DgBuueOO5v86misSbVFsjsFU0LXcPZgy4HhLciZ6?= =?utf-8?q?4n9bVLUBCvCpShkYQk0iWY6bKS/W3e5NpAMmfW5GZgNVcPrtT/q9nHJFUsJ9mxHPs?= =?utf-8?q?MGVGkU4b182vHFtuy21JH1D+aW2hJGgtYVaqUEDy2xh4P75FOOpw1Hi+3xrqnuhMg?= =?utf-8?q?5Om9hFGOwksYGHiIobnKOhwBzVrCZ3sCMYKdJUqrXyN5wXeh6/YCjz9B4YCRdFHJk?= =?utf-8?q?4uhhHcM3Rss/4AZUy4Gq9VwftTV74Gp02vp3wZnU68ZlzPibiPQZNFSDNVmBR6JOR?= =?utf-8?q?RlbhF7gXNDe1wcCqbznFnbQP4wx7nGvr3HTpKE9NiKOsGUuOaIsjNX37umuz9BKdP?= =?utf-8?q?fJtVUKQz3UVWk0nDdfXMd692+gQk0DXDhe3png/fXDi4GRGY9+HQXixHapVhFvSBg?= =?utf-8?q?+5pzvW/OwGe9xk7kobeL4Iiw3BOEO4/6VPxAHZ5EiVlbbPopRZ+LR/Dv6YaAFkJcy?= =?utf-8?q?mypt0/I3EuJkFr+Yc1BbXl1oL3rXBGtGXcUECJuagfuqqWpAbeFF1N/gupj/DqvWd?= =?utf-8?q?/dV+n5CEwoRfpt+ODC9atwx8RpStdQdCoYRbXCwFYssbKAt3XbtGy/fUd8EqM57nh?= =?utf-8?q?cXufIOpSjCBxvvOcTkk+cok6clHaf+nBEYQD5mJqHdwtXeXDy/ohgYN/bkcPhDkNH?= =?utf-8?q?bAsV9wv1DWM38mdoVxIEuAPvw/6XJQAsjh6/efXJ4pKlXLk3EMz97PPO833CuLHzP?= =?utf-8?q?uA8M+P79lrrcaEXZPeWRxhcOkGiWsnkkEU8UoLurQ74OGdFXXu56w+VMtI5JZt7MJ?= =?utf-8?q?k3Z8ZbAmuAUW1ClnSxU+E0F/MpzcsMsJ+5087ebBQi6TUlJFicWI4iVth1cuUL8Io?= =?utf-8?q?Ob6PPWjJGM5WgFOUzpprtRXiBtZFrKiThqMuqR2+YKEdHJ2ThK603MBeNZR15Kyl8?= =?utf-8?q?U5uxtJP8n9ju?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 097b78ef-a884-4336-5d27-08db72205e28 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 21 Jun 2023 06:25:54.9101 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: eFOeJO+/bHthDTTDPhIefHWTB0Ul65eI8mVBa9nmHK7nwRMEm5olu8mFqXiNRRAo9iUlnhgMp8ga21ZGXsdi4w== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DUZPR04MB9982 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Gcc-patches From: Jan Beulich Reply-To: Jan Beulich Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" All combinations of and, ior, xor, and not involving two operands can be expressed that way in a single insn. gcc/ PR target/93768 * config/i386/i386.cc (ix86_rtx_costs): Further special-case bitwise vector operations. * config/i386/sse.md (*iornot3): New insn. (*xnor3): Likewise. (*3): Likewise. (andor): New code iterator. (nlogic): New code attribute. (ternlog_nlogic): Likewise. gcc/testsuite/ PR target/93768 gcc.target/i386/avx512-binop-not-1.h: New. gcc.target/i386/avx512-binop-not-2.h: New. gcc.target/i386/avx512f-orn-si-zmm-1.c: New test. gcc.target/i386/avx512f-orn-si-zmm-2.c: New test. --- The use of VI matches that in e.g. one_cmpl2 / one_cmpl2 and *andnot3, despite (here and there) - V64QI and V32HI being needlessly excluded when AVX512BW isn't enabled, - VTI not being covered, - vector modes more narrow than 16 bytes not being covered. --- a/gcc/config/i386/i386.cc +++ b/gcc/config/i386/i386.cc @@ -21178,6 +21178,32 @@ ix86_rtx_costs (rtx x, machine_mode mode return false; case IOR: + if (GET_MODE_CLASS (mode) == MODE_VECTOR_INT) + { + /* (ior (not ...) ...) can be a single insn in AVX512. */ + if (GET_CODE (XEXP (x, 0)) == NOT && TARGET_AVX512F + && (GET_MODE_SIZE (mode) == 64 + || (TARGET_AVX512VL + && (GET_MODE_SIZE (mode) == 32 + || GET_MODE_SIZE (mode) == 16)))) + { + rtx right = GET_CODE (XEXP (x, 1)) != NOT + ? XEXP (x, 1) : XEXP (XEXP (x, 1), 0); + + *total = ix86_vec_cost (mode, cost->sse_op) + + rtx_cost (XEXP (XEXP (x, 0), 0), mode, + outer_code, opno, speed) + + rtx_cost (right, mode, outer_code, opno, speed); + return true; + } + *total = ix86_vec_cost (mode, cost->sse_op); + } + else if (GET_MODE_SIZE (mode) > UNITS_PER_WORD) + *total = cost->add * 2; + else + *total = cost->add; + return false; + case XOR: if (GET_MODE_CLASS (mode) == MODE_VECTOR_INT) *total = ix86_vec_cost (mode, cost->sse_op); @@ -21198,11 +21224,20 @@ ix86_rtx_costs (rtx x, machine_mode mode /* pandn is a single instruction. */ if (GET_CODE (XEXP (x, 0)) == NOT) { + rtx right = XEXP (x, 1); + + /* (and (not ...) (not ...)) can be a single insn in AVX512. */ + if (GET_CODE (right) == NOT && TARGET_AVX512F + && (GET_MODE_SIZE (mode) == 64 + || (TARGET_AVX512VL + && (GET_MODE_SIZE (mode) == 32 + || GET_MODE_SIZE (mode) == 16)))) + right = XEXP (right, 0); + *total = ix86_vec_cost (mode, cost->sse_op) + rtx_cost (XEXP (XEXP (x, 0), 0), mode, outer_code, opno, speed) - + rtx_cost (XEXP (x, 1), mode, - outer_code, opno, speed); + + rtx_cost (right, mode, outer_code, opno, speed); return true; } else if (GET_CODE (XEXP (x, 1)) == NOT) @@ -21260,8 +21295,25 @@ ix86_rtx_costs (rtx x, machine_mode mode case NOT: if (GET_MODE_CLASS (mode) == MODE_VECTOR_INT) - // vnot is pxor -1. - *total = ix86_vec_cost (mode, cost->sse_op) + 1; + { + /* (not (xor ...)) can be a single insn in AVX512. */ + if (GET_CODE (XEXP (x, 0)) == XOR && TARGET_AVX512F + && (GET_MODE_SIZE (mode) == 64 + || (TARGET_AVX512VL + && (GET_MODE_SIZE (mode) == 32 + || GET_MODE_SIZE (mode) == 16)))) + { + *total = ix86_vec_cost (mode, cost->sse_op) + + rtx_cost (XEXP (XEXP (x, 0), 0), mode, + outer_code, opno, speed) + + rtx_cost (XEXP (XEXP (x, 0), 1), mode, + outer_code, opno, speed); + return true; + } + + // vnot is pxor -1. + *total = ix86_vec_cost (mode, cost->sse_op) + 1; + } else if (GET_MODE_SIZE (mode) > UNITS_PER_WORD) *total = cost->add * 2; else --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -17616,6 +17616,98 @@ operands[2] = force_reg (V1TImode, CONSTM1_RTX (V1TImode)); }) +(define_insn "*iornot3" + [(set (match_operand:VI 0 "register_operand" "=v,v,v,v") + (ior:VI + (not:VI + (match_operand:VI 1 "bcst_vector_operand" "v,Br,v,m")) + (match_operand:VI 2 "bcst_vector_operand" "vBr,v,m,v")))] + "( == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)) + && (register_operand (operands[1], mode) + || register_operand (operands[2], mode))" +{ + if (!register_operand (operands[1], mode)) + { + if (TARGET_AVX512VL) + return "vpternlog\t{$0xdd, %1, %2, %0|%0, %2, %1, 0xdd}"; + return "vpternlog\t{$0xdd, %g1, %g2, %g0|%g0, %g2, %g1, 0xdd}"; + } + if (TARGET_AVX512VL) + return "vpternlog\t{$0xbb, %2, %1, %0|%0, %1, %2, 0xbb}"; + return "vpternlog\t{$0xbb, %g2, %g1, %g0|%g0, %g1, %g2, 0xbb}"; +} + [(set_attr "type" "sselog") + (set_attr "length_immediate" "1") + (set_attr "prefix" "evex") + (set (attr "mode") + (if_then_else (match_test "TARGET_AVX512VL") + (const_string "") + (const_string "XI"))) + (set (attr "enabled") + (if_then_else (eq_attr "alternative" "2,3") + (symbol_ref " == 64 || TARGET_AVX512VL") + (const_string "*")))]) + +(define_insn "*xnor3" + [(set (match_operand:VI 0 "register_operand" "=v,v") + (not:VI + (xor:VI + (match_operand:VI 1 "bcst_vector_operand" "%v,v") + (match_operand:VI 2 "bcst_vector_operand" "vBr,m"))))] + "( == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)) + && (register_operand (operands[1], mode) + || register_operand (operands[2], mode))" +{ + if (TARGET_AVX512VL) + return "vpternlog\t{$0x99, %2, %1, %0|%0, %1, %2, 0x99}"; + else + return "vpternlog\t{$0x99, %g2, %g1, %g0|%g0, %g1, %g2, 0x99}"; +} + [(set_attr "type" "sselog") + (set_attr "length_immediate" "1") + (set_attr "prefix" "evex") + (set (attr "mode") + (if_then_else (match_test "TARGET_AVX512VL") + (const_string "") + (const_string "XI"))) + (set (attr "enabled") + (if_then_else (eq_attr "alternative" "1") + (symbol_ref " == 64 || TARGET_AVX512VL") + (const_string "*")))]) + +(define_code_iterator andor [and ior]) +(define_code_attr nlogic [(and "nor") (ior "nand")]) +(define_code_attr ternlog_nlogic [(and "0x11") (ior "0x77")]) + +(define_insn "*3" + [(set (match_operand:VI 0 "register_operand" "=v,v") + (andor:VI + (not:VI (match_operand:VI 1 "bcst_vector_operand" "%v,v")) + (not:VI (match_operand:VI 2 "bcst_vector_operand" "vBr,m"))))] + "( == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)) + && (register_operand (operands[1], mode) + || register_operand (operands[2], mode))" +{ + if (TARGET_AVX512VL) + return "vpternlog\t{$, %2, %1, %0|%0, %1, %2, }"; + else + return "vpternlog\t{$, %g2, %g1, %g0|%g0, %g1, %g2, }"; +} + [(set_attr "type" "sselog") + (set_attr "length_immediate" "1") + (set_attr "prefix" "evex") + (set (attr "mode") + (if_then_else (match_test "TARGET_AVX512VL") + (const_string "") + (const_string "XI"))) + (set (attr "enabled") + (if_then_else (eq_attr "alternative" "1") + (symbol_ref " == 64 || TARGET_AVX512VL") + (const_string "*")))]) + (define_mode_iterator AVX512ZEXTMASK [(DI "TARGET_AVX512BW") (SI "TARGET_AVX512BW") HI]) --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/avx512-binop-not-1.h @@ -0,0 +1,13 @@ +#include + +#define PASTER2(x,y) x##y +#define PASTER3(x,y,z) _mm##x##_##y##_##z +#define OP(vec, op, suffix) PASTER3 (vec, op, suffix) +#define DUP(vec, suffix, val) PASTER3 (vec, set1, suffix) (val) + +type +foo (type x, SCALAR *f) +{ + return OP (vec, op, suffix) (x, OP (vec, xor, suffix) (DUP (vec, suffix, *f), + DUP (vec, suffix, ~0))); +} --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/avx512-binop-not-2.h @@ -0,0 +1,13 @@ +#include + +#define PASTER2(x,y) x##y +#define PASTER3(x,y,z) _mm##x##_##y##_##z +#define OP(vec, op, suffix) PASTER3 (vec, op, suffix) +#define DUP(vec, suffix, val) PASTER3 (vec, set1, suffix) (val) + +type +foo (type x, SCALAR *f) +{ + return OP (vec, op, suffix) (OP (vec, xor, suffix) (x, DUP (vec, suffix, ~0)), + DUP (vec, suffix, *f)); +} --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/avx512f-orn-si-zmm-1.c @@ -0,0 +1,12 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512f -mno-avx512vl -mprefer-vector-width=512 -O2" } */ +/* { dg-final { scan-assembler-times "vpternlogd\[ \\t\]+\\\$0xdd, \\(%(?:eax|rdi|edi)\\)\\\{1to\[1-8\]+\\\}, %zmm\[0-9\]+, %zmm0" 1 } } */ +/* { dg-final { scan-assembler-not "vpbroadcast" } } */ + +#define type __m512i +#define vec 512 +#define op or +#define suffix epi32 +#define SCALAR int + +#include "avx512-binop-not-1.h" --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/avx512f-orn-si-zmm-2.c @@ -0,0 +1,12 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512f -mno-avx512vl -mprefer-vector-width=512 -O2" } */ +/* { dg-final { scan-assembler-times "vpternlogd\[ \\t\]+\\\$0xbb, \\(%(?:eax|rdi|edi)\\)\\\{1to\[1-8\]+\\\}, %zmm\[0-9\]+, %zmm0" 1 } } */ +/* { dg-final { scan-assembler-not "vpbroadcast" } } */ + +#define type __m512i +#define vec 512 +#define op or +#define suffix epi32 +#define SCALAR int + +#include "avx512-binop-not-2.h" From patchwork Wed Jun 21 06:27:11 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 1797662 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=2620:52:3:1:0:246e:9693:128c; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: legolas.ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=n3hciU9A; dkim-atps=neutral Received: from sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4QmD7t0q9jz20Wk for ; Wed, 21 Jun 2023 16:27:38 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id C925F385841A for ; Wed, 21 Jun 2023 06:27:35 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org C925F385841A DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687328855; bh=H4H+gPSOIQO23blyADL5m5UDuhZssSGUlBE3obSsSoM=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=n3hciU9AojQxR9I3p3EVkZiO45c1+k0A1+QKcj1Pl0H+W6+i2QJIVMo/G6ZFJ4Zs4 s/ZY8f1hIZgOpFyWuEd0w/eCuf/LDhJTEu7nYSSVB/yi5obF2bAdUAyKr0S1a873pN /ySEm3rLklyiJfWJ8pmddvQrWA84uwBgum0WtTlc= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from EUR05-AM6-obe.outbound.protection.outlook.com (mail-am6eur05on2049.outbound.protection.outlook.com [40.107.22.49]) by sourceware.org (Postfix) with ESMTPS id DB52B3858D1E for ; Wed, 21 Jun 2023 06:27:14 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org DB52B3858D1E ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=HMoAWh4OXf8G9R6u+gnGewTU46Uj6sTQMsAQ9r8hJP7poVDgOaxFcbte9W2/UtkUSMQzJfIap8iyySgSZOxxiQdq57NUYbDMo3Hq3oa5tsK6f321ecC+aybzrj+TVnBQVy3Qk+RrlByz1DtsShIZ2aMSogjPG0mN+Rphd77rkUFwzIPf35+7MdbGON6GK1A0xRpCeYqSMfAhxsQVJJPW1TL+DBwdl8rli9+NgUU0d7SsH3MPq0LMxLww6wHny87yZe4DpKiherDpzTWp2pyx/8CvTdQZMYDUlWwNw8W757OJAo89b/sKkgTj/ZmJ7BvllDDKrW6mRCOK0wm7DIQNoQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=H4H+gPSOIQO23blyADL5m5UDuhZssSGUlBE3obSsSoM=; b=Gp2Ed374nVuYYF+5zcNMSoMAl2rJAg++RX8Ubp+tmFUxgXCkS5dW6oV2DwtJ6N3EtzUwj61d7n9b3cenhGKx5q/vJHy6hliM+KYVjciTVBD/77RhgSQfc63zasQuXzjYpx/JmcG+7c7zkW71hMFEjMCeMpuglaB3gAS6qP1CSoAucwvki0nLFcwtFtBIUvb74egLTrH9xMiQZAj3wQVi8VThb9z/GjP6LmLk6E2OwhzOV5PijWyqhe/7mIuU6CDBukVR9vToSzspNANYBC+sZeN4OX7oP/itTJDX9TWs55S0U80avJWBbXdwb+zu3Yv3TDa6kYX+bBhVAx+nFxakng== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DUZPR04MB9982.eurprd04.prod.outlook.com (2603:10a6:10:4db::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6521.23; Wed, 21 Jun 2023 06:27:13 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.036; Wed, 21 Jun 2023 06:27:13 +0000 Message-ID: <3cf55c98-d18a-d1ad-2fc2-015c63e217ca@suse.com> Date: Wed, 21 Jun 2023 08:27:11 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 2/5] x86: use VPTERNLOG also for certain andnot forms Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Hongtao Liu , Kirill Yukhin References: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> In-Reply-To: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> X-ClientProxiedBy: FR3P281CA0106.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:a3::16) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DUZPR04MB9982:EE_ X-MS-Office365-Filtering-Correlation-Id: 5807154e-659b-4d31-60d2-08db72208cd7 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: uh2P5j62wDl+L8j8DJMpWQUjt9tVooPcvEiJNgZB4FbTamMxyUWEstvJiMmjXdEQTBUYS5UZyDawE3B5w7Wf4VMxZjeXes+mZU/NPKQObeW97bgFs0U4Zk8p7wPRDQNzQ4NyPAElj95un9ApKeFEamTn2V7IFylr4z7m6Rk7o77w3u9UevGmlcp4KRn430AFzpbjYUGksLq5ncycIq3ynOzz76AwbjQINXzzm695G7i1cP+1oWB66MUTjG1E9mVhRPzhGdhNR+UPfiHqL7qauPeL0OLzBoqT+J6uiMVDS5DG1IoVEFHQBIZNQPVAv4JtJROivguL3yGCaq5bqXWZCs5BUbW4yjPWLbwzcA1eBberwWJPWCZIuD9imeVb+WK4zGQMsDVeMssdfzEGrhitzwHX81tspsCi3iNVij4mQ0BK0BYXl8M/sWmTDEmuTgRXbbialwejhD/BV2+xOI8xBhp+3b85HDNdn3SyjIgmN7SivIKpNeYiq/3Jm7uzHd5W7W3rM/EYWAIgIbTY6oAcVOpwhQ3BR/lyAzKpcKcZw+oeVc3ufU5eAl8sBWNQmvaFJSNhnvC0I+PuPTXqHCmM2IBydcDzHP/kjrOHmRaTXSQqGD0SFgDDvHLyv8Crtnc99TKwM3lW2noF2rC/CuQgUQ== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(366004)(39860400002)(376002)(136003)(346002)(396003)(451199021)(2616005)(84970400001)(6506007)(83380400001)(26005)(186003)(6512007)(38100700002)(6486002)(478600001)(54906003)(31686004)(36756003)(31696002)(86362001)(66556008)(66946007)(66476007)(8936002)(8676002)(4326008)(316002)(6916009)(2906002)(5660300002)(41300700001)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?oV/QT3dc7jcKVDR5oJSibuu5H0/l?= =?utf-8?q?VMNM4XHdS3sj1l6Q++nCqxVsW4fYY3aqVfK7FyGed/tSWqZEpG1AYuKgb+AsJL+zl?= =?utf-8?q?2v+fe5v2OGThWPvVHYpDHP4PRJrfvn6+stYk+6JYqZLGs0ld+DZF44nu/PlHTlMTe?= =?utf-8?q?UNUaP4XuXZorcRxaykM2Sp/VKCIMd35viwyyNFtDOVPbs7HmM2xCAdlyySaZZEZI9?= =?utf-8?q?SYRcaHMrH8IH0p4Cx6dql5HtGnulOopvEMLginXxGZ2PnIGSXeo/1CnpxRRLjmdJa?= =?utf-8?q?7tFg4SUousBY6a9h4n7uuUn4ywer1yFFHFouHIQQpktXfx2yktexJs3L82FPLuKk7?= =?utf-8?q?rvS/bT03gvUYiZvfAbmM3+HarrszwiI6qH9eZYaE9Cb+hVvnkgveWYkO6YQcK93rv?= =?utf-8?q?npXnGPcU4sVm2g7hDUIkW7bAEGEzq2h/dc9yddiYbCo0CW9CzUZbuYaX8EdPQoxOP?= =?utf-8?q?xpqO5HUzcjZeVHtpRtI5tAo7u7vBP0ue9Y+v0+xMIks/bcbFWFwvK0rlHsVHNujAe?= =?utf-8?q?/SucyhFP7cpxcRg8PYPtoDmVcSdXnOyy0slfImbndWLBmt0YEAwx9D0Uglc89srkT?= =?utf-8?q?pniwDlvJIciAh6wxLjhL5cSyKHwYqUmG5f0F0uc3B8gq7hEts/wsMGSwQeEbmJHch?= =?utf-8?q?qEdcRodOlyihPAItNnFR2J9O62oemzjhipXy7SKZ3YHd1GSpJa7RGBeX/6lCyg0vd?= =?utf-8?q?4JSZab8SZ7HqNflXTTQQXmS0eZw0wkndrhYa/Y/nF/i6E5Bk1bfEb179AnUAa1/Ac?= =?utf-8?q?DBKKw8/UFVQLqqtHQEGXNdE+nrp7Epl4lHiSJRpn3XJ63JgvFZFHbW5P6foJeuMxs?= =?utf-8?q?G/7Jr/75IbDmTpshfuRU0xqUbAKxMq11ZO7lpA6Gzd0xrRpzz2MOr4gE/RYfmYXRp?= =?utf-8?q?YhhVkTV8qBw4X4GpdiFpkFzSDSMgk5+aDcDemr3SRbLyt6D7E28bwBzpT8RD7k9LT?= =?utf-8?q?PwfCss+Tf+RYp8v7T6G7KbMOUXRHBC3nwDQxhSalF+8Kd5C271133LqZ/BNdXIg7S?= =?utf-8?q?9njTi9L51roTCAmUeJqyP2gT/2oXxBuoKhDCW/qITQruW7zX83+THB0HhvMuEg6T1?= =?utf-8?q?MIF0DDDHK4KlBl9eFjFqxTn4ugEkg1tzhx2h2NYYQneAZo16IJlrwN2WAhiUzK1xG?= =?utf-8?q?G/XboYOQajYvfNply/w5OHaxZnpEI401aTpEk7XRuWPr46N8JltDvnM81qV1yRHz7?= =?utf-8?q?AkfSD1Fs6LWRLfI0J2ZnVg4ytoqfx6t/li120GJWTVSo62mpaT79s9vMxiJ6dNoiU?= =?utf-8?q?hWVgTBf/y8oaxf9LuLPAcdWFnA+9cbs9N15pG/aR/AvtMyywTQyaFsC7z1ZBV2PEf?= =?utf-8?q?yDRDCY02Be74lU+okEXLqVbmAvB9K3CeENHM7MhSTtLKtK/dfIgFzFBX+dvplKF/F?= =?utf-8?q?lJTGTmEATcsg41SmFfOJ13g/pRDdKj0qES2mfkDzSTMAzAyynXYMRG+GUGXVq6VCu?= =?utf-8?q?lqNhDeNxQhqPsyBCHOoaGbl/iRNBhTK+s+cCFcwh6TeP1g3UxAiX8Z0uKnmVQ4ypa?= =?utf-8?q?GYE8AdSwwaMY?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 5807154e-659b-4d31-60d2-08db72208cd7 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 21 Jun 2023 06:27:13.1673 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: pQca5zu7YW5Gzq8Z1JryppIpaOTS2Iy6dv9vDHclA0IM733yW9OnVjGnx5EkYACPIjfh5ywt2QtzRW3+enQfHQ== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DUZPR04MB9982 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Gcc-patches From: Jan Beulich Reply-To: Jan Beulich Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" When it's the memory operand which is to be inverted, using VPANDN* requires a further load instruction. The same can be achieved by a single VPTERNLOG*. Add two new alternatives (for plain memory and embedded broadcast), adjusting the predicate for the first operand accordingly. Two pre-existing testcases actually end up being affected (improved) by the change, which is reflected in updated expectations there. gcc/ PR target/93768 * config/i386/sse.md (*andnot3): Add new alternatives for memory form operand 1. gcc/testsuite/ PR target/93768 * gcc.target/i386/avx512f-andn-di-zmm-2.c: New test. * gcc.target/i386/avx512f-andn-si-zmm-2.c: Adjust expecations towards generated code. * gcc.target/i386/pr100711-3.c: Adjust expectations for 32-bit code. --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -17210,11 +17210,13 @@ "TARGET_AVX512F") (define_insn "*andnot3" - [(set (match_operand:VI 0 "register_operand" "=x,x,v") + [(set (match_operand:VI 0 "register_operand" "=x,x,v,v,v") (and:VI - (not:VI (match_operand:VI 1 "vector_operand" "0,x,v")) - (match_operand:VI 2 "bcst_vector_operand" "xBm,xm,vmBr")))] - "TARGET_SSE" + (not:VI (match_operand:VI 1 "bcst_vector_operand" "0,x,v,m,Br")) + (match_operand:VI 2 "bcst_vector_operand" "xBm,xm,vmBr,v,v")))] + "TARGET_SSE + && (register_operand (operands[1], mode) + || register_operand (operands[2], mode))" { char buf[64]; const char *ops; @@ -17281,6 +17283,15 @@ case 2: ops = "v%s%s\t{%%2, %%1, %%0|%%0, %%1, %%2}"; break; + case 3: + case 4: + tmp = "pternlog"; + ssesuffix = ""; + if (which_alternative != 4 || TARGET_AVX512VL) + ops = "v%s%s\t{$0x44, %%1, %%2, %%0|%%0, %%2, %%1, $0x44}"; + else + ops = "v%s%s\t{$0x44, %%g1, %%g2, %%g0|%%g0, %%g2, %%g1, $0x44}"; + break; default: gcc_unreachable (); } @@ -17289,7 +17300,7 @@ output_asm_insn (buf, operands); return ""; } - [(set_attr "isa" "noavx,avx,avx") + [(set_attr "isa" "noavx,avx,avx,*,*") (set_attr "type" "sselog") (set (attr "prefix_data16") (if_then_else @@ -17297,9 +17308,12 @@ (eq_attr "mode" "TI")) (const_string "1") (const_string "*"))) - (set_attr "prefix" "orig,vex,evex") + (set_attr "prefix" "orig,vex,evex,evex,evex") (set (attr "mode") - (cond [(match_test "TARGET_AVX2") + (cond [(and (eq_attr "alternative" "3,4") + (match_test " < 64 && !TARGET_AVX512VL")) + (const_string "XI") + (match_test "TARGET_AVX2") (const_string "") (match_test "TARGET_AVX") (if_then_else @@ -17310,7 +17324,15 @@ (match_test "optimize_function_for_size_p (cfun)")) (const_string "V4SF") ] - (const_string "")))]) + (const_string ""))) + (set (attr "enabled") + (cond [(eq_attr "alternative" "3") + (symbol_ref " == 64 || TARGET_AVX512VL") + (eq_attr "alternative" "4") + (symbol_ref " == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)") + ] + (const_string "*")))]) ;; PR target/100711: Split notl; vpbroadcastd; vpand as vpbroadcastd; vpandn (define_split --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/avx512f-andn-di-zmm-2.c @@ -0,0 +1,12 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512f -mno-avx512vl -mprefer-vector-width=512 -O2" } */ +/* { dg-final { scan-assembler-times "vpternlogq\[ \\t\]+\\\$0x44, \\(%(?:eax|rdi|edi)\\)\\\{1to\[1-8\]+\\\}, %zmm\[0-9\]+, %zmm0" 1 } } */ +/* { dg-final { scan-assembler-not "vpbroadcast" } } */ + +#define type __m512i +#define vec 512 +#define op andnot +#define suffix epi64 +#define SCALAR long long + +#include "avx512-binop-2.h" --- a/gcc/testsuite/gcc.target/i386/avx512f-andn-si-zmm-2.c +++ b/gcc/testsuite/gcc.target/i386/avx512f-andn-si-zmm-2.c @@ -1,7 +1,7 @@ /* { dg-do compile } */ /* { dg-options "-mavx512f -O2" } */ -/* { dg-final { scan-assembler-times "vpbroadcastd\[^\n\]*%zmm\[0-9\]+" 1 } } */ -/* { dg-final { scan-assembler-times "vpandnd\[^\n\]*%zmm\[0-9\]+" 1 } } */ +/* { dg-final { scan-assembler-times "vpternlogd\[ \\t\]+\\\$0x44, \\(%(?:eax|rdi|edi)\\)\\\{1to\[1-8\]+\\\}, %zmm\[0-9\]+, %zmm0" 1 } } */ +/* { dg-final { scan-assembler-not "vpbroadcast" } } */ #define type __m512i #define vec 512 --- a/gcc/testsuite/gcc.target/i386/pr100711-3.c +++ b/gcc/testsuite/gcc.target/i386/pr100711-3.c @@ -37,4 +37,6 @@ v8di foo_v8di (long long a, v8di b) return (__extension__ (v8di) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) & b; } -/* { dg-final { scan-assembler-times "vpandn" 4 } } */ +/* { dg-final { scan-assembler-times "vpandn" 4 { target { ! ia32 } } } } */ +/* { dg-final { scan-assembler-times "vpandn" 2 { target { ia32 } } } } */ +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0x44" 2 { target { ia32 } } } } */ From patchwork Wed Jun 21 06:27:32 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 1797663 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=8.43.85.97; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: legolas.ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=RHpOC06z; dkim-atps=neutral Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4QmD8G5WXVz20Wk for ; Wed, 21 Jun 2023 16:27:57 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 939073857719 for ; Wed, 21 Jun 2023 06:27:55 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 939073857719 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687328875; bh=jjDHrJq8Rq+mUGyZSNCwLXIGDo5cXtj/aTQ9U0grbdA=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=RHpOC06zWInleE0WGC191PcgtJCJsdvbmSSR+mMWw9NsdLlIIsgcj1ZfsrQhd4yu+ HW9l2FRo+UHpQ0wMZpFOHJpeuVGLM6wmbYByrRRNJf/DnBeztIg5MvrUOhfSR0uGx2 QLDVA4UvJbdowjgzdmh57ajX6DKpxgXu/Q4R/OhQ= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from EUR03-AM7-obe.outbound.protection.outlook.com (mail-am7eur03on2053.outbound.protection.outlook.com [40.107.105.53]) by sourceware.org (Postfix) with ESMTPS id 5E1863858281 for ; Wed, 21 Jun 2023 06:27:35 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 5E1863858281 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=gH+26k1Z389BC06WurF6QTDuGiCFXtN2l6F5NZ7W5/D9Zasj11gpZDZ60PHDQ6FONfT3nWpQ5/9IkimnfNokADf2fwggRqzBpppe4jPUKfvf//Myk7Cd7aWf7pgxc8NHoYZJ0RMJdDhWoHv4dh138YCfgbSYZb7CEC3cmuYf8HDm2r68ZQNPZ9T7hoO+nh9LsSTIsjSNXDJs+Vd5tfB2dZ3WuXMRv6hWo7ODax3OlKgfbKf/ZA6VGemVKOxCdrlxx0p1nM7P8eONbMnf983OFVylzGrn5P1mnuTawlE/5fUCrQcgYezt+/TAiOlJg7hP8SIrgxmyXw1miWD8I3wRQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=jjDHrJq8Rq+mUGyZSNCwLXIGDo5cXtj/aTQ9U0grbdA=; b=FmbdzrzDQcv+/IW0z3K8bRGhDj477JAKzda8y7oUl4+7US/o3zZc6qkxFd0z0ctAkh/xh6xXNyPeM47gJCSP8ryZTNMhepQ/TgVQXEdbpwwg9TxnjFnAB/Ud0Q3UMzE4ZwGl1U1PyX1KcJxUscYouv9j8f3aFeW4sd/+QsFulOgUD0+3nsE4Jzn4vxCnYJ/6fBh6hWAK+gGUtiJ7i0pIs+/vfiUMZBUWK41i663buyuwXPXhUsJy/2iHtxkewbGrpjOsUWTrnpGi1hWzKsqxXGIWwEQLIS2jqz5RWkMdvFottTmbfKIBHSQY81kkZxQ90LqKYixNjvbCjdDyB0gVWA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DUZPR04MB9982.eurprd04.prod.outlook.com (2603:10a6:10:4db::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6521.23; Wed, 21 Jun 2023 06:27:33 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.036; Wed, 21 Jun 2023 06:27:33 +0000 Message-ID: <4080e2a5-7d77-0ff7-8dc6-935ac79da0ce@suse.com> Date: Wed, 21 Jun 2023 08:27:32 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 3/5] x86: allow memory operand for AVX2 splitter for PR target/100711 Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Hongtao Liu , Kirill Yukhin References: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> In-Reply-To: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> X-ClientProxiedBy: FR3P281CA0119.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:a3::19) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DUZPR04MB9982:EE_ X-MS-Office365-Filtering-Correlation-Id: daaeab89-66d2-46f7-17e6-08db722098fc X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 4MAObKooPx2C4tvwFQfhcJE+6FKOTbtj/b/8S8MexOZB/p37HmMHoeqjfP/7GF2tSySVIbdJQxNfF1nQMzUbjGuxwyV7SS8OHRFZ2zXg0CNvM0VJzgaGuDTN/n0qrc5vinlNtypRh9DZ/YsW7Pm+n+8PTx42ZTxKoi4T4hcQputBQ5g5SYx3CQ0T1MrXeVdWiQlkZDJmQrD9o4ImAUFD9cseb9E5ui460AZ25uz+XTWqsSOqn5l4/lQ2WqFb0XbnCxd+HnOqG43db9Lq6uIaWH2l2f0DaotQtcUFHDPOd1T+MOS8dW9Z+EJf0QuxSEWfOP8ZwlVvezK14SmMhrqrhPn5RdfYA/C3fM6xPwMAvquao1oV/v0akuLfryButf70J5NSUiw7esv6pyR48Gl/zZo/jyF8BR9Vc6J1j/xRpEHN6ODqX5jvOhviM8JyffpRHIvhvAaFOJtBvRdzIpoNl45+Ry02Z7D9nZ5trAtEZ1soc82UVzoa9ExGXHGl7BeiQb1+AYOwfwWgzuI0CJShMZedSYXIUg3wlwu1gT78gVH74F3MrnMmOSpMEX0Yl3vxkD0PLWrT1qxAd3RWJWBAxftWXMXGj7ILG18cWDwGrsIoTHnpRiIHVoBPKKarVPttcNjN2ho0ImCtombfAMFRVQ== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(366004)(39860400002)(376002)(136003)(346002)(396003)(451199021)(2616005)(6506007)(26005)(186003)(6512007)(38100700002)(6486002)(478600001)(54906003)(31686004)(36756003)(31696002)(86362001)(66556008)(66946007)(66476007)(8936002)(8676002)(4326008)(316002)(6916009)(2906002)(4744005)(5660300002)(41300700001)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?MiqFzK7IRf9XhdGXpGKSW+ydmrZ1?= =?utf-8?q?+t2BV08TtVzkUhVZz7G7/CzD/b21mHecm6yw4VU2ynOi7NmpgsWnkXUAId0QfG1Ua?= =?utf-8?q?kse9ZJStYsc1SEVJHlbGMhgVyjz4oeFNW3N9lyyULJBsdZOXm7WAtrbRZeUvLkmuT?= =?utf-8?q?OxF20yw9g69wt9Ca1mWyFCvKCfNBaXAoHYtUbMfvkU2HPhIGEZzPKgz9gncgz+X2V?= =?utf-8?q?/D2jMUsvPRLQZX0/h+wxrOkR0oQ0J517JilKOHGF0MIP+8O4Mn/deOuQmvQIJsu2y?= =?utf-8?q?Ka8rqAG0lGjpuXFHLasoRM4w3nttJxT7rB6TVLyH1BhmhPZtftfOWwk9MVSVSDmBk?= =?utf-8?q?giblxJVmPvWtn90x0igZFkEitzq6HGvJwljWIcaksZ23DTujBGUhCK5NQVl4mqUw0?= =?utf-8?q?kcDo6MjgVRvP3WPEPHQhZjJHByt/MEKhF7aFMYa6bEm8XvmYqi78ahDB1RXtC74Br?= =?utf-8?q?g0eqPxK6S5qOBJ3PlcZ+FRXG3+eopmCR/jO6SwLsxjj26GWeo5Z736xT5dy0d18cw?= =?utf-8?q?+fOsNHh2nBrm74kMvpuemR8pXAdMGCFEY3TYokhlTEBix0XjtcSpv9nYtuZfwKeks?= =?utf-8?q?E4R0FFJ5C0TeeAfoIhwCU3FReNnyBXvx7eD0Rx9/o8FHr0lSP8iHI3dGnNqVpCHKK?= =?utf-8?q?2OnyOEboBq6Ww9ZhpHErd35vVMZJYngbF5UNM/9w27XrVjzRwj6H0nlkwlJVjG8nB?= =?utf-8?q?cnZ3EU0xpMS5GKtk2vKVE5AO0A30aDeQwonES5ozVjwqgNONw0Qd4tG8TZ0KHjIGU?= =?utf-8?q?kPu4VFAUWh/tx4jYx+g9dGOEhP/yMnmKCwgLYNs1kGBKefqkMvHoE7ffyNbXPO7qN?= =?utf-8?q?JA/AYJ9y0n7qiBc3xO5+QBR9DwXTx3G+EgSco2NIkRyBvTmZhd35a71moI4F1NWpW?= =?utf-8?q?SZ34QI//mcxpXT5d41koffSpT0RZTqmJta/FlbM6n1Di27XEO/cIBeRNDw0EPyL5o?= =?utf-8?q?5t5NMCO0KIOlxSlBWBLodZjxjgCrKYpSKfAtWDnTRhPUlR1srKU57fU9fjvIm7fID?= =?utf-8?q?xDuJsvb2yTcfKvq8gFvNzCNsdyB4b3M+2wCZGF3jFxDpPpCsamgyLcqYPGHhQ9K5h?= =?utf-8?q?h6YYxK1ciNXv9scnmTdO5hjsEcnzVpi1ZNF1s+tfhPTwDsbSVXGzySqbU/6Kh+xfz?= =?utf-8?q?pprIU9f1dNE3Xs90B95Zn9tDoOjHrARm7S8c3F9K5QflOQuTXCZsRD4zM/Z1JwQ3c?= =?utf-8?q?WCALeb3nG4HaDUFlFR3URrs4c4aR692xt4aERb+KHGAJHoqqR9OZsWRfz9SrdsKym?= =?utf-8?q?IEMUqiywZhnlCxKRM8PR0kA5bDRTOO8QzQl0SJw+jlGutUQUiEJIN7CQ7I6ygYgpC?= =?utf-8?q?Lfm90xdH4ySAXXf+/VfeoTiDd3hjUvq12Oouu3w7/GJ1ap0u1HVqhJuN/nIUeN4U4?= =?utf-8?q?aA+Ki2ICJlA8e3Gdj7owTo9cvD6HVVUdIjdSJ6bOIjxkIw5kRaPRWSmQzdSpqoXhk?= =?utf-8?q?xBwD1M+4olq+UFG9yZANKp5V0/DLPt4prqST6rVwMMOF4z0cqAGvh/ZVncajDepTk?= =?utf-8?q?U3uFWIaom1an?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: daaeab89-66d2-46f7-17e6-08db722098fc X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 21 Jun 2023 06:27:33.5775 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: QXTlaoj2mD3Gm6ZNw6HpskOqJlaKNdl2W649PBMruhPwGDzKubGYWZ3vT0a4e09u8shrNs7FkouEwN+eGi6YTg== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DUZPR04MB9982 X-Spam-Status: No, score=-3027.3 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_NUMSUBJECT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Gcc-patches From: Jan Beulich Reply-To: Jan Beulich Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" The intended broadcast (with AVX512) can very well be done right from memory. gcc/ * config/i386/sse.md: Permit non-immediate operand 1 in AVX2 form of splitter for PR target/100711. --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -17356,7 +17356,7 @@ (and:VI_AVX2 (vec_duplicate:VI_AVX2 (not: - (match_operand: 1 "register_operand"))) + (match_operand: 1 "nonimmediate_operand"))) (match_operand:VI_AVX2 2 "vector_operand")))] "TARGET_AVX2" [(set (match_dup 3) From patchwork Wed Jun 21 06:27:51 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 1797664 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=2620:52:3:1:0:246e:9693:128c; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: legolas.ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=EpGl4nxH; dkim-atps=neutral Received: from sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4QmD8g27HLz20Wk for ; Wed, 21 Jun 2023 16:28:19 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 0D3153857352 for ; Wed, 21 Jun 2023 06:28:17 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 0D3153857352 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687328897; bh=M21CcXEDFb8i15kOIOiejLCPJm1vvcniThgHMcivVok=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=EpGl4nxHwUD9KO5hnVJArxJkJpE+yLVfPKVRtSSba6XWmsgmq/ykozi0KEUqXjYY+ HkBzQWC6y0LF3Mou18T0cYKARo2DgLtO2Eqau7cMaoQzWScSBxpFq2rSdkB5Bib99T 3BccW7zapOZarTnwRoddAaOET1hWMVJma5QtQ9uI= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from EUR03-AM7-obe.outbound.protection.outlook.com (mail-am7eur03on2081.outbound.protection.outlook.com [40.107.105.81]) by sourceware.org (Postfix) with ESMTPS id B4B3C3858281 for ; Wed, 21 Jun 2023 06:27:54 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org B4B3C3858281 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Xj/EtJ2ddR39JjBXx8clj9Goc2nacdJujrOufDqM2mC27TNPURebINDgqptE8UNxN154ei9EsswydBb1Pz+8n+BqkJ33LHon3Nk0AOp5665PzQ9YtdG4H+Yf1TSId1HuGA11YJuQJH2MVFT9r/TpbZFXhXotsmmntl3aXF6cvYzXtcOhjl+tpQjwR4RjnuX5kUm8lJ59UJZlzTukj2XedKOhle4coKXhc4gxGdsV6L7dd/dMC1esWkabGtaf+y4yVhB0AGxERovdRzKI+YsazXOLdqq0ao+ZkrYNhl8zJPe2gCWQFslPuZwek6Jki+btEnFtMwPQO2LzKWJlPKNrrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=M21CcXEDFb8i15kOIOiejLCPJm1vvcniThgHMcivVok=; b=d9HkSVL9X6Tner3tTQvYTRU84lsVzfwP/QSH1n86eX/N651/juOXEJPg8NN8wvxgtr495cpVI/81ducX2pV5dEo5+cUPGWzUw6oKF8UgGE83eaL7YLqbo2WUaKiActxQb9mVH15C/ssM2ia8p60rPF+HkDLPqkCHCS4Ult/fM+opC5N9aCew5Fch2e17XfAMge/HS6flpBsLLJflNTcB+E0XwIc+K/r4v1S6nN5OqY6VoicRMCWQATBxrAD4X3leOakSXrUXP21ow0J2usll7+SCMyJP+uRhEG9u/PtXSkyKyZoCXhr+L9ucJRhb24yUB1Qo4VZaw3S02nu4Qp18wQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DUZPR04MB9982.eurprd04.prod.outlook.com (2603:10a6:10:4db::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6521.23; Wed, 21 Jun 2023 06:27:52 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.036; Wed, 21 Jun 2023 06:27:52 +0000 Message-ID: Date: Wed, 21 Jun 2023 08:27:51 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 4/5] x86: further PR target/100711-like splitting Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Hongtao Liu , Kirill Yukhin References: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> In-Reply-To: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> X-ClientProxiedBy: FR0P281CA0074.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:1e::20) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DUZPR04MB9982:EE_ X-MS-Office365-Filtering-Correlation-Id: 2af66d0a-5342-42b3-1648-08db7220a481 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: iM4lz5OGtSLa4o1G6ab6V2REksE2bNH5c18ML1nHL+kf5apjcj4sfgU48tBK+I7kirPO7c2dasuXd1nl6jLcwM6MjmXao2vMyeYLTvCz59UMZ91pAD+pOVA18RNRvQNJHqT41AnRB4fKzSjOa/nsCL1L4iHztj4j5kKDkbykWo93GAkdf/gzqqreml1zxhdfi3bDGCC5VK9iRaQ45rt7pUSdqRMoNFm8OWVdUjzexR3M/JzU4+abhPjUTdTojUxqaf/ahon2zGmEnsnrD6PikPfFbyrVGHzJOUAUGe3pXkUVXM2sc8J9Y0jQAIYw6eIT3jgknwRSbL7FY4eSgL4lw+qBTEx3/imf7RBvtjHjLZnh+VvV7Ga2lEdze6LME9NVUdPvRa6Ff8vvu0qXFx4IqJFa9Hc2UGBUjE/5TwAO+MdTsT3itkYRy/i4Rm5C1clA6Uigoc747yAulrywwO65MbSQ3Gki+HNr5/2HsPXBNhBLTEDndQOioIOE8c9X+rbMG1ItKa3aOXDOjQzZr/p43Zc+RPdKKXlOh7PcQ0dT/5vVqgN+jqoQa2uW+Z/2h5bm/vtQAm+tjJOKr5zE6HOykFWpkLMxoz53gImZHhsWDzmjD7watO+0+PIUBRqo+KX8z0GubM19L3np51hk4/y6hw== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(366004)(39860400002)(376002)(136003)(346002)(396003)(451199021)(2616005)(84970400001)(6506007)(26005)(186003)(6512007)(66899021)(38100700002)(6486002)(478600001)(54906003)(31686004)(36756003)(31696002)(86362001)(66556008)(66946007)(66476007)(8936002)(8676002)(4326008)(316002)(6916009)(2906002)(5660300002)(41300700001)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?qAZPUz1NbnN0uODneUGP8u7YqCYg?= =?utf-8?q?XC64c926KmkZsblRC/Qeqany/Z+ZlCxuKNSwpIbIKlcR2DHwi7c4IGbCopp6icr9x?= =?utf-8?q?Dd8xyLvGytdFfP6axTax/YFIf91WXTZSU3szzQlm3f/Lff4f8k2mMAwhs8EiFSlpV?= =?utf-8?q?bs3HgNxgogIva27ztfIKmiTlVgk588WAk50BtKCmwonndqiW5wKP0vgV6MPOlnMwA?= =?utf-8?q?LGdh64qPLd5q9NmvHftR+VdgBAv0SzvLchv66+w+DDX0btVR/rXFUguBdCtFVOvXF?= =?utf-8?q?mMR5fFBhPmTLYz3tIPc0QfKcswOQKnPBMHvEjDWnlpJbff7+t7l8ghcSwN6i7Nkhc?= =?utf-8?q?D0SZAO5R2gM0RpqzcLN765QjYCiT3q9bDv3w3AngzLRGmTzbqlM51fK/ZnUc/MUwv?= =?utf-8?q?UbgsxJTnAytAM5I2ZQnyk4X84/D6h5GXGHaPlUVu+2O4nquqRwVPPoCAxUocfO8j5?= =?utf-8?q?6rAAOU3xv+q9eY0NSxFN6RcwjQQpo0DSnsL5TevvwG1kWOrW7BixVE7zhD1wvj0lb?= =?utf-8?q?v/e90wcVaAGo6HJB3ovrPh5wlHDDgY8cnLNyULAy2U8NXG+Af4uLnnZVFNhpZ4WSw?= =?utf-8?q?TAh/4jrI8dXQ80s02S2twT7IxccVUG2XJ8YcnibxnmhTuWSEDJeFVGWSBVFm8JdGI?= =?utf-8?q?Lb7OwoAhaFWLphpkcvU/KH+rJtpKtPi+027UkBlOoZ5IeW2ijLfzDO0eaBwV5oLP/?= =?utf-8?q?h3ei0ddt6HNaXjIUkudAFbqiRmm3eHD+OiQ1CBPt2putmoqHMgqMeQK5CUQvCOqRt?= =?utf-8?q?sVwqRktb0DyiGbKTQW9Pgq1U1cb4+GFYPQEe1GkFfoL8U+ksz6CgQpl04MJzB6OM8?= =?utf-8?q?ptvd7O+oQK3mJpPtsCNF2J4cHYSBvXkl+K1PvXcL1dKtAE1HvXz+RbxCsqmms+w3R?= =?utf-8?q?o3ZbSdl5wm0D0uUIXjmFfYvIcn4BVO+aiZ71d6ChRwrqIj1Dc0HXiJlqDsmDLun41?= =?utf-8?q?K6ddNIk5abl+YHWEt4GcNWxhXZN6KvQM7P82TIUFdE5SeV+JTZNl5I9NMDHVQiD46?= =?utf-8?q?YO1hJi0TaE9l8AlZuu+2Da05c2dipZkcWsvzSktpMddzAUtSFr4xqMfpzw44nGyIs?= =?utf-8?q?/sauIDQ5tkV8/McBt7Hjjgd6azWoubUqdp4YroKsnTvQ5+4bKDrw0Bs9zSMYsr0NE?= =?utf-8?q?DWeaizQgRUfLJPtezAA+lg5YatTFAb/Nu9ruPKVGiBSjk9YhaUZoYe0MkZLgKX6uy?= =?utf-8?q?J5ia4TBZrOasbNitX13nI5e1s4dIPxTuf7Ke4BBBf41n6jE3OdFFUdaOAVv+ePruG?= =?utf-8?q?ePhi5BvaB0B9dTSMUMW/jJdOID8AgbnG/UEgFiiijNaTioCUiBsFeJuJsCseoKcxB?= =?utf-8?q?x3rchfV/HnzJ0QziwyzbNuovjI/X0Sl2va7yvITrND/EBtWHCeX8ScruOW6SRcYup?= =?utf-8?q?kUFoqpqJXCxjqQ3se8zJxc0cxwwQTB5ULbp5XoBysrgqciRbZYeehe+2QypOBRi2F?= =?utf-8?q?UisvgzNox7l4jja2Zi6wx7pVUtuTNqOGkGmYt1FH1QVD2iR9ccpF7vBKcAzSPfqBI?= =?utf-8?q?VUZTkDWJpvGh?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 2af66d0a-5342-42b3-1648-08db7220a481 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 21 Jun 2023 06:27:52.8755 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: z7jUKBG9eRnzew0yGdYGwVVXYgOeZyG0IFdeszgEEs4feQ3JM0J5rNUkc9yWQml6eMO6RrUXeq8jR1v7/ycd+g== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DUZPR04MB9982 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Gcc-patches From: Jan Beulich Reply-To: Jan Beulich Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" With respective two-operand bitwise operations now expressable by a single VPTERNLOG, add splitters to also deal with ior and xor counterparts of the original and-only case. Note that the splitters need to be separate, as the placement of "not" differs in the final insns (*iornot3, *xnor3) which are intended to pick up one half of the result. gcc/ * config/i386/sse.md: New splitters to simplify not;vec_duplicate;{ior,xor} as vec_duplicate;{iornot,xnor}. gcc/testsuite/ * gcc.target/i386/pr100711-4.c: New test. * gcc.target/i386/pr100711-5.c: New test. --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -17366,6 +17366,36 @@ (match_dup 2)))] "operands[3] = gen_reg_rtx (mode);") +(define_split + [(set (match_operand:VI 0 "register_operand") + (ior:VI + (vec_duplicate:VI + (not: + (match_operand: 1 "nonimmediate_operand"))) + (match_operand:VI 2 "vector_operand")))] + " == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)" + [(set (match_dup 3) + (vec_duplicate:VI (match_dup 1))) + (set (match_dup 0) + (ior:VI (not:VI (match_dup 3)) (match_dup 2)))] + "operands[3] = gen_reg_rtx (mode);") + +(define_split + [(set (match_operand:VI 0 "register_operand") + (xor:VI + (vec_duplicate:VI + (not: + (match_operand: 1 "nonimmediate_operand"))) + (match_operand:VI 2 "vector_operand")))] + " == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)" + [(set (match_dup 3) + (vec_duplicate:VI (match_dup 1))) + (set (match_dup 0) + (not:VI (xor:VI (match_dup 3) (match_dup 2))))] + "operands[3] = gen_reg_rtx (mode);") + (define_insn "*andnot3_mask" [(set (match_operand:VI48_AVX512VL 0 "register_operand" "=v") (vec_merge:VI48_AVX512VL --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/pr100711-4.c @@ -0,0 +1,42 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512bw -mno-avx512vl -mprefer-vector-width=512 -O2" } */ + +typedef char v64qi __attribute__ ((vector_size (64))); +typedef short v32hi __attribute__ ((vector_size (64))); +typedef int v16si __attribute__ ((vector_size (64))); +typedef long long v8di __attribute__((vector_size (64))); + +v64qi foo_v64qi (char a, v64qi b) +{ + return (__extension__ (v64qi) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) | b; +} + +v32hi foo_v32hi (short a, v32hi b) +{ + return (__extension__ (v32hi) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) | b; +} + +v16si foo_v16si (int a, v16si b) +{ + return (__extension__ (v16si) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) | b; +} + +v8di foo_v8di (long long a, v8di b) +{ + return (__extension__ (v8di) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) | b; +} + +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0xbb" 4 { target { ! ia32 } } } } */ +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0xbb" 2 { target { ia32 } } } } */ +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0xdd" 2 { target { ia32 } } } } */ --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/pr100711-5.c @@ -0,0 +1,40 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512bw -mno-avx512vl -mprefer-vector-width=512 -O2" } */ + +typedef char v64qi __attribute__ ((vector_size (64))); +typedef short v32hi __attribute__ ((vector_size (64))); +typedef int v16si __attribute__ ((vector_size (64))); +typedef long long v8di __attribute__((vector_size (64))); + +v64qi foo_v64qi (char a, v64qi b) +{ + return (__extension__ (v64qi) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) ^ b; +} + +v32hi foo_v32hi (short a, v32hi b) +{ + return (__extension__ (v32hi) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) ^ b; +} + +v16si foo_v16si (int a, v16si b) +{ + return (__extension__ (v16si) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a, + ~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) ^ b; +} + +v8di foo_v8di (long long a, v8di b) +{ + return (__extension__ (v8di) {~a, ~a, ~a, ~a, ~a, ~a, ~a, ~a}) ^ b; +} + +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0x99" 4 } } */ From patchwork Wed Jun 21 06:28:12 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 1797665 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=8.43.85.97; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: legolas.ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=SRbJ6MWU; dkim-atps=neutral Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4QmD9n4QKcz20Wk for ; Wed, 21 Jun 2023 16:29:17 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 6F9183856DD0 for ; Wed, 21 Jun 2023 06:29:15 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 6F9183856DD0 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687328955; bh=GYy39gWtqi7euAFh1YQJAE89ZdTOyEFxanNO5R0P+A8=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=SRbJ6MWUfWQ8j+idrtBD/FiA/s6mA0IEEpG7Xi53JHhTl/iWtuGC0VYBnicMmQkeg fQucRDf4P4vL86c5Tk33AdXqvQHTmtwgmFDlDrqDtEGbbAcuO2eS3smN6EaUpJ4dG+ 2ivGA7UvV2T4cnXfh6rTK6d5Er4K4J94hs2kJSxs= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from EUR03-AM7-obe.outbound.protection.outlook.com (mail-am7eur03on2081.outbound.protection.outlook.com [40.107.105.81]) by sourceware.org (Postfix) with ESMTPS id 2986F3858D35 for ; Wed, 21 Jun 2023 06:28:16 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 2986F3858D35 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=H40kg1fKU0jAFqH+3OfzsV63MjxuoC2DcoJBEoW+ojYCMDo+nOyUETf5c7U8O9/bXwKfOD6RAiKe3p3AlQcJ4JekPhRb6Xf9oZ4ifJS2F3wEPSJtk7VcXst6Pm7lB0C49wSGAnZNfnEWO33QjlvuMFpGpnnHcQ/PGGdGdT7grQftZjwsI7r9++dpyosQ8WnvGFupeZKfh8PLHymeYsACY9eVPzFBipVruZfymcb9H7f0sSQquqi5hOrl8zuwZ9g19z1oMednzMyfF/r62aMBstptwWUm9oE1MSbgEaV8OWgezet+yp5l3ZwsD26MkHgsqbV+QF+V7CvqmS2Dq8zPQg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=GYy39gWtqi7euAFh1YQJAE89ZdTOyEFxanNO5R0P+A8=; b=EJ8vl0w+Ob1Vne9vtoaHxLdmPx867BvSLJZ1e5Zy2AOwpGIeWFfd6A5cP7X13dRAvpZpulE4Os2lqkoeVkNGJKbpB5XaqTrs3yEuBlFUZxhOosa4VEw71pbhU9kWws1LtEI2WwciHx5vNm5g40FAvrnJEy8bxeGlaJBcje8fps5Q7aNjtYsMiXyLm1pFLT5Pnd6dtehcka446iyg5vKYysR9bCC2fiZrUGv71IlXHiV8DhsUdY9zFLO270Btnq+oqkGRvYQ/6iDWN9Z0ei1UQlqHZoI0GGhoBEgqcUDcAGlYZqHNKP43vU8rHbE/ro+1KhDCUQcOYJka3F17zTRTow== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DUZPR04MB9982.eurprd04.prod.outlook.com (2603:10a6:10:4db::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6521.23; Wed, 21 Jun 2023 06:28:14 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.036; Wed, 21 Jun 2023 06:28:14 +0000 Message-ID: <0075f542-9dc0-33db-4cf9-cdd3ba502122@suse.com> Date: Wed, 21 Jun 2023 08:28:12 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 5/5] x86: yet more PR target/100711-like splitting Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Hongtao Liu , Kirill Yukhin References: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> In-Reply-To: <04f99abe-a563-d093-23b7-4abf0f91633d@suse.com> X-ClientProxiedBy: FR3P281CA0031.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:1c::9) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DUZPR04MB9982:EE_ X-MS-Office365-Filtering-Correlation-Id: 0e8ec0c8-b17e-4047-7435-08db7220b170 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: Eyv9Eu/kO+VZayFAc0yK1h94lynCVWiB0it2hwId6KHdteIpgPqsQ8s+u00wDq50UJuqk3R5cdIgBNrwCp9NQlQ+Ms4MJHYpsPIz6OCkz1Eq/fw3Lo3lkj8cWn2SprAj2xumy9BRAQQGVnt301qV1UhQKUMzk4YxiyPRv1ZgqjCkOk63aCf/3vJlKPxLOWfLvIk2eL6Aq6mgBrgHgvOPgpjI68xThDbHpoqpXaHDAJYzdT9i0zLPyb3TLQxwcLBlXCvERqqVwulaUXefA03+F/iXj6PqGfBw4FNkp6zqfXTjx6S0k4cE3uP1xSv8GBwz+pKFx9HUffJFzVo7FzMnCt5GxHMcKwEjhkZiy34jmgjywuSJT6xvUG6aTUAcHg2yDNOcPyQPNXz8pnanmlDq4PNuarcez18GOfAS4A3rOzwFj1NpWyDyI4BC1lT9RcE28R6RBmAgayaW8MY3zhKDdtPheLApN7A0dtPxTPt2EKHbP2jKMeHEMwVAzOfDPKEsqGl6Amg2OSxiTJW6+jItD3b2MVCwm0/eTGTq4iF07Yu6zqt/U8lTW4lMa/35SLMPJbZ/xLac0xJjMH0KNUjrYdReMFnC9Bq5aFEJiQm/zpVXVHubjrFWzQ+gcNu28Su3scc0tLUmoh1+2mZZXv2Y+g== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(366004)(39860400002)(376002)(136003)(346002)(396003)(451199021)(2616005)(84970400001)(6506007)(26005)(186003)(6512007)(66899021)(38100700002)(6486002)(478600001)(54906003)(31686004)(36756003)(31696002)(86362001)(66556008)(66946007)(66476007)(8936002)(8676002)(4326008)(316002)(6916009)(2906002)(5660300002)(41300700001)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?+JxGjokxKnm0msF0/U7lEnwJG1R3?= =?utf-8?q?s3aOc6+6OCxWn4QCLE7KOlOGClizEyhh1P7xfmS/y2BhJqoN4vnss0emodfMJ9KFE?= =?utf-8?q?CnjSaFsebMk+7spBxt5hWTif0Usx2USzgrTVsUWJcLmkeq7OZUQdFRJav6egK97bN?= =?utf-8?q?CvGMQskzvbDHIHhQPxt+T9BLlATLwaV9b1MD71EEIJ+4ATis+rRMWZH+bAnA8Ijv5?= =?utf-8?q?C1VZ4VapY1d4A536T5Sr1cX/1M7eDdiAjWwCykVwaJueElBC0K+hyQRcCGXUg/Zf5?= =?utf-8?q?HeveFMBLt1raGEeFXTuFRdvNMgzW+59Bdstgtq55WHHp1D/xo9u93m4t+mHfUKhPb?= =?utf-8?q?6b83TcSLQKbyiWYxbk9asTxegW6sVOt4Z+KZL3KDY6keBj5ZwagLipI+Bz3Nd5Utz?= =?utf-8?q?faRR+TjHKFvXpBvRmE/F7UbpAcdthDjuU9b1/GVLNmJ9OhnsIq1LnQYoiGiLBKvdt?= =?utf-8?q?MwmBjY0OTqjoqvn9OeVgaJrBlDZvvP715fCZgiFxpYv2i5ufjALBCV5GJuwF9ecIW?= =?utf-8?q?FVtt+bMTNamQvTBsUR3PEJJG8Qj54lLhoOeDvV6E7i9GREs++MMRHoH5Zc/UlBbck?= =?utf-8?q?Yd7tvKL08fNaEW6m9AUQFbbjzEnW1Z6Xo+3Z3ZcYSswKpAAiJlmF94yn0N4c+xRWY?= =?utf-8?q?aGPvveEpPi1+APyaYYsbqS5WXvCr35UJmsEVOB4ucg00QlSyTlhFtUwBbvQLDc3um?= =?utf-8?q?npXcOYxMxhgVqV86I7oF35qXL//mUr+k+Ivlw01sIp5rIboREoYHFjwiR/nT11ydH?= =?utf-8?q?sOYGZ+lmOJgec4i9yFX8YYjf1J4iSSo2b2qACTR3ShYCKVLB0njGgx7Gsbe0LSnsF?= =?utf-8?q?A51i4uJ3upmmxG9TG+kpREQcLFZqVgPhYop3dG/5WhvpQsGF2pzCqLkN2rcGxE3Hx?= =?utf-8?q?qdo9KE+ot768sAEtfxWWzkJr+eHotqxVvdNExByMcdZXhR4FFMoe/nslwHoZXW4Y1?= =?utf-8?q?u9k4QCXWNrdkDXs8BSDvhFojRhbcB47WaE9hGmmsbpUAmgvPKIrKfv5JszRSLTpN2?= =?utf-8?q?BkP+9RgTXzTsJp4Ox/RCJso9GHkHpnmb8V1Br1uwMB/E7/kHKRspgJGkC8hPIS8zk?= =?utf-8?q?E4AHXHNpeRd8hkxSWX58PschWgIdOxqc85YB8NhUqBrB3qOM78gzgP1d/3KXDlh8K?= =?utf-8?q?c8QQAGipP9f9gNun/yCdDwaSarhjBk9VWxGrcyl2nxULpcOcj3K6ahFAQUPv0/agC?= =?utf-8?q?yhZJpsMAgpsLfNjwr6HuPjiI4IM8J4GbbQooT7HkTOQP4SEKRFA2dKolqJFB41D7M?= =?utf-8?q?1dxmHB0K8mgQMOB7RHebTxFg/yzKWTFiwbrpw7qbDaR1FzPWtOg3tzyPRpip1NqFO?= =?utf-8?q?w76w94YKMtFv8g7X6uVELO8YqdhKi4pzyknkRDP4m5Qi7s5LxvKtLmxwlJOYRG5sS?= =?utf-8?q?RkBjmobwMJ5XVhD/xH9vZEkMmQ/zf/5v0II7K272MG+DtXM5d5aqYO9EPWNjTTo8z?= =?utf-8?q?X+wh7fNGAHQZe3Wom7VVAFe8W0ynmeWQp0XBbQzaAKv5Huk8siBgB+uWoaWrtWV1B?= =?utf-8?q?3ET1919Tj7IC?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 0e8ec0c8-b17e-4047-7435-08db7220b170 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 21 Jun 2023 06:28:14.5537 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: qrpXFCf6kGYK7LhyYickiJkqbjoPeMCpn4lX9jsfyIFJLtxfgXG9jfWRHVJ51++qCpH4udiBn200KNv3x0Yabw== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DUZPR04MB9982 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Gcc-patches From: Jan Beulich Reply-To: Jan Beulich Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" Following two-operand bitwise operations, add another splitter to also deal with not followed by broadcast all on its own, which can be expressed as simple embedded broadcast instead once a broadcast operand is actually permitted in the respective insn. While there also permit a broadcast operand in the corresponding expander. gcc/ * config/i386/sse.md: New splitters to simplify not;vec_duplicate as a singular vpternlog. (one_cmpl2): Allow broadcast for operand 1. (one_cmpl2): Likewise. gcc/testsuite/ * gcc.target/i386/pr100711-6.c: New test. --- For the purpose here (and elsewhere) bcst_vector_operand() (really: bcst_mem_operand()) isn't permissive enough: We'd want it to allow 128-bit and 256-bit types as well irrespective of AVX512VL being enabled. This would likely require a new predicate (bcst_intvec_operand()?) and a new constraint (BR? Bi?). (Yet for name selection it will want considering that this is applicable to certain non-calculational FP operations as well.) --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -17156,7 +17156,7 @@ (define_expand "one_cmpl2" [(set (match_operand:VI 0 "register_operand") - (xor:VI (match_operand:VI 1 "vector_operand") + (xor:VI (match_operand:VI 1 "bcst_vector_operand") (match_dup 2)))] "TARGET_SSE" { @@ -17168,7 +17168,7 @@ (define_insn "one_cmpl2" [(set (match_operand:VI 0 "register_operand" "=v,v") - (xor:VI (match_operand:VI 1 "nonimmediate_operand" "v,m") + (xor:VI (match_operand:VI 1 "bcst_vector_operand" "vBr,m") (match_operand:VI 2 "vector_all_ones_operand" "BC,BC")))] "TARGET_AVX512F && (! @@ -17191,6 +17191,19 @@ (symbol_ref " == 64 || TARGET_AVX512VL") (const_int 1)))]) +(define_split + [(set (match_operand:VI48_AVX512F 0 "register_operand") + (vec_duplicate:VI48_AVX512F + (not: + (match_operand: 1 "nonimmediate_operand"))))] + " == 64 || TARGET_AVX512VL + || (TARGET_AVX512F && !TARGET_PREFER_AVX256)" + [(set (match_dup 0) + (xor:VI48_AVX512F + (vec_duplicate:VI48_AVX512F (match_dup 1)) + (match_dup 2)))] + "operands[2] = CONSTM1_RTX (mode);") + (define_expand "_andnot3" [(set (match_operand:VI_AVX2 0 "register_operand") (and:VI_AVX2 --- /dev/null +++ b/gcc/testsuite/gcc.target/i386/pr100711-6.c @@ -0,0 +1,18 @@ +/* { dg-do compile } */ +/* { dg-options "-mavx512f -mno-avx512vl -mprefer-vector-width=512 -O2" } */ + +typedef int v16si __attribute__ ((vector_size (64))); +typedef long long v8di __attribute__((vector_size (64))); + +v16si foo_v16si (const int *a) +{ + return (__extension__ (v16si) {~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a, + ~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a}); +} + +v8di foo_v8di (const long long *a) +{ + return (__extension__ (v8di) {~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a, ~*a}); +} + +/* { dg-final { scan-assembler-times "vpternlog\[dq\]\[ \\t\]+\\\$0x55, \\(%(?:eax|rdi|edi)\\)\\\{1to\[1-8\]+\\\}" 2 } } */