From patchwork Fri Apr 26 13:28:08 2013 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: James Greenhalgh X-Patchwork-Id: 239874 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Received: from sourceware.org (server1.sourceware.org [209.132.180.131]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client CN "localhost", Issuer "www.qmailtoaster.com" (not verified)) by ozlabs.org (Postfix) with ESMTPS id AE38E2C00C0 for ; Fri, 26 Apr 2013 23:28:23 +1000 (EST) DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:from :to:cc:subject:date:message-id:mime-version:content-type; q=dns; s=default; b=l13/4ldZnjXSBNw+GnZ/Dnu8VCRbsa0t37xDYatXH5fAU/+jAs qGtcTsvrPFWOsAhCoDQE6w6BFdk00it8kmFxLGXiAqp53OU5J23j86btslo5uNjq HMvDUSPndbO6d3ipxxRFogkrZQQgol2YXGbJJyQRjkRmIWplJ2jSf/Aa8= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:from :to:cc:subject:date:message-id:mime-version:content-type; s= default; bh=7RVhX5UZbeHQmHuLrx0ElX5FYLs=; b=n15+hvo3c9pxVIMtwc4r d3MASr9+AD8pzosf1KNSGDP2J0qzx8aylQYzwgPsyDrQlbuM215wohNOsHh6wcgI Va8L05S992FfwqAJCfHv1st8CDfQE6z0TzQzyMYjQsZXAWoz77x92KysNkMHNlXv cHtB6Pm2g2itN2jrI6rGZeU= Received: (qmail 2231 invoked by alias); 26 Apr 2013 13:28:17 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Delivered-To: mailing list gcc-patches@gcc.gnu.org Received: (qmail 2221 invoked by uid 89); 26 Apr 2013 13:28:17 -0000 X-Spam-SWARE-Status: No, score=-2.6 required=5.0 tests=AWL, BAYES_00, RCVD_IN_DNSWL_LOW, SPF_PASS autolearn=ham version=3.3.1 Received: from service87.mimecast.com (HELO service87.mimecast.com) (91.220.42.44) by sourceware.org (qpsmtpd/0.84/v0.84-167-ge50287c) with ESMTP; Fri, 26 Apr 2013 13:28:16 +0000 Received: from cam-owa2.Emea.Arm.com (fw-tnat.cambridge.arm.com [217.140.96.21]) by service87.mimecast.com; Fri, 26 Apr 2013 14:28:14 +0100 Received: from e106375-lin.cambridge.arm.com ([10.1.255.212]) by cam-owa2.Emea.Arm.com with Microsoft SMTPSVC(6.0.3790.3959); Fri, 26 Apr 2013 14:28:13 +0100 From: James Greenhalgh To: gcc-patches@gcc.gnu.org Cc: marcus.shawcroft@arm.com Subject: [AArch64] Vectorize over more math.h functions. Date: Fri, 26 Apr 2013 14:28:08 +0100 Message-Id: <1366982888-29096-1-git-send-email-james.greenhalgh@arm.com> MIME-Version: 1.0 X-MC-Unique: 113042614281401001 X-Virus-Found: No Hi, This patch adds float -> int builtins to the set of builtins we can try to vectorize in aarch64_builtin_vectorized_function. In particular, we add BUILT_IN_IFLOORF, BUILT_IN_ICEILF, BUILT_IN_LROUND, BUILT_IN_IROUNDF. The BUILT_IN_LROUND cases won't be triggered unless -ffast-math or something else which turns off inexact errors is enabled. Regression tested for aarch64-none-elf with no regressions. Thanks, James --- gcc/ 2013-04-26 James Greenhalgh * config/aarch64/aarch64-builtins.c (aarch64_builtin_vectorized_function): Vectorize over ifloorf, iceilf, lround, iroundf. diff --git a/gcc/config/aarch64/aarch64-builtins.c b/gcc/config/aarch64/aarch64-builtins.c index d2e5136..53d2c6a 100644 --- a/gcc/config/aarch64/aarch64-builtins.c +++ b/gcc/config/aarch64/aarch64-builtins.c @@ -1245,6 +1245,7 @@ aarch64_builtin_vectorized_function (tree fndecl, tree type_out, tree type_in) (out_mode == N##Imode && out_n == C \ && in_mode == N##Fmode && in_n == C) case BUILT_IN_LFLOOR: + case BUILT_IN_IFLOORF: { tree new_tree = NULL_TREE; if (AARCH64_CHECK_BUILTIN_MODE (2, D)) @@ -1259,6 +1260,7 @@ aarch64_builtin_vectorized_function (tree fndecl, tree type_out, tree type_in) return new_tree; } case BUILT_IN_LCEIL: + case BUILT_IN_ICEILF: { tree new_tree = NULL_TREE; if (AARCH64_CHECK_BUILTIN_MODE (2, D)) @@ -1272,6 +1274,22 @@ aarch64_builtin_vectorized_function (tree fndecl, tree type_out, tree type_in) aarch64_builtin_decls[AARCH64_SIMD_BUILTIN_lceilv2sfv2si]; return new_tree; } + case BUILT_IN_LROUND: + case BUILT_IN_IROUNDF: + { + tree new_tree = NULL_TREE; + if (AARCH64_CHECK_BUILTIN_MODE (2, D)) + new_tree = + aarch64_builtin_decls[AARCH64_SIMD_BUILTIN_lroundv2dfv2di]; + else if (AARCH64_CHECK_BUILTIN_MODE (4, S)) + new_tree = + aarch64_builtin_decls[AARCH64_SIMD_BUILTIN_lroundv4sfv4si]; + else if (AARCH64_CHECK_BUILTIN_MODE (2, S)) + new_tree = + aarch64_builtin_decls[AARCH64_SIMD_BUILTIN_lroundv2sfv2si]; + return new_tree; + } + default: return NULL_TREE; }