[version,2] , Add support for _Float<N> and _Float<N>X sqrt, fma, fmin, fmax built-in functions

On Wed, Sep 13, 2017 at 10:49:43PM +0000, Joseph Myers wrote:
> On Wed, 13 Sep 2017, Michael Meissner wrote:
> 
> > This patch adds support on PowerPC ISA 3.0 for the built-in function
> > __builtin_sqrtf128 generating the XSSQRTQP hardware square root instruction and
> > the built-in function __builtin_fmaf128 generating XSMADDQP, XSMSUBQP,
> > XSNMADDQP, and XSNMSUBQP fused multiply-add instructions.
> 
> Is there a reason for these to be architecture-specific rather than 
> generic everywhere _Float128 is supported?  (With the fmaf128 / sqrtf128 
> names available as well as the __builtin_* variants of those.)

The basic reason was I hadn't yet discovered all of the places that need to be
modified to add generic _Float128 math functions.

> Full support for _FloatN/_FloatNx variants of all the existing built-in 
> functions might be complicated, and run into potential issues with startup 
> cost of creating large numbers of extra built-in functions (it's 
> desirable, but possibly hard, which is why I excluded it from the initial 
> _FloatN / _FloatNx support patches).  But adding just these two functions 
> to builtins.def and making them fold / expand appropriately ought to be 
> much simpler.  (I realise sqrt goes through internal-fn.def and 
> DEF_INTERNAL_FLT_FN expects a particular set of functions for standard 
> types, so maybe some duplication would be involved to get the built-in 
> function expanded appropriately, i.e. using an insn pattern or a call to 
> an external sqrtf128 function according to whether such an insn pattern is 
> available.  fma ought not to involve much more than adding an extra case 
> where CASE_FLT_FN (BUILT_IN_FMA) is used.)

I have now gone through and added the proper support for _Float128 sqrt, fma,
fmin, and fmax.  I have added the framework so that other functions as needed
can be added over time.

> > While I was at it, I changed the documentation so that it no longer documents
> > the 'q' built-in functions (to mirror libquadmath) but instead just documented
> > the 'f128' functions that matches glibc 2.26 and the technical report that
> > added the _FloatF128 date.
> 
> Those *f128 built-in functions (inf / huge_val / nan / nans / fabs / 
> copysign) are not target-specific; they exist for all _FloatN / _FloatNx 
> types for all targets with such types.  So it doesn't seem appropriate to 
> document them in a target-specific section of the manual, beyond a brief 
> cross-reference to the documentation of the functions as 
> target-independent.

Highlights of the patch:

    1)	I switched to use DEF_EXT_LIB_BUILTIN to declare the _Float<N> and
	_Float<N>X functions.  This allows treating __builtin_sqrtf128 the same
	as sqrtf128.

    2)	Add support in gencfn-macros.c to build the appropriate CASE_CFN_* and
	operators in cfn-operators.pd that can be used as needed.

    3)	I did not enable _Float128 support for all math built-ins, but just the
	built-in functions I am currently need to support in (just like
	copysign and fabs were previously done).  I expect over time there
	might be some more needed to be added to the list.  I added fmin and
	fmax to the machine independent built-ins, but I will submit a patch
	later to enable them in the PowerPC.

    4)	I went through and added support for copysign, fma, fmin, and fmax
	functions in the same places the current float/double/long double
	functions are handled.

    5)	I removed the PowerPC sqrtf128 and fmaf128 built-ins, since these are
	now handled by machine independent code.  In doing so, I deleted two
	tests that did not allow the built-ins where the software emulator is
	used.  The GLIBC 2.26 as shipped with the Advance Toolchain 11.0-1
	contain these functions.

    6)	In the previous version of the patch, I put in a special warning for
	fmaf128 (that it might not be present if the h/w instructions weren't
	available).  When I wrote that patch, the initial release of Advance
	Toolchain 11.0-0 did not include a fmaf128 function.  It now includes
	the function, so I don't need the warning.

I have checked this patch on the following systems with bootstrap and make
check for gcc/g++/gfortran/lto:

    1)	Little endian power8 system using --with-cpu=power8
    2)	Big endian power7 system (both 64/32-bit) using --with-cpu=power7
    3)	Little endian power9 prototype system using --with-cpu=power9
    4)	A Fedora 21 x86_64 system

Can I check these patches into the trunk?

[gcc]
2017-10-19  Michael Meissner  <meissner@linux.vnet.ibm.com>

	* builtins.c (CASE_MATHRN_FLOATN): New helper macro to support
	math functions that have _Float<N> and _Float<N>X variants.
	(mathfn_built_in_2): Add support for copysign, fma, fmax, fmin,
	and sqrt having _Float<N> and _Float<N>X variants.
	(DEF_INTERNAL_FLT_FLOATN_FN): New helper macro to support for math
	functions with _Float<N> and _Float<N>X variants.
	(expand_builtin_mathfn_ternary): Add fma _Float<N> and _Float<N>X
	support.
	(expand_builtin): Likewise.
	(fold_builtin_3): Likewise.
	* fold-const.c (tree_call_nonnegative_warnv_p): Add support for
	sqrt, fmax, fmin, and copysign with _Float<N> and _Float<N>X
	variants.
	(integer_valued_real_call_p): Likewise.
	* builtin-types.def (BT_FN_FLOAT16_FLOAT16_FLOAT16_FLOAT16): New
	function signatures for fma _Float<N> and _Float<N>X variants.
	(BT_FN_FLOAT32_FLOAT32_FLOAT32_FLOAT32): Likewise.
	(BT_FN_FLOAT64_FLOAT64_FLOAT64_FLOAT64): Likewise.
	(BT_FN_FLOAT128_FLOAT128_FLOAT128_FLOAT128): Likewise.
	(BT_FN_FLOAT32X_FLOAT32X_FLOAT32X_FLOAT32X): Likewise.
	(BT_FN_FLOAT64X_FLOAT64X_FLOAT64X_FLOAT64X): Likewise.
	(BT_FN_FLOAT128X_FLOAT128X_FLOAT128X_FLOAT128X): Likewise.
	* builtins.def (DEF_GCC_FLOATN_NX_BUILTINS): Use
	DEF_EXT_LIB_BUILTIN instead of DEF_GCC_BUILTIN, so that
	sqrtf128 is normally processed to be __builtin_sqrtf128.
	(BUILT_IN_FMA): Define _Float<N> and _Float<N>X variants.
	(BUILT_IN_FMAX): Likewise.
	(BUILT_IN_FMIN): Likewise.
	(BUILT_IN_SQRT): Likewise.
	* tree-call-cdce.c (can_test_argument_range): Add support for sqrt
	_Float<N> and _Float<N>X variants.
	(edom_only_function): Likewise.
	(get_no_error_domain): Likewise.
	* tree-ssa-math-opts.c (gimple_call_combined_fn): Likewise.
	* fold-const-call.c (fold_const_call_ss): Likewise.
	(fold_const_call_sss): Add support for copysign, fmin, and fmax
	_Float<N> and _Float<N>X variants.
	(fold_const_call_ssss): Add support for fma _Float<N> and
	_Float<N>X variants.
	* internal-fn.def (DEF_INTERNAL_FLT_FLOATN_FN): New helper macro
	for math functions that have _Float<N> and _Float<N>X variants.
	(SQRT): Add support for sqrt, copysign, fmin and fmax _Float<N>
	and _Float<N>X variants.
	(COPYSIGN): Likewise.
	(FMIN): Likewise.
	(FMAX): Likewise.
	* gencfn-macros.c (print_case_cfn): Add support for math functions
	that have _Float<N> and _Float<N>X variants.
	(print_define_operator_list): Likewise.
	(fltfn_suffixes): Likewise.
	(main): Likewise.
	* tree-ssa-reassoc.c (attempt_builtin_copysign): Add support for
	copysign with _Float<N> and _Float<N>X variants.
	* gimple-ssa-backprop.c (backprop::process_builtin_call_use): Add
	support for copysign and fma with _Float<N> and _Float<N>X
	variants.
	* config/rs6000/rs6000-builtin.def (SQRTF128): Delete rs6000
	sqrtf128 and fmaf128 builtins, as this is handled by machine
	independent code.
	(FMAF128): Likewise.

[gcc/c-family]
2017-10-19  Michael Meissner  <meissner@linux.vnet.ibm.com>

	* c-cppbuiltin.c (mode_has_fma): Add support for PowerPC fmakf3
	for float128 fma when long double is not __float128.
	(c_cpp_builtins): Define __FP_FAST_FMAF<N> and __FP_FAST_FMA<N>X
	if the _Float<N> and _Float<N>X variants for fma exist.

[gcc/c]
2017-10-19  Michael Meissner  <meissner@linux.vnet.ibm.com>

	* c-decl.c (header_for_builtin_fn): Add support for fma with
	_Float<N> and _Float<N>X variants.

[gcc/testsuite]
2017-10-19  Michael Meissner  <meissner@linux.vnet.ibm.com>

	* gcc.target/powerpc/float128-fma2.c: Delete, test is no longer
	relavant now that machine independent code handles sqrt and fma
	_Float<N> and _Float<N>X variants.
	* gcc.target/powerpc/float128-sqrt2.c: Likewise.
	* gcc.target/powerpc/float128-hw.c: Add more tests for FMA
	variants.  Test code generated to convert __float128 to float.
	* gcc.target/powerpc/float128-hw2.c: New test for machine
	independent handling of copysignf128, sqrtf128, and fmaf128.

Message ID	20171019220831.GA27658@ibm-tiger.the-meissners.org
State	New
Headers	show Return-Path: <gcc-patches-return-464577-incoming=patchwork.ozlabs.org@gcc.gnu.org> X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Authentication-Results: ozlabs.org; spf=pass (mailfrom) smtp.mailfrom=gcc.gnu.org (client-ip=209.132.180.131; helo=sourceware.org; envelope-from=gcc-patches-return-464577-incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=<UNKNOWN>) Authentication-Results: ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.b="PiyoKQbL"; dkim-atps=neutral Received: from sourceware.org (server1.sourceware.org [209.132.180.131]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 3yJ3411cY1z9t6M for <incoming@patchwork.ozlabs.org>; Fri, 20 Oct 2017 09:09:03 +1100 (AEDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:date :from:to:cc:subject:references:mime-version:content-type :in-reply-to:message-id; q=dns; s=default; b=yOFzYUB9kW2hjgamq4d 8mMPLGVIoBm3RAD3bEjcom+7gOoJBrypoRAxlpQmv3YVm9RSB1Wp6yQtyltwF2uS uIq+99AQnJXwewdKiAaJYe+ZTgExAR7IdFeQCdnzzUqFbffDITOnU+/2B4pRr5b9 LoIgqdfLrpG6FXAXxiXq4X+o= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:date :from:to:cc:subject:references:mime-version:content-type :in-reply-to:message-id; s=default; bh=/4ugN0IW4WzGOqzt+r6XOyvYn 1g=; b=PiyoKQbLifZSVUvhrM8gpseXsOU8xnwK8FIRTA0MGceyqll7qfaPyfba0 dueDKdnmReoYf8N3ZJo+W2GZaBUIEvBBKF5bNCj2TZmHiPJGvqWU61clJFJLSL0A HHAhZZLngfby+DA3Mu00EEbP00mcjkHJruoxboX74RFQ367I/4= Received: (qmail 51640 invoked by alias); 19 Oct 2017 22:08:50 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: <gcc-patches.gcc.gnu.org> List-Unsubscribe: <mailto:gcc-patches-unsubscribe-incoming=patchwork.ozlabs.org@gcc.gnu.org> List-Archive: <http://gcc.gnu.org/ml/gcc-patches/> List-Post: <mailto:gcc-patches@gcc.gnu.org> List-Help: <mailto:gcc-patches-help@gcc.gnu.org> Sender: gcc-patches-owner@gcc.gnu.org Delivered-To: mailing list gcc-patches@gcc.gnu.org Received: (qmail 49938 invoked by uid 89); 19 Oct 2017 22:08:46 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-10.0 required=5.0 tests=AWL, BAYES_00, GIT_PATCH_2, GIT_PATCH_3, KAM_ASCII_DIVIDERS, KAM_LAZY_DOMAIN_SECURITY, RCVD_IN_DNSWL_LOW autolearn=ham version=3.3.2 spammy=scales, SINH, sk:float16 X-HELO: mx0a-001b2d01.pphosted.com Received: from mx0a-001b2d01.pphosted.com (HELO mx0a-001b2d01.pphosted.com) (148.163.156.1) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Thu, 19 Oct 2017 22:08:41 +0000 Received: from pps.filterd (m0098393.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.21/8.16.0.21) with SMTP id v9JM7lip003845 for <gcc-patches@gcc.gnu.org>; Thu, 19 Oct 2017 18:08:37 -0400 Received: from e36.co.us.ibm.com (e36.co.us.ibm.com [32.97.110.154]) by mx0a-001b2d01.pphosted.com with ESMTP id 2dpxrfh0nr-1 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=NOT) for <gcc-patches@gcc.gnu.org>; Thu, 19 Oct 2017 18:08:37 -0400 Received: from localhost by e36.co.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for <gcc-patches@gcc.gnu.org> from <meissner@ibm-tiger.the-meissners.org>; Thu, 19 Oct 2017 16:08:36 -0600 Received: from b03cxnp07029.gho.boulder.ibm.com (9.17.130.16) by e36.co.us.ibm.com (192.168.1.136) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; Thu, 19 Oct 2017 16:08:33 -0600 Received: from b03ledav004.gho.boulder.ibm.com (b03ledav004.gho.boulder.ibm.com [9.17.130.235]) by b03cxnp07029.gho.boulder.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id v9JM8WTn7537038; Thu, 19 Oct 2017 15:08:32 -0700 Received: from b03ledav004.gho.boulder.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 7784278037; Thu, 19 Oct 2017 16:08:32 -0600 (MDT) Received: from ibm-tiger.the-meissners.org (unknown [9.32.77.111]) by b03ledav004.gho.boulder.ibm.com (Postfix) with ESMTP id 44E2978038; Thu, 19 Oct 2017 16:08:32 -0600 (MDT) Received: by ibm-tiger.the-meissners.org (Postfix, from userid 500) id 7EBAC483D4; Thu, 19 Oct 2017 18:08:31 -0400 (EDT) Date: Thu, 19 Oct 2017 18:08:31 -0400 From: Michael Meissner <meissner@linux.vnet.ibm.com> To: Joseph Myers <joseph@codesourcery.com>, GCC Patches <gcc-patches@gcc.gnu.org>, Segher Boessenkool <segher@kernel.crashing.org>, David Edelsohn <dje.gcc@gmail.com>, Bill Schmidt <wschmidt@linux.vnet.ibm.com> Cc: Michael Meissner <meissner@linux.vnet.ibm.com> Subject: [PATCH, version 2], Add support for _Float<N> and _Float<N>X sqrt, fma, fmin, fmax built-in functions Mail-Followup-To: Michael Meissner <meissner@linux.vnet.ibm.com>, Joseph Myers <joseph@codesourcery.com>, GCC Patches <gcc-patches@gcc.gnu.org>, Segher Boessenkool <segher@kernel.crashing.org>, David Edelsohn <dje.gcc@gmail.com>, Bill Schmidt <wschmidt@linux.vnet.ibm.com> References: <20170913214600.GA24598@ibm-tiger.the-meissners.org> <alpine.DEB.2.20.1709132239500.28319@digraph.polyomino.org.uk> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="pWyiEgJYm5f9v55/" Content-Disposition: inline In-Reply-To: <alpine.DEB.2.20.1709132239500.28319@digraph.polyomino.org.uk> User-Agent: Mutt/1.5.20 (2009-12-10) X-TM-AS-GCONF: 00 x-cbid: 17101922-0020-0000-0000-00000CDFB4B4 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00007920; HX=3.00000241; KW=3.00000007; PH=3.00000004; SC=3.00000238; SDB=6.00933562; UDB=6.00470232; IPR=6.00713850; BA=6.00005651; NDR=6.00000001; ZLA=6.00000005; ZF=6.00000009; ZB=6.00000000; ZP=6.00000000; ZH=6.00000000; ZU=6.00000002; MB=3.00017613; XFM=3.00000015; UTC=2017-10-19 22:08:34 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 17101922-0021-0000-0000-00005E94FDAB Message-Id: <20171019220831.GA27658@ibm-tiger.the-meissners.org> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:, , definitions=2017-10-19_11:, , signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1707230000 definitions=main-1710190301 X-IsSubscribed: yes
Series	[version,2] , Add support for _Float<N> and _Float<N>X sqrt, fma, fmin, fmax built-in functions \| expand [version,2] , Add support for _Float<N> and _Float<N>X sqrt, fma, fmin, fmax built-in functions

[version,2] , Add support for _Float<N> and _Float<N>X sqrt, fma, fmin, fmax built-in functions

Commit Message

Comments

Patch