From patchwork Thu Jun 17 09:18:14 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Marius Hillenbrand X-Patchwork-Id: 1493283 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Authentication-Results: ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=8.43.85.97; helo=sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=) Authentication-Results: ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256 header.s=default header.b=u2h1vZJI; dkim-atps=neutral Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 4G5Gh358SVz9s1l for ; Thu, 17 Jun 2021 19:18:43 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 7313F388E80C for ; Thu, 17 Jun 2021 09:18:41 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 7313F388E80C DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1623921521; bh=mz2qHyS0NuMzZ57sikO+8AAKWctcldOt6otEt1oZl4Q=; h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:Cc:From; b=u2h1vZJImjGkQBq2F+jXTSgCTUCd2bMGG1IpZ9LlPD1CB5KfyBRvYVGR2LL1g6Dmb oEalP8VmBcDoIomHHbZ/2/vnKibG/NS80DzXKqg8N55b5BBJJYZQA4lLv/hcHtJ7NX HiuS0dzA2jhK8IvywZDkO3QWeXAz4+3UPoud3FT0= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from oc3353602072.ibm.com (070-221-018-212.ip-addr.vsenet.de [212.18.221.70]) by sourceware.org (Postfix) with ESMTP id 25343386EC51 for ; Thu, 17 Jun 2021 09:18:21 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 25343386EC51 Received: by oc3353602072.ibm.com (Postfix, from userid 1000) id AB4A8176239F; Thu, 17 Jun 2021 11:18:19 +0200 (CEST) To: Andreas Krebbel Subject: [COMMITTED] IBM Z: Fix vector intrinsics vec_double and vec_floate Date: Thu, 17 Jun 2021 11:18:14 +0200 Message-Id: <20210617091814.15698-1-mhillen@linux.ibm.com> X-Mailer: git-send-email 2.26.2 MIME-Version: 1.0 X-Spam-Status: No, score=-4.7 required=5.0 tests=BAYES_00, GIT_PATCH_0, KAM_DMARC_STATUS, KAM_LAZY_DOMAIN_SECURITY, KHOP_HELO_FCRDNS, RDNS_DYNAMIC, SPF_HELO_NONE, SPF_NONE, TXREP, UNWANTED_LANGUAGE_BODY autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Marius Hillenbrand via Gcc-patches From: Marius Hillenbrand Reply-To: Marius Hillenbrand Cc: gcc-patches@gcc.gnu.org Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" Discussed offline with Andreas Krebbel. Fix the mapping of vec_double and vec_floate to builtins. gcc/ChangeLog: PR target/100871 * config/s390/vecintrin.h (vec_doublee): Fix to use __builtin_s390_vflls. (vec_floate): Fix to use __builtin_s390_vflrd. gcc/testsuite/ChangeLog: * gcc.target/s390/zvector/vec-doublee.c: New test. * gcc.target/s390/zvector/vec-floate.c: New test. --- gcc/config/s390/vecintrin.h | 4 ++-- .../gcc.target/s390/zvector/vec-doublee.c | 22 ++++++++++++++++++ .../gcc.target/s390/zvector/vec-floate.c | 23 +++++++++++++++++++ 3 files changed, 47 insertions(+), 2 deletions(-) create mode 100644 gcc/testsuite/gcc.target/s390/zvector/vec-doublee.c create mode 100644 gcc/testsuite/gcc.target/s390/zvector/vec-floate.c diff --git a/gcc/config/s390/vecintrin.h b/gcc/config/s390/vecintrin.h index 6bd26f8a240..9a3f7c347ac 100644 --- a/gcc/config/s390/vecintrin.h +++ b/gcc/config/s390/vecintrin.h @@ -109,8 +109,8 @@ __lcbb(const void *ptr, int bndry) #define vec_rint(X) __builtin_s390_vfi((X), 0, 0) #define vec_roundc(X) __builtin_s390_vfi((X), 4, 0) #define vec_round(X) __builtin_s390_vfi((X), 4, 4) -#define vec_doublee(X) __builtin_s390_vfll((X)) -#define vec_floate(X) __builtin_s390_vflr((X), 0, 0) +#define vec_doublee(X) __builtin_s390_vflls((X)) +#define vec_floate(X) __builtin_s390_vflrd((X), 0, 0) #define vec_load_len_r(X,L) \ (__vector unsigned char)__builtin_s390_vlrlr((L),(X)) #define vec_store_len_r(X,Y,L) \ diff --git a/gcc/testsuite/gcc.target/s390/zvector/vec-doublee.c b/gcc/testsuite/gcc.target/s390/zvector/vec-doublee.c new file mode 100644 index 00000000000..11610f225e5 --- /dev/null +++ b/gcc/testsuite/gcc.target/s390/zvector/vec-doublee.c @@ -0,0 +1,22 @@ +/* { dg-do compile } */ +/* { dg-options "-O3 -march=z14 -mzarch -mzvector --save-temps" } */ +/* { dg-do run { target { s390_z14_hw } } } */ + +/* + * The vector intrinsic vec_doublee(a) converts the even-indexed + * single-precision numbers in a vector to double precision. + */ +#include +#include + +int +main (void) +{ + vector float in = { 1.0, 2.0, 3.0, 4.0 }; + + vector double result = vec_doublee(in); + /* { dg-final { scan-assembler-times {\n\tvldeb} 1 } } */ + + assert(result[0] == (double)in[0]); + assert(result[1] == (double)in[2]); +} diff --git a/gcc/testsuite/gcc.target/s390/zvector/vec-floate.c b/gcc/testsuite/gcc.target/s390/zvector/vec-floate.c new file mode 100644 index 00000000000..0b9cbe3d89e --- /dev/null +++ b/gcc/testsuite/gcc.target/s390/zvector/vec-floate.c @@ -0,0 +1,23 @@ +/* { dg-do compile } */ +/* { dg-options "-O3 -march=z14 -mzarch -mzvector --save-temps" } */ +/* { dg-do run { target { s390_z14_hw } } } */ + +/* + * The vector intrinsic vec_floate(a) rounds a vector of double-precision + * numbers to single-precision. The results are stored in the even-numbered + * target elements. + */ +#include +#include + +int +main (void) +{ + vector double in = { 1.0, 2.0 }; + + vector float result = vec_floate(in); + /* { dg-final { scan-assembler-times {\n\tvledb} 1 } } */ + + assert(result[0] == (float)in[0]); + assert(result[2] == (float)in[1]); +}