From patchwork Thu Feb 8 17:31:25 2018
X-Patchwork-Submitter: Alex Bennée
X-Patchwork-Id: 870996
From: Alex Bennée
To: qemu-arm@nongnu.org
Date: Thu, 8 Feb 2018 17:31:25 +0000
Message-Id: <20180208173157.24705-1-alex.bennee@linaro.org>
X-Mailer: git-send-email 2.15.1
Subject: [Qemu-devel] [PATCH v2 00/32] Add ARMv8.2 half-precision functions
Cc: Alex Bennée, qemu-devel@nongnu.org

Hi,

Some of this was posted before as part of the various partial patch
series when we first started messing around with FP16 in softfloat.
This series is now just the ARM bits and expects the v4 softfloat
patches as a prerequisite:

  https://lists.nongnu.org/archive/html/qemu-devel/2018-02/msg01330.html

Alternatively you can grab the full tree from:

  https://github.com/stsquad/qemu/tree/arm-fp16-v2

I've tested with the following RISU test binaries:

  http://people.linaro.org/~alex.bennee/testcases/arm64.risu/testcases.armv8.2_hp.tar.xz

And of course I ran the original RISU tests with:

  -cpu any,fp16=off

But I guess we really just need to carefully regenerate the testcases
so they no longer include UNDEFs that become valid instructions in
later revisions of the specification.

Anyway, please review.

Alex Bennée (32):
  include/exec/helper-head.h: support f16 in helper calls
  target/arm/cpu64: introduce ARM_V8_FP16 feature bit
  target/arm/cpu64: allow fp16 to be disabled
  target/arm/cpu.h: update comment for half-precision values
  target/arm/cpu.h: add additional float_status flags
  target/arm/helper: pass explicit fpst to set_rmode
  arm/translate-a64: implement half-precision F(MIN|MAX)(V|NMV)
  arm/translate-a64: handle_3same_64 comment fix
  arm/translate-a64: initial decode for simd_three_reg_same_fp16
  arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to
    simd_three_reg_same_fp16
  arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to
    simd_three_reg_same_fp16
  arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16
  arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16
  arm/translate-a64: add FP16 pairwise ops simd_three_reg_same_fp16
  arm/translate-a64: add FP16 FMULX/MLS/FMLA to simd_indexed
  arm/translate-a64: add FP16 x2 ops for simd_indexed
  arm/translate-a64: initial decode for simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16
  arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 FCMxx (zero) to simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 FNEG/FABS to simd_two_reg_misc_fp16
  arm/helper.c: re-factor recpe and add recepe_f16
  arm/translate-a64: add FP16 FRECPE
  arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16
  arm/helper.c: re-factor rsqrte and add rsqrte_f16
  arm/translate-a64: add FP16 FRSQRTE to simd_two_reg_misc_fp16
  arm/translate-a64: add FP16 FMOV to simd_mod_imm
  arm/translate-a64: add all FP16 ops in simd_scalar_pairwise
  arm/translate-a64: implement simd_scalar_three_reg_same_fp16
  arm/translate-a64: add all single op FP16 to handle_fp_1src_half

 include/exec/helper-head.h |    3 +
 target/arm/cpu.h           |   24 +-
 target/arm/cpu64.c         |   28 +
 target/arm/helper-a64.c    |  274 ++++++++++
 target/arm/helper-a64.h    |   34 ++
 target/arm/helper.c        |  466 +++++++++--------
 target/arm/helper.h        |   14 +-
 target/arm/translate-a64.c | 1229 +++++++++++++++++++++++++++++++++++++-------
 target/arm/translate.c     |   12 +-
 9 files changed, 1676 insertions(+), 408 deletions(-)
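
For anyone reviewing who is less familiar with the data format: the
half-precision values these instructions operate on are, as far as I
understand it, IEEE 754 binary16 (1 sign bit, 5 exponent bits, 10
fraction bits). The snippet below is only an illustrative sketch of how
such a bit pattern maps to a value. It assumes the IEEE layout rather
than ARM's alternative half-precision format, and half_to_float is a
made-up name for this example, not a helper from this series or from
softfloat.

```c
#include <math.h>
#include <stdint.h>

/*
 * Decode an IEEE 754 binary16 bit pattern into a float.
 * Hypothetical illustration only; not QEMU or softfloat code.
 */
float half_to_float(uint16_t h)
{
    int sign = (h >> 15) & 0x1;   /* bit 15 */
    int exp  = (h >> 10) & 0x1f;  /* bits 14..10, bias 15 */
    int frac = h & 0x3ff;         /* bits 9..0 */
    float value;

    if (exp == 0) {
        /* Zero or subnormal: frac * 2^-24 (no implicit leading 1). */
        value = ldexpf((float)frac, -24);
    } else if (exp == 0x1f) {
        /* All-ones exponent: infinity if frac is zero, else NaN. */
        value = frac ? NAN : INFINITY;
    } else {
        /* Normal: (1024 + frac) * 2^(exp - 15 - 10). */
        value = ldexpf((float)(frac | 0x400), exp - 25);
    }

    return sign ? -value : value;
}
```

For example, half_to_float(0x3c00) gives 1.0 and half_to_float(0x7bff)
gives 65504, the largest finite half-precision value. Every binary16
value is exactly representable as a float, so this direction needs no
rounding; the reverse conversion does, which is one reason the real
work lives in softfloat.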