[{"id":1767971,"web_url":"http://patchwork.ozlabs.org/comment/1767971/","msgid":"<87mv5ya9qb.fsf@linaro.org>","list_archive_url":null,"date":"2017-09-13T15:39:40","subject":"Re: [PATCH v2 07/28] arm64/sve: Low-level SVE architectural state\n\tmanipulation functions","submitter":{"id":39532,"url":"http://patchwork.ozlabs.org/api/people/39532/","name":"Alex Bennée","email":"alex.bennee@linaro.org"},"content":"Dave Martin <Dave.Martin@arm.com> writes:\n\n> Manipulating the SVE architectural state, including the vector and\n> predicate registers, first-fault register and the vector length,\n> requires the use of dedicated instructions added by SVE.\n>\n> This patch adds suitable assembly functions for saving and\n> restoring the SVE registers and querying the vector length.\n> Setting of the vector length is done as part of register restore.\n>\n> Since people building kernels may not all get an SVE-enabled\n> toolchain for a while, this patch uses macros that generate\n> explicit opcodes in place of assembler mnemonics.\n>\n> Signed-off-by: Dave Martin <Dave.Martin@arm.com>\n> Cc: Alex Bennée <alex.bennee@linaro.org>\n\nIt took me a while to find a way to properly dissemble the resulting\nbinaries, in the end needing to run a native objdump in Stretch. I'd\nhopped my gdb-multiarch was bleeding edge enough but no ;-)\n\nReviewed-by: Alex Bennée <alex.bennee@linaro.org>\n\n>\n> ---\n>\n> Changes since v1\n> ----------------\n>\n> Requested by Alex Bennée:\n>\n> * Annotate instruction generation macros with the canonical\n> architectural syntax so that people can cross-reference more easily\n> against the architectural documentation.\n> ---\n>  arch/arm64/include/asm/fpsimd.h       |   5 ++\n>  arch/arm64/include/asm/fpsimdmacros.h | 148 ++++++++++++++++++++++++++++++++++\n>  arch/arm64/kernel/entry-fpsimd.S      |  17 ++++\n>  3 files changed, 170 insertions(+)\n>\n> diff --git a/arch/arm64/include/asm/fpsimd.h b/arch/arm64/include/asm/fpsimd.h\n> index 410c481..026a7c7 100644\n> --- a/arch/arm64/include/asm/fpsimd.h\n> +++ b/arch/arm64/include/asm/fpsimd.h\n> @@ -67,6 +67,11 @@ extern void fpsimd_update_current_state(struct fpsimd_state *state);\n>\n>  extern void fpsimd_flush_task_state(struct task_struct *target);\n>\n> +extern void sve_save_state(void *state, u32 *pfpsr);\n> +extern void sve_load_state(void const *state, u32 const *pfpsr,\n> +\t\t\t   unsigned long vq_minus_1);\n> +extern unsigned int sve_get_vl(void);\n> +\n>  /* For use by EFI runtime services calls only */\n>  extern void __efi_fpsimd_begin(void);\n>  extern void __efi_fpsimd_end(void);\n> diff --git a/arch/arm64/include/asm/fpsimdmacros.h b/arch/arm64/include/asm/fpsimdmacros.h\n> index 0f5fdd3..e050d76 100644\n> --- a/arch/arm64/include/asm/fpsimdmacros.h\n> +++ b/arch/arm64/include/asm/fpsimdmacros.h\n> @@ -75,3 +75,151 @@\n>  \tldr\tw\\tmpnr, [\\state, #16 * 2 + 4]\n>  \tfpsimd_restore_fpcr x\\tmpnr, \\state\n>  .endm\n> +\n> +/* Sanity-check macros to help avoid encoding garbage instructions */\n> +\n> +.macro _check_general_reg nr\n> +\t.if (\\nr) < 0 || (\\nr) > 30\n> +\t\t.error \"Bad register number \\nr.\"\n> +\t.endif\n> +.endm\n> +\n> +.macro _sve_check_zreg znr\n> +\t.if (\\znr) < 0 || (\\znr) > 31\n> +\t\t.error \"Bad Scalable Vector Extension vector register number \\znr.\"\n> +\t.endif\n> +.endm\n> +\n> +.macro _sve_check_preg pnr\n> +\t.if (\\pnr) < 0 || (\\pnr) > 15\n> +\t\t.error \"Bad Scalable Vector Extension predicate register number \\pnr.\"\n> +\t.endif\n> +.endm\n> +\n> +.macro _check_num n, min, max\n> +\t.if (\\n) < (\\min) || (\\n) > (\\max)\n> +\t\t.error \"Number \\n out of range [\\min,\\max]\"\n> +\t.endif\n> +.endm\n> +\n> +/* SVE instruction encodings for non-SVE-capable assemblers */\n> +\n> +/* STR (vector): STR Z\\nz, [X\\nxbase, #\\offset, MUL VL] */\n> +.macro _sve_str_v nz, nxbase, offset=0\n> +\t_sve_check_zreg \\nz\n> +\t_check_general_reg \\nxbase\n> +\t_check_num (\\offset), -0x100, 0xff\n> +\t.inst\t0xe5804000\t\t\t\\\n> +\t\t| (\\nz)\t\t\t\t\\\n> +\t\t| ((\\nxbase) << 5)\t\t\\\n> +\t\t| (((\\offset) & 7) << 10)\t\\\n> +\t\t| (((\\offset) & 0x1f8) << 13)\n> +.endm\n> +\n> +/* LDR (vector): LDR Z\\nz, [X\\nxbase, #\\offset, MUL VL] */\n> +.macro _sve_ldr_v nz, nxbase, offset=0\n> +\t_sve_check_zreg \\nz\n> +\t_check_general_reg \\nxbase\n> +\t_check_num (\\offset), -0x100, 0xff\n> +\t.inst\t0x85804000\t\t\t\\\n> +\t\t| (\\nz)\t\t\t\t\\\n> +\t\t| ((\\nxbase) << 5)\t\t\\\n> +\t\t| (((\\offset) & 7) << 10)\t\\\n> +\t\t| (((\\offset) & 0x1f8) << 13)\n> +.endm\n> +\n> +/* STR (predicate): STR P\\np, [X\\nxbase, #\\offset, MUL VL] */\n> +.macro _sve_str_p np, nxbase, offset=0\n> +\t_sve_check_preg \\np\n> +\t_check_general_reg \\nxbase\n> +\t_check_num (\\offset), -0x100, 0xff\n> +\t.inst\t0xe5800000\t\t\t\\\n> +\t\t| (\\np)\t\t\t\t\\\n> +\t\t| ((\\nxbase) << 5)\t\t\\\n> +\t\t| (((\\offset) & 7) << 10)\t\\\n> +\t\t| (((\\offset) & 0x1f8) << 13)\n> +.endm\n> +\n> +/* LDR (predicate): LDR P\\np, [X\\nxbase, #\\offset, MUL VL] */\n> +.macro _sve_ldr_p np, nxbase, offset=0\n> +\t_sve_check_preg \\np\n> +\t_check_general_reg \\nxbase\n> +\t_check_num (\\offset), -0x100, 0xff\n> +\t.inst\t0x85800000\t\t\t\\\n> +\t\t| (\\np)\t\t\t\t\\\n> +\t\t| ((\\nxbase) << 5)\t\t\\\n> +\t\t| (((\\offset) & 7) << 10)\t\\\n> +\t\t| (((\\offset) & 0x1f8) << 13)\n> +.endm\n> +\n> +/* RDVL X\\nx, #\\imm */\n> +.macro _sve_rdvl nx, imm\n> +\t_check_general_reg \\nx\n> +\t_check_num (\\imm), -0x20, 0x1f\n> +\t.inst\t0x04bf5000\t\t\t\\\n> +\t\t| (\\nx)\t\t\t\t\\\n> +\t\t| (((\\imm) & 0x3f) << 5)\n> +.endm\n> +\n> +/* RDFFR (unpredicated): RDFFR P\\np.B */\n> +.macro _sve_rdffr np\n> +\t_sve_check_preg \\np\n> +\t.inst\t0x2519f000\t\t\t\\\n> +\t\t| (\\np)\n> +.endm\n> +\n> +/* WRFFR P\\np.B */\n> +.macro _sve_wrffr np\n> +\t_sve_check_preg \\np\n> +\t.inst\t0x25289000\t\t\t\\\n> +\t\t| ((\\np) << 5)\n> +.endm\n> +\n> +.macro __for from:req, to:req\n> +\t.if (\\from) == (\\to)\n> +\t\t_for__body \\from\n> +\t.else\n> +\t\t__for \\from, (\\from) + ((\\to) - (\\from)) / 2\n> +\t\t__for (\\from) + ((\\to) - (\\from)) / 2 + 1, \\to\n> +\t.endif\n> +.endm\n> +\n> +.macro _for var:req, from:req, to:req, insn:vararg\n> +\t.macro _for__body \\var:req\n> +\t\t\\insn\n> +\t.endm\n> +\n> +\t__for \\from, \\to\n> +\n> +\t.purgem _for__body\n> +.endm\n> +\n> +.macro sve_save nxbase, xpfpsr, nxtmp\n> + _for n, 0, 31,\t_sve_str_v\t\\n, \\nxbase, \\n - 34\n> + _for n, 0, 15,\t_sve_str_p\t\\n, \\nxbase, \\n - 16\n> +\t\t_sve_rdffr\t0\n> +\t\t_sve_str_p\t0, \\nxbase\n> +\t\t_sve_ldr_p\t0, \\nxbase, -16\n> +\n> +\t\tmrs\t\tx\\nxtmp, fpsr\n> +\t\tstr\t\tw\\nxtmp, [\\xpfpsr]\n> +\t\tmrs\t\tx\\nxtmp, fpcr\n> +\t\tstr\t\tw\\nxtmp, [\\xpfpsr, #4]\n> +.endm\n> +\n> +.macro sve_load nxbase, xpfpsr, xvqminus1, nxtmp\n> +\t\tmrs_s\t\tx\\nxtmp, SYS_ZCR_EL1\n> +\t\tbic\t\tx\\nxtmp, x\\nxtmp, ZCR_ELx_LEN_MASK\n> +\t\torr\t\tx\\nxtmp, x\\nxtmp, \\xvqminus1\n> +\t\tmsr_s\t\tSYS_ZCR_EL1, x\\nxtmp\t// self-synchronising\n> +\n> + _for n, 0, 31,\t_sve_ldr_v\t\\n, \\nxbase, \\n - 34\n> +\t\t_sve_ldr_p\t0, \\nxbase\n> +\t\t_sve_wrffr\t0\n> + _for n, 0, 15,\t_sve_ldr_p\t\\n, \\nxbase, \\n - 16\n> +\n> +\t\tldr\t\tw\\nxtmp, [\\xpfpsr]\n> +\t\tmsr\t\tfpsr, x\\nxtmp\n> +\t\tldr\t\tw\\nxtmp, [\\xpfpsr, #4]\n> +\t\tmsr\t\tfpcr, x\\nxtmp\n> +.endm\n> diff --git a/arch/arm64/kernel/entry-fpsimd.S b/arch/arm64/kernel/entry-fpsimd.S\n> index 6a27cd6..73f17bf 100644\n> --- a/arch/arm64/kernel/entry-fpsimd.S\n> +++ b/arch/arm64/kernel/entry-fpsimd.S\n> @@ -41,3 +41,20 @@ ENTRY(fpsimd_load_state)\n>  \tfpsimd_restore x0, 8\n>  \tret\n>  ENDPROC(fpsimd_load_state)\n> +\n> +#ifdef CONFIG_ARM64_SVE\n> +ENTRY(sve_save_state)\n> +\tsve_save 0, x1, 2\n> +\tret\n> +ENDPROC(sve_save_state)\n> +\n> +ENTRY(sve_load_state)\n> +\tsve_load 0, x1, x2, 3\n> +\tret\n> +ENDPROC(sve_load_state)\n> +\n> +ENTRY(sve_get_vl)\n> +\t_sve_rdvl\t0, 1\n> +\tret\n> +ENDPROC(sve_get_vl)\n> +#endif /* CONFIG_ARM64_SVE */\n\n\n--\nAlex Bennée","headers":{"Return-Path":"<libc-alpha-return-84565-incoming=patchwork.ozlabs.org@sourceware.org>","X-Original-To":"incoming@patchwork.ozlabs.org","Delivered-To":["patchwork-incoming@bilbo.ozlabs.org","mailing list libc-alpha@sourceware.org"],"Authentication-Results":["ozlabs.org;\n\tspf=pass (mailfrom) smtp.mailfrom=sourceware.org\n\t(client-ip=209.132.180.131; helo=sourceware.org;\n\tenvelope-from=libc-alpha-return-84565-incoming=patchwork.ozlabs.org@sourceware.org;\n\treceiver=<UNKNOWN>)","ozlabs.org; dkim=pass (1024-bit key;\n\tsecure) header.d=sourceware.org header.i=@sourceware.org\n\theader.b=\"Rs5Q+6uq\"; dkim-atps=neutral","sourceware.org; auth=none"],"Received":["from sourceware.org (server1.sourceware.org [209.132.180.131])\n\t(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256\n\tbits)) (No client certificate requested)\n\tby ozlabs.org (Postfix) with ESMTPS id 3xsm7X3y7tz9s9Y\n\tfor <incoming@patchwork.ozlabs.org>;\n\tThu, 14 Sep 2017 01:39:52 +1000 (AEST)","(qmail 81281 invoked by alias); 13 Sep 2017 15:39:46 -0000","(qmail 80890 invoked by uid 89); 13 Sep 2017 15:39:45 -0000"],"DomainKey-Signature":"a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id\n\t:list-unsubscribe:list-subscribe:list-archive:list-post\n\t:list-help:sender:references:from:to:cc:subject:in-reply-to:date\n\t:message-id:mime-version:content-type:content-transfer-encoding;\n\tq=dns; s=default; b=iqgHyOJeLi6vX+qDXWJ8JccV4T4xV/CbLfIO3KutgVn\n\tx65uNiNLM3jU4u2vqN8+EpSZGD6B2VPinZl+JRYhlrBfeRfbf1+PEwyykF7Nnm/2\n\tOsHTalhQ1AYvYcrMCjSoTdjWRjPkMGdIZep+2s1D7hegFF/GYeOGXe9MjPyq48Vg\n\t=","DKIM-Signature":"v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id\n\t:list-unsubscribe:list-subscribe:list-archive:list-post\n\t:list-help:sender:references:from:to:cc:subject:in-reply-to:date\n\t:message-id:mime-version:content-type:content-transfer-encoding;\n\ts=default; bh=dHRVQspOpqMlOOk4szBgoN402VE=; b=Rs5Q+6uqrhCS5TQFw\n\tjQ/v4DHwunAEfsGJh+dSLF9dAKx1vuT5Oi9effNsK3vqT2V/zm1Q0aD/2s0DRRXc\n\tQgGTdMFUzT04FiJ+zqyMxO5f2fkNFzIcFLiwPrFaS1jXqqQobqtZnn7stzhnDXMB\n\thOZ6LncFycx7DNU0KMWhkrhm1o=","Mailing-List":"contact libc-alpha-help@sourceware.org; run by ezmlm","Precedence":"bulk","List-Id":"<libc-alpha.sourceware.org>","List-Unsubscribe":"<mailto:libc-alpha-unsubscribe-incoming=patchwork.ozlabs.org@sourceware.org>","List-Subscribe":"<mailto:libc-alpha-subscribe@sourceware.org>","List-Archive":"<http://sourceware.org/ml/libc-alpha/>","List-Post":"<mailto:libc-alpha@sourceware.org>","List-Help":"<mailto:libc-alpha-help@sourceware.org>,\n\t<http://sourceware.org/ml/#faqs>","Sender":"libc-alpha-owner@sourceware.org","X-Virus-Found":"No","X-Spam-SWARE-Status":"No, score=-25.9 required=5.0 tests=AWL, BAYES_00,\n\tGIT_PATCH_0, GIT_PATCH_1, GIT_PATCH_2, GIT_PATCH_3,\n\tRCVD_IN_DNSWL_NONE, RCVD_IN_SORBS_SPAM,\n\tSPF_PASS autolearn=ham version=3.3.2 spammy=LDR,\n\tHX-Received:10.223.134.174","X-HELO":"mail-wr0-f175.google.com","X-Google-DKIM-Signature":"v=1; a=rsa-sha256; c=relaxed/relaxed;\n\td=1e100.net; s=20161025;\n\th=x-gm-message-state:references:user-agent:from:to:cc:subject\n\t:in-reply-to:date:message-id:mime-version:content-transfer-encoding; \n\tbh=A9taivPhXeEsm+oc9z5v09BgI3d45ts2hml0OP6XgPg=;\n\tb=T17gp/IeFnIBWQZEnD0JVV+4wJyrOU2is6CqaDlX6ofU8CmKlmLWGnyVMl0SvejgvH\n\tVT5dd5PnPdFt3QrcXLsxpTy9hqwkkYbL6Yc1vABN5kkaK6RK74oXn7gFcySICg613/Vf\n\tMwju46fS5Vs3ycolRk2+/glfOEZBV0sORJXEoIFXTnm0z6L9dCn3QRxEwCM3ggcihu9x\n\tkKiCyQZS9nhmpG76TSCgIYGib5Ptmv9kL70nqn3Sa2hAXYmd+1NOutB42o+BCueu8NPK\n\tkAg5yFrR38kmyTvS+kvUOQ5sAboztqWZukswS33xSibeSkD23lSDafcGa4ljM1n6tFfM\n\tnXKQ==","X-Gm-Message-State":"AHPjjUiuaWK5YdX/wXQyoBgAulgRaYWPVfwodsKo4haNcYG04rNa3vtS\n\teVaex+5pcAhtlt31","X-Google-Smtp-Source":"ADKCNb7VcLxhvbcw1xW2H1WlKqEYkvVhBcf/v0i8DUN5ME4dMzx99+lop/V0XGDmLY4RSB9D19Hlwg==","X-Received":"by 10.223.134.174 with SMTP id 43mr17447017wrx.173.1505317181415;\n\tWed, 13 Sep 2017 08:39:41 -0700 (PDT)","References":"<1504198860-12951-1-git-send-email-Dave.Martin@arm.com>\n\t<1504198860-12951-8-git-send-email-Dave.Martin@arm.com>","User-agent":"mu4e 0.9.19; emacs 25.2.50.3","From":"Alex =?utf-8?q?Benn=C3=A9e?= <alex.bennee@linaro.org>","To":"Dave Martin <Dave.Martin@arm.com>","Cc":"linux-arm-kernel@lists.infradead.org,\n\tCatalin Marinas <catalin.marinas@arm.com>,\n\tWill Deacon <will.deacon@arm.com>,\n\tArd Biesheuvel <ard.biesheuvel@linaro.org>,\n\tSzabolcs Nagy <szabolcs.nagy@arm.com>,\n\tRichard Sandiford <richard.sandiford@arm.com>,\n\tkvmarm@lists.cs.columbia.edu, libc-alpha@sourceware.org,\n\tlinux-arch@vger.kernel.org","Subject":"Re: [PATCH v2 07/28] arm64/sve: Low-level SVE architectural state\n\tmanipulation functions","In-reply-to":"<1504198860-12951-8-git-send-email-Dave.Martin@arm.com>","Date":"Wed, 13 Sep 2017 16:39:40 +0100","Message-ID":"<87mv5ya9qb.fsf@linaro.org>","MIME-Version":"1.0","Content-Type":"text/plain; charset=utf-8","Content-Transfer-Encoding":"8bit"}}]