[RFC,v2,31/44] target/loongarch: Implement vpcnt

Message ID	20230328030631.3117129-32-gaosong@loongson.cn
State	New
Headers	show Return-Path: <qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org> From: Song Gao <gaosong@loongson.cn> To: qemu-devel@nongnu.org Cc: richard.henderson@linaro.org Subject: [RFC PATCH v2 31/44] target/loongarch: Implement vpcnt Date: Tue, 28 Mar 2023 11:06:18 +0800 Message-Id: <20230328030631.3117129-32-gaosong@loongson.cn> In-Reply-To: <20230328030631.3117129-1-gaosong@loongson.cn> References: <20230328030631.3117129-1-gaosong@loongson.cn> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=114.242.206.163; envelope-from=gaosong@loongson.cn; helo=loongson.cn X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org
Series	Add LoongArch LSX instructions \| expand [RFC,v2,00/44] Add LoongArch LSX instructions [RFC,v2,01/44] target/loongarch: Add LSX data type VReg [RFC,v2,02/44] target/loongarch: CPUCFG support LSX [RFC,v2,03/44] target/loongarch: meson.build support build LSX [RFC,v2,04/44] target/loongarch: Add CHECK_SXE maccro for check LSX enable [RFC,v2,05/44] target/loongarch: Implement vadd/vsub [RFC,v2,06/44] target/loongarch: Implement vaddi/vsubi [RFC,v2,07/44] target/loongarch: Implement vneg [RFC,v2,08/44] target/loongarch: Implement vsadd/vssub [RFC,v2,09/44] target/loongarch: Implement vhaddw/vhsubw [RFC,v2,10/44] target/loongarch: Implement vaddw/vsubw [RFC,v2,11/44] target/loongarch: Implement vavg/vavgr [RFC,v2,12/44] target/loongarch: Implement vabsd [RFC,v2,13/44] target/loongarch: Implement vadda [RFC,v2,14/44] target/loongarch: Implement vmax/vmin [RFC,v2,15/44] target/loongarch: Implement vmul/vmuh/vmulw{ev/od} [RFC,v2,16/44] target/loongarch: Implement vmadd/vmsub/vmaddw{ev/od} [RFC,v2,17/44] target/loongarch: Implement vdiv/vmod [RFC,v2,18/44] target/loongarch: Implement vsat [RFC,v2,19/44] target/loongarch: Implement vexth [RFC,v2,20/44] target/loongarch: Implement vsigncov [RFC,v2,21/44] target/loongarch: Implement vmskltz/vmskgez/vmsknz [RFC,v2,22/44] target/loongarch: Implement LSX logic instructions [RFC,v2,23/44] target/loongarch: Implement vsll vsrl vsra vrotr [RFC,v2,24/44] target/loongarch: Implement vsllwil vextl [RFC,v2,25/44] target/loongarch: Implement vsrlr vsrar [RFC,v2,26/44] target/loongarch: Implement vsrln vsran [RFC,v2,27/44] target/loongarch: Implement vsrlrn vsrarn [RFC,v2,28/44] target/loongarch: Implement vssrln vssran [RFC,v2,29/44] target/loongarch: Implement vssrlrn vssrarn [RFC,v2,30/44] target/loongarch: Implement vclo vclz [RFC,v2,31/44] target/loongarch: Implement vpcnt [RFC,v2,32/44] target/loongarch: Implement vbitclr vbitset vbitrev [RFC,v2,33/44] target/loongarch: Implement vfrstp [RFC,v2,34/44] target/loongarch: Implement LSX fpu arith instructions [RFC,v2,35/44] target/loongarch: Implement LSX fpu fcvt instructions [RFC,v2,36/44] target/loongarch: Implement vseq vsle vslt [RFC,v2,37/44] target/loongarch: Implement vfcmp [RFC,v2,38/44] target/loongarch: Implement vbitsel vset [RFC,v2,39/44] target/loongarch: Implement vinsgr2vr vpickve2gr vreplgr2vr [RFC,v2,40/44] target/loongarch: Implement vreplve vpack vpick [RFC,v2,41/44] target/loongarch: Implement vilvl vilvh vextrins vshuf [RFC,v2,42/44] target/loongarch: Implement vld vst [RFC,v2,43/44] target/loongarch: Implement vldi [RFC,v2,44/44] target/loongarch: Use {set/get}_gpr replace to cpu_fpr

Message ID

20230328030631.3117129-32-gaosong@loongson.cn

State

New

Headers

From: Song Gao <gaosong@loongson.cn>
To: qemu-devel@nongnu.org
Cc: richard.henderson@linaro.org
Subject: [RFC PATCH v2 31/44] target/loongarch: Implement vpcnt
Date: Tue, 28 Mar 2023 11:06:18 +0800
Message-Id: <20230328030631.3117129-32-gaosong@loongson.cn>
In-Reply-To: <20230328030631.3117129-1-gaosong@loongson.cn>
References: <20230328030631.3117129-1-gaosong@loongson.cn>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=114.242.206.163;
 envelope-from=gaosong@loongson.cn;
 helo=loongson.cn
X-Spam_score_int: -18
X-Spam_score: -1.9
X-Spam_bar: -
X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_PASS=-0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org
Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org

Series

Add LoongArch LSX instructions | expand

Commit Message

gaosong March 28, 2023, 3:06 a.m. UTC

This patch includes:
- VPCNT.{B/H/W/D}.

Signed-off-by: Song Gao <gaosong@loongson.cn>
---
 target/loongarch/disas.c                    |  5 ++++
 target/loongarch/helper.h                   |  5 ++++
 target/loongarch/insn_trans/trans_lsx.c.inc |  5 ++++
 target/loongarch/insns.decode               |  5 ++++
 target/loongarch/lsx_helper.c               | 30 +++++++++++++++++++++
 5 files changed, 50 insertions(+)

Comments

Richard Henderson April 2, 2023, 3:35 a.m. UTC | #1

On 3/27/23 20:06, Song Gao wrote:
> +static uint64_t do_vpcnt(uint64_t u1)
> +{
> +    u1 = (u1 & 0x5555555555555555ULL) + ((u1 >>  1) & 0x5555555555555555ULL);
> +    u1 = (u1 & 0x3333333333333333ULL) + ((u1 >>  2) & 0x3333333333333333ULL);
> +    u1 = (u1 & 0x0F0F0F0F0F0F0F0FULL) + ((u1 >>  4) & 0x0F0F0F0F0F0F0F0FULL);
> +    u1 = (u1 & 0x00FF00FF00FF00FFULL) + ((u1 >>  8) & 0x00FF00FF00FF00FFULL);
> +    u1 = (u1 & 0x0000FFFF0000FFFFULL) + ((u1 >> 16) & 0x0000FFFF0000FFFFULL);
> +    u1 = (u1 & 0x00000000FFFFFFFFULL) + ((u1 >> 32));
> +
> +    return u1;
> +}
> +
> +#define VPCNT(NAME, BIT, E, T)                                      \
> +void HELPER(NAME)(CPULoongArchState *env, uint32_t vd, uint32_t vj) \
> +{                                                                   \
> +    int i;                                                          \
> +    VReg *Vd = &(env->fpr[vd].vreg);                                \
> +    VReg *Vj = &(env->fpr[vj].vreg);                                \
> +                                                                    \
> +    for (i = 0; i < LSX_LEN/BIT; i++)                               \
> +    {                                                               \
> +        Vd->E(i) = do_vpcnt((T)Vj->E(i));                           \
> +    }                                                               \
> +}
> +
> +VPCNT(vpcnt_b, 8, B, uint8_t)
> +VPCNT(vpcnt_h, 16, H, uint16_t)
> +VPCNT(vpcnt_w, 32, W, uint32_t)
> +VPCNT(vpcnt_d, 64, D, uint64_t)

host-utils.h has ctpop{8,16,32,64}.


r~

diff --git a/target/loongarch/disas.c b/target/loongarch/disas.c
index 0c82a1d9d1..0ca51de9d8 100644
--- a/target/loongarch/disas.c
+++ b/target/loongarch/disas.c
@@ -1267,3 +1267,8 @@  INSN_LSX(vclz_b,           vv)
 INSN_LSX(vclz_h,           vv)
 INSN_LSX(vclz_w,           vv)
 INSN_LSX(vclz_d,           vv)
+
+INSN_LSX(vpcnt_b,          vv)
+INSN_LSX(vpcnt_h,          vv)
+INSN_LSX(vpcnt_w,          vv)
+INSN_LSX(vpcnt_d,          vv)
diff --git a/target/loongarch/helper.h b/target/loongarch/helper.h
index a7facc6bc1..38e310512b 100644
--- a/target/loongarch/helper.h
+++ b/target/loongarch/helper.h
@@ -495,3 +495,8 @@  DEF_HELPER_3(vclz_b, void, env, i32, i32)
 DEF_HELPER_3(vclz_h, void, env, i32, i32)
 DEF_HELPER_3(vclz_w, void, env, i32, i32)
 DEF_HELPER_3(vclz_d, void, env, i32, i32)
+
+DEF_HELPER_3(vpcnt_b, void, env, i32, i32)
+DEF_HELPER_3(vpcnt_h, void, env, i32, i32)
+DEF_HELPER_3(vpcnt_w, void, env, i32, i32)
+DEF_HELPER_3(vpcnt_d, void, env, i32, i32)
diff --git a/target/loongarch/insn_trans/trans_lsx.c.inc b/target/loongarch/insn_trans/trans_lsx.c.inc
index 5d81c02103..59923eb1fa 100644
--- a/target/loongarch/insn_trans/trans_lsx.c.inc
+++ b/target/loongarch/insn_trans/trans_lsx.c.inc
@@ -2794,3 +2794,8 @@  TRANS(vclz_b, gen_vv, gen_helper_vclz_b)
 TRANS(vclz_h, gen_vv, gen_helper_vclz_h)
 TRANS(vclz_w, gen_vv, gen_helper_vclz_w)
 TRANS(vclz_d, gen_vv, gen_helper_vclz_d)
+
+TRANS(vpcnt_b, gen_vv, gen_helper_vpcnt_b)
+TRANS(vpcnt_h, gen_vv, gen_helper_vpcnt_h)
+TRANS(vpcnt_w, gen_vv, gen_helper_vpcnt_w)
+TRANS(vpcnt_d, gen_vv, gen_helper_vpcnt_d)
diff --git a/target/loongarch/insns.decode b/target/loongarch/insns.decode
index 7591ec1bab..f865e83da5 100644
--- a/target/loongarch/insns.decode
+++ b/target/loongarch/insns.decode
@@ -968,3 +968,8 @@  vclz_b           0111 00101001 11000 00100 ..... .....    @vv
 vclz_h           0111 00101001 11000 00101 ..... .....    @vv
 vclz_w           0111 00101001 11000 00110 ..... .....    @vv
 vclz_d           0111 00101001 11000 00111 ..... .....    @vv
+
+vpcnt_b          0111 00101001 11000 01000 ..... .....    @vv
+vpcnt_h          0111 00101001 11000 01001 ..... .....    @vv
+vpcnt_w          0111 00101001 11000 01010 ..... .....    @vv
+vpcnt_d          0111 00101001 11000 01011 ..... .....    @vv
diff --git a/target/loongarch/lsx_helper.c b/target/loongarch/lsx_helper.c
index 8ec479dc2d..94dded7e49 100644
--- a/target/loongarch/lsx_helper.c
+++ b/target/loongarch/lsx_helper.c
@@ -2201,3 +2201,33 @@  DO_2OP(vclz_b, 8, B, uint8_t, DO_CLZ_B)
 DO_2OP(vclz_h, 16, H, uint16_t, DO_CLZ_H)
 DO_2OP(vclz_w, 32, W, uint32_t, DO_CLZ_W)
 DO_2OP(vclz_d, 64, D, uint64_t, DO_CLZ_D)
+
+static uint64_t do_vpcnt(uint64_t u1)
+{
+    u1 = (u1 & 0x5555555555555555ULL) + ((u1 >>  1) & 0x5555555555555555ULL);
+    u1 = (u1 & 0x3333333333333333ULL) + ((u1 >>  2) & 0x3333333333333333ULL);
+    u1 = (u1 & 0x0F0F0F0F0F0F0F0FULL) + ((u1 >>  4) & 0x0F0F0F0F0F0F0F0FULL);
+    u1 = (u1 & 0x00FF00FF00FF00FFULL) + ((u1 >>  8) & 0x00FF00FF00FF00FFULL);
+    u1 = (u1 & 0x0000FFFF0000FFFFULL) + ((u1 >> 16) & 0x0000FFFF0000FFFFULL);
+    u1 = (u1 & 0x00000000FFFFFFFFULL) + ((u1 >> 32));
+
+    return u1;
+}
+
+#define VPCNT(NAME, BIT, E, T)                                      \
+void HELPER(NAME)(CPULoongArchState *env, uint32_t vd, uint32_t vj) \
+{                                                                   \
+    int i;                                                          \
+    VReg *Vd = &(env->fpr[vd].vreg);                                \
+    VReg *Vj = &(env->fpr[vj].vreg);                                \
+                                                                    \
+    for (i = 0; i < LSX_LEN/BIT; i++)                               \
+    {                                                               \
+        Vd->E(i) = do_vpcnt((T)Vj->E(i));                           \
+    }                                                               \
+}
+
+VPCNT(vpcnt_b, 8, B, uint8_t)
+VPCNT(vpcnt_h, 16, H, uint16_t)
+VPCNT(vpcnt_w, 32, W, uint32_t)
+VPCNT(vpcnt_d, 64, D, uint64_t)

[RFC,v2,31/44] target/loongarch: Implement vpcnt

Commit Message

Comments

Patch