Message ID | 000101d79b2b$5304ee10$f90eca30$@nextmovesoftware.com |
---|---|
State | New |
Headers | show |
Series | nvptx: Use cvt to perform sign-extension of truncation. | expand |
On 8/27/21 12:07 PM, Roger Sayle wrote: > > This patch introduces some new define_insn rules to the nvptx backend, > to perform sign-extension of a truncation (from and to the same mode), > using a single cvt instruction. As an example, the following function > > int foo(int x) { return (char)x; } > > with -O2 currently generates: > > mov.u32 %r24, %ar0; > mov.u32 %r26, %r24; > cvt.s32.s8 %value, %r26; > > and with this patch, now generates: > > mov.u32 %r24, %ar0; > cvt.s32.s8 %value, %r24; > > This patch has been tested on nvptx-none hosted by x86_64-pc-linux-gnu > with a top-level "make" (including newlib) and a "make check" with no > new regressions. Ok for mainline? > Hi Roger, thanks for the patch and apologies for the delay. I was not happy with negative-testing-only in the test-case, so I've split it up into 6 separate tests. The first one is not influenced by the patch, so I've committed that one separately. The other 5 I've committed together with the patch. Thanks, - Tom > > 2021-08-27 Roger Sayle <roger@nextmovesoftware.com> > > gcc/ChangeLog > * config/nvptx/nvptx.md (*extend_trunc_<mode>2_qi, > *extend_trunc_<mode>2_hi, *extend_trunc_di2_si): New insns. > Use cvt to perform sign-extension of truncation in one step. > > gcc/testsuite/ChangeLog > * gcc.target/nvptx/exttrunc.c: New test case. > > > Roger > -- >
diff --git a/gcc/config/nvptx/nvptx.md b/gcc/config/nvptx/nvptx.md index 108de1c..b7a0393 100644 --- a/gcc/config/nvptx/nvptx.md +++ b/gcc/config/nvptx/nvptx.md @@ -401,6 +401,32 @@ %.\\tst%A0.u%T0\\t%0, %1;" [(set_attr "subregs_ok" "true")]) +;; Sign-extensions of truncations + +(define_insn "*extend_trunc_<mode>2_qi" + [(set (match_operand:HSDIM 0 "nvptx_register_operand" "=R") + (sign_extend:HSDIM + (truncate:QI (match_operand:HSDIM 1 "nvptx_register_operand" "R"))))] + "" + "%.\\tcvt.s%T0.s8\\t%0, %1;" + [(set_attr "subregs_ok" "true")]) + +(define_insn "*extend_trunc_<mode>2_hi" + [(set (match_operand:SDIM 0 "nvptx_register_operand" "=R") + (sign_extend:SDIM + (truncate:HI (match_operand:SDIM 1 "nvptx_register_operand" "R"))))] + "" + "%.\\tcvt.s%T0.s16\\t%0, %1;" + [(set_attr "subregs_ok" "true")]) + +(define_insn "*extend_trunc_di2_si" + [(set (match_operand:DI 0 "nvptx_register_operand" "=R") + (sign_extend:DI + (truncate:SI (match_operand:DI 1 "nvptx_register_operand" "R"))))] + "" + "%.\\tcvt.s64.s32\\t%0, %1;" + [(set_attr "subregs_ok" "true")]) + ;; Integer arithmetic (define_insn "add<mode>3"