Show patches with: Submitter = Liu, Hongtao       |    State = Action Required       |   566 patches
« 1 2 3 45 6 »
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. - - - - --- 2023-08-25 Liu, Hongtao New
[x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H [x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H - - - - --- 2023-08-24 Liu, Hongtao New
Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") - - - - --- 2023-08-23 Liu, Hongtao New
[x86] Testcase fix. [x86] Testcase fix. - - - - --- 2023-08-22 Liu, Hongtao New
Adjust testcase for Intel GDS. Adjust testcase for Intel GDS. - - - - --- 2023-08-21 Liu, Hongtao New
Mention Intel -march=gracemont for Alderlake-N. Mention Intel -march=gracemont for Alderlake-N. - - - - --- 2023-08-21 Liu, Hongtao New
Support -march=gracemont Support -march=gracemont - - - - --- 2023-08-18 Liu, Hongtao New
Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. - - - - --- 2023-08-14 Liu, Hongtao New
[V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… [V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… - - - - --- 2023-08-11 Liu, Hongtao New
Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. - - - - --- 2023-08-11 Liu, Hongtao New
Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… - - - - --- 2023-08-10 Liu, Hongtao New
i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] - - - - --- 2023-08-10 Liu, Hongtao New
Rename local variable subleaf_level to max_subleaf_level. Rename local variable subleaf_level to max_subleaf_level. - - - - --- 2023-08-09 Liu, Hongtao New
[V2,X86] Workaround possible CPUID bug in Sandy Bridge. [V2,X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-09 Liu, Hongtao New
[X86] Workaround possible CPUID bug in Sandy Bridge. [X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-08 Liu, Hongtao New
i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] - - - - --- 2023-08-07 Liu, Hongtao New
Fix ICE in rtl check when bootstrap. Fix ICE in rtl check when bootstrap. - - - - --- 2023-08-07 Liu, Hongtao New
Optimize vlddqu + inserti128 to vbroadcasti128 Optimize vlddqu + inserti128 to vbroadcasti128 - - - - --- 2023-08-02 Liu, Hongtao New
Support vec_fmaddsub/vec_fmsubadd for vector HFmode. Support vec_fmaddsub/vec_fmsubadd for vector HFmode. - - - - --- 2023-08-02 Liu, Hongtao New
Adjust testcase for more optimal codegen. Adjust testcase for more optimal codegen. - - - - --- 2023-08-01 Liu, Hongtao New
[x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. [x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. - - - - --- 2023-07-27 Liu, Hongtao New
Optimize vlddqu to vmovdqu for TARGET_AVX Optimize vlddqu to vmovdqu for TARGET_AVX - - - - --- 2023-07-20 Liu, Hongtao New
Fix fp16 related testcase failure for i686. Fix fp16 related testcase failure for i686. - - - - --- 2023-07-20 Liu, Hongtao New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. - - - - --- 2023-07-17 Liu, Hongtao New
Fix typo in the testcase. Fix typo in the testcase. - - - - --- 2023-07-11 Liu, Hongtao New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 Liu, Hongtao New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… [v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… - - - - --- 2023-07-11 Liu, Hongtao New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 Liu, Hongtao New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… - - - - --- 2023-07-10 Liu, Hongtao New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern. [V2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-07 Liu, Hongtao New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 Liu, Hongtao New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 Liu, Hongtao New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. - - - - --- 2023-07-06 Liu, Hongtao New
Break false dependence for vpternlog by inserting vpxor. Break false dependence for vpternlog by inserting vpxor. - - - - --- 2023-07-04 Liu, Hongtao New
[2/2] Make option mvzeroupper independent of optimization level. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 Liu, Hongtao New
[1/2] Don't issue vzeroupper for vzeroupper call_insn. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 Liu, Hongtao New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV. [x86] Refine maskstore patterns with UNSPEC_MASKMOV. - - - - --- 2023-06-27 Liu, Hongtao New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. - - - - --- 2023-06-26 Liu, Hongtao New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
Refine maskloadmn pattern with UNSPEC_MASKLOAD. Refine maskloadmn pattern with UNSPEC_MASKLOAD. - - - - --- 2023-06-21 Liu, Hongtao New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-20 Liu, Hongtao New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 Liu, Hongtao New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 Liu, Hongtao New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. [x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. - - - - --- 2023-06-14 Liu, Hongtao New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. [1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 Liu, Hongtao New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. [v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. - - - - --- 2023-06-06 Liu, Hongtao New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. - - - - --- 2023-06-06 Liu, Hongtao New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 Liu, Hongtao New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. [x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. - - - - --- 2023-06-05 Liu, Hongtao New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-02 Liu, Hongtao New
i386: Add missing vector truncate patterns [PR92658]. i386: Add missing vector truncate patterns [PR92658]. - - - - --- 2023-06-02 Liu, Hongtao New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. - - - - --- 2023-06-01 Liu, Hongtao New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. - - - - --- 2023-05-26 Liu, Hongtao New
[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes. [x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes. - - - - --- 2023-05-26 Liu, Hongtao New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR. Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR. - - - - --- 2023-05-22 Liu, Hongtao New
Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode. Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode. - - - - --- 2023-05-17 Liu, Hongtao New
[V2] Provide -fcf-protection=branch,return. [V2] Provide -fcf-protection=branch,return. - - - - --- 2023-05-13 Liu, Hongtao New
Provide -fcf-protection=branch,return. Provide -fcf-protection=branch,return. - - - - --- 2023-05-12 Liu, Hongtao New
x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. - - - - --- 2023-05-10 Liu, Hongtao New
Detect bswap + rotate for byte permutation in pass_bswap. Detect bswap + rotate for byte permutation in pass_bswap. - - - - --- 2023-05-09 Liu, Hongtao New
[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. [V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. - - - - --- 2023-05-08 Liu, Hongtao New
[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from … [powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from … - - - - --- 2023-05-04 Liu, Hongtao New
[v2] Canonicalize vec_merge when mask is constant. [v2] Canonicalize vec_merge when mask is constant. - - - - --- 2023-05-04 Liu, Hongtao New
[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. [vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. - - - - --- 2023-04-26 Liu, Hongtao New
Add testcases for ffs/ctz vectorization. Add testcases for ffs/ctz vectorization. - - - - --- 2023-04-23 Liu, Hongtao New
[2/2,i386] def_or_undef __STDCPP_FLOAT16_T__ and __STDCPP_BFLOAT16_T__ for target attribute/pragmas. [1/2,i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-21 Liu, Hongtao New
[1/2,i386] Support type _Float16/__bf16 independent of SSE2. [1/2,i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-21 Liu, Hongtao New
Canonicalize vec_merge when mask is constant. Canonicalize vec_merge when mask is constant. - - - - --- 2023-04-20 Liu, Hongtao New
[2/2] Adjust testcases after better RA decision. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 Liu, Hongtao New
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 Liu, Hongtao New
[i386] Support type _Float16/__bf16 independent of SSE2. [i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-19 Liu, Hongtao New
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… - - - - --- 2023-04-04 Liu, Hongtao New
Document signbitm2. Document signbitm2. - - - - --- 2023-03-31 Liu, Hongtao New
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. - - - - --- 2023-03-31 Liu, Hongtao New
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… [V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… - - - - --- 2023-03-30 Liu, Hongtao New
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. - - - - --- 2023-03-30 Liu, Hongtao New
Generate vpblendd instead of vpblendw for V4SI under AVX2. Generate vpblendd instead of vpblendw for V4SI under AVX2. - - - - --- 2023-03-29 Liu, Hongtao New
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. - - - - --- 2023-03-22 Liu, Hongtao New
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. [vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. - - - - --- 2023-02-02 Liu, Hongtao New
Change AVX512FP16 to AVX512-FP16 which is official name. Change AVX512FP16 to AVX512-FP16 which is official name. - - - - --- 2023-01-29 Liu, Hongtao New
Change AVX512FP16 to AVX512-FP16 in the document. Change AVX512FP16 to AVX512-FP16 in the document. - - - - --- 2023-01-29 Liu, Hongtao New
Don't add crtfastmath.o for -shared. Don't add crtfastmath.o for -shared. - - - - --- 2023-01-13 Liu, Hongtao New
[V2,2/2,x86] x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. Untitled series #332816 - - - - --- 2022-12-15 Liu, Hongtao New
[V2,1/2] x86: Don't add crtfastmath.o for -shared - - - - --- 2022-12-15 Liu, Hongtao New
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… [x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… - - - - --- 2022-12-14 Liu, Hongtao New
[x86] Fix ICE due to condition mismatch between expander and define_insn. [x86] Fix ICE due to condition mismatch between expander and define_insn. - - - - --- 2022-12-06 Liu, Hongtao New
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. [x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. - - - - --- 2022-12-02 Liu, Hongtao New
[x86] Fix ICE due to incorrect insn type. [x86] Fix ICE due to incorrect insn type. - - - - --- 2022-12-01 Liu, Hongtao New
[1/2,V2] Implement hwasan target_hook. [1/2,V2] Implement hwasan target_hook. - - - - --- 2022-11-30 Liu, Hongtao New
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. [x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. - - - - --- 2022-11-28 Liu, Hongtao New
[V3,x86] Fix incorrect _mm_cvtsbh_ss. [V3,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-25 Liu, Hongtao New
[v2,x86] Fix incorrect _mm_cvtsbh_ss. [v2,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-24 Liu, Hongtao New
[x86] Fix incorrect implementation for mm_cvtsbh_ss. [x86] Fix incorrect implementation for mm_cvtsbh_ss. - - - - --- 2022-11-23 Liu, Hongtao New
[x86] Some tidy up for RA related hooks. [x86] Some tidy up for RA related hooks. - - - - --- 2022-11-21 Liu, Hongtao New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. [x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. - - - - --- 2022-11-18 Liu, Hongtao New
[2/2] Enable hwasan for x86-64. Support HWASAN with Intel LAM - - - - --- 2022-11-11 Liu, Hongtao New
[1/2] Implement hwasan target_hook. Support HWASAN with Intel LAM - - - - --- 2022-11-11 Liu, Hongtao New
Fix incorrect insn type to avoid ICE in memory attr auto-detection. Fix incorrect insn type to avoid ICE in memory attr auto-detection. - - - - --- 2022-11-08 Liu, Hongtao New
« 1 2 3 45 6 »