Show patches with: Submitter = liuhongt | State = Action Required | Archived = No | 544 patches
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
Fix fp16 related testcase failure for i686. Fix fp16 related testcase failure for i686. - - - - --- 2023-07-20 liuhongt New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. - - - - --- 2023-07-17 liuhongt New
Fix typo in the testcase. Fix typo in the testcase. - - - - --- 2023-07-11 liuhongt New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 liuhongt New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… [v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… - - - - --- 2023-07-11 liuhongt New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 liuhongt New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… - - - - --- 2023-07-10 liuhongt New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern. [V2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-07 liuhongt New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 liuhongt New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 liuhongt New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. - - - - --- 2023-07-06 liuhongt New
Break false dependence for vpternlog by inserting vpxor. Break false dependence for vpternlog by inserting vpxor. - - - - --- 2023-07-04 liuhongt New
[2/2] Make option mvzeroupper independent of optimization level. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 liuhongt New
[1/2] Don't issue vzeroupper for vzeroupper call_insn. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 liuhongt New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV. [x86] Refine maskstore patterns with UNSPEC_MASKMOV. - - - - --- 2023-06-27 liuhongt New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. - - - - --- 2023-06-26 liuhongt New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 liuhongt New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 liuhongt New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 liuhongt New
Refine maskloadmn pattern with UNSPEC_MASKLOAD. Refine maskloadmn pattern with UNSPEC_MASKLOAD. - - - - --- 2023-06-21 liuhongt New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-20 liuhongt New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 liuhongt New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 liuhongt New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. [x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. - - - - --- 2023-06-14 liuhongt New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. [1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 liuhongt New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. [v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. - - - - --- 2023-06-06 liuhongt New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. - - - - --- 2023-06-06 liuhongt New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 liuhongt New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. [x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. - - - - --- 2023-06-05 liuhongt New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-02 liuhongt New
i386: Add missing vector truncate patterns [PR92658]. i386: Add missing vector truncate patterns [PR92658]. - - - - --- 2023-06-02 liuhongt New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. - - - - --- 2023-06-01 liuhongt New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. - - - - --- 2023-05-26 liuhongt New
[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes. [x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes. - - - - --- 2023-05-26 liuhongt New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR. Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR. - - - - --- 2023-05-22 liuhongt New
Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode. Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode. - - - - --- 2023-05-17 liuhongt New
[V2] Provide -fcf-protection=branch,return. [V2] Provide -fcf-protection=branch,return. - - - - --- 2023-05-13 liuhongt New
Provide -fcf-protection=branch,return. Provide -fcf-protection=branch,return. - - - - --- 2023-05-12 liuhongt New
x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. - - - - --- 2023-05-10 liuhongt New
Detect bswap + rotate for byte permutation in pass_bswap. Detect bswap + rotate for byte permutation in pass_bswap. - - - - --- 2023-05-09 liuhongt New
[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. [V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. - - - - --- 2023-05-08 liuhongt New
[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from … [powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from … - - - - --- 2023-05-04 liuhongt New
[v2] Canonicalize vec_merge when mask is constant. [v2] Canonicalize vec_merge when mask is constant. - - - - --- 2023-05-04 liuhongt New
[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. [vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision. - - - - --- 2023-04-26 liuhongt New
Add testcases for ffs/ctz vectorization. Add testcases for ffs/ctz vectorization. - - - - --- 2023-04-23 liuhongt New
[2/2,i386] def_or_undef __STDCPP_FLOAT16_T__ and __STDCPP_BFLOAT16_T__ for target attribute/pragmas. [1/2,i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-21 liuhongt New
[1/2,i386] Support type _Float16/__bf16 independent of SSE2. [1/2,i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-21 liuhongt New
Canonicalize vec_merge when mask is constant. Canonicalize vec_merge when mask is constant. - - - - --- 2023-04-20 liuhongt New
[2/2] Adjust testcases after better RA decision. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 liuhongt New
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 liuhongt New
[i386] Support type _Float16/__bf16 independent of SSE2. [i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-19 liuhongt New
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… - - - - --- 2023-04-04 liuhongt New
Document signbitm2. Document signbitm2. - - - - --- 2023-03-31 liuhongt New
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. - - - - --- 2023-03-31 liuhongt New
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… [V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… - - - - --- 2023-03-30 liuhongt New
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. - - - - --- 2023-03-30 liuhongt New
Generate vpblendd instead of vpblendw for V4SI under AVX2. Generate vpblendd instead of vpblendw for V4SI under AVX2. - - - - --- 2023-03-29 liuhongt New
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. - - - - --- 2023-03-22 liuhongt New
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. [vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. - - - - --- 2023-02-02 liuhongt New
Change AVX512FP16 to AVX512-FP16 which is official name. Change AVX512FP16 to AVX512-FP16 which is official name. - - - - --- 2023-01-29 liuhongt New
Change AVX512FP16 to AVX512-FP16 in the document. Change AVX512FP16 to AVX512-FP16 in the document. - - - - --- 2023-01-29 liuhongt New
Don't add crtfastmath.o for -shared. Don't add crtfastmath.o for -shared. - - - - --- 2023-01-13 liuhongt New
[V2,2/2,x86] x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. Untitled series #332816 - - - - --- 2022-12-15 liuhongt New
[V2,1/2] x86: Don't add crtfastmath.o for -shared - - - - --- 2022-12-15 liuhongt New
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… [x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… - - - - --- 2022-12-14 liuhongt New
[x86] Fix ICE due to condition mismatch between expander and define_insn. [x86] Fix ICE due to condition mismatch between expander and define_insn. - - - - --- 2022-12-06 liuhongt New
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. [x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. - - - - --- 2022-12-02 liuhongt New
[x86] Fix ICE due to incorrect insn type. [x86] Fix ICE due to incorrect insn type. - - - - --- 2022-12-01 liuhongt New
[1/2,V2] Implement hwasan target_hook. [1/2,V2] Implement hwasan target_hook. - - - - --- 2022-11-30 liuhongt New
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. [x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. - - - - --- 2022-11-28 liuhongt New
[V3,x86] Fix incorrect _mm_cvtsbh_ss. [V3,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-25 liuhongt New
[v2,x86] Fix incorrect _mm_cvtsbh_ss. [v2,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-24 liuhongt New
[x86] Fix incorrect implementation for mm_cvtsbh_ss. [x86] Fix incorrect implementation for mm_cvtsbh_ss. - - - - --- 2022-11-23 liuhongt New
[x86] Some tidy up for RA related hooks. [x86] Some tidy up for RA related hooks. - - - - --- 2022-11-21 liuhongt New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. [x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. - - - - --- 2022-11-18 liuhongt New
[2/2] Enable hwasan for x86-64. Support HWASAN with Intel LAM - - - - --- 2022-11-11 liuhongt New
[1/2] Implement hwasan target_hook. Support HWASAN with Intel LAM - - - - --- 2022-11-11 liuhongt New
Fix incorrect insn type to avoid ICE in memory attr auto-detection. Fix incorrect insn type to avoid ICE in memory attr auto-detection. - - - - --- 2022-11-08 liuhongt New
Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count. Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count. - - - - --- 2022-10-31 liuhongt New
[V2,x86] Fix incorrect digit constraint [V2,x86] Fix incorrect digit constraint - - - - --- 2022-10-31 liuhongt New
[x86] Fix incorrect digit constraint [x86] Fix incorrect digit constraint - - - - --- 2022-10-27 liuhongt New
[x86] Enable V4BFmode and V2BFmode. [x86] Enable V4BFmode and V2BFmode. - - - - --- 2022-10-26 liuhongt New
Canonicalize vec_perm index to make the first index come from the first vector. Canonicalize vec_perm index to make the first index come from the first vector. - - - - --- 2022-10-18 liuhongt New
[x86] Add define_insn_and_split to support general version of "kxnor". [x86] Add define_insn_and_split to support general version of "kxnor". - - - - --- 2022-10-11 liuhongt New
[x86] Fix unrecognizable insn of cvtss2si. [x86] Fix unrecognizable insn of cvtss2si. - - - - --- 2022-10-10 liuhongt New
Check nonlinear iv in vect_can_advance_ivs_p. Check nonlinear iv in vect_can_advance_ivs_p. - - - - --- 2022-09-29 liuhongt New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. [x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. - - - - --- 2022-09-26 liuhongt New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. [x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. - - - - --- 2022-09-23 liuhongt New
[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand. [x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand. - - - - --- 2022-09-22 liuhongt New
Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant. Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant. - - - - --- 2022-09-20 liuhongt New
Fix incorrect handle in vectorizable_induction for mixed induction type. Fix incorrect handle in vectorizable_induction for mixed induction type. - - - - --- 2022-09-20 liuhongt New
Support 64-bit vectorization for single-precision floating rounding operation. Support 64-bit vectorization for single-precision floating rounding operation. - - - - --- 2022-09-20 liuhongt New
[x86] Adjust issue_rate for latest Intel processors. [x86] Adjust issue_rate for latest Intel processors. - - - - --- 2022-09-16 liuhongt New
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg [x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg - - - - --- 2022-09-16 liuhongt New
Modernize ix86_builtin_vectorized_function with corresponding expanders. Modernize ix86_builtin_vectorized_function with corresponding expanders. - - - - --- 2022-09-16 liuhongt New
[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction. [ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction. - - - - --- 2022-09-14 liuhongt New
Fix _mm512_cvt_roundps_ph to generate sae instruction. Fix _mm512_cvt_roundps_ph to generate sae instruction. - - - - --- 2022-09-05 liuhongt New
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. [V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. - - - - --- 2022-08-29 liuhongt New
Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2 Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2 - - - - --- 2022-08-24 liuhongt New
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. [RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. - - - - --- 2022-08-04 liuhongt New