Show patches with: State = Action Required       |   126921 patches
« 1 2 ... 11 12 131269 1270 »
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
[V2,i386] Optimize v4si broadcast for noavx512vl. [V2,i386] Optimize v4si broadcast for noavx512vl. - - - - --- 2022-03-07 liuhongt New
[i386] Add extra cost for unsigned_load which may have stall forward issue. [i386] Add extra cost for unsigned_load which may have stall forward issue. - - - - --- 2022-03-15 liuhongt New
[i386] Add extra cost for unsigned_load which may have stall forward issue. [i386] Add extra cost for unsigned_load which may have stall forward issue. - - - - --- 2022-03-16 liuhongt New
[i386] Add extra cost for unsigned_load which may have stall forward issue. [i386] Add extra cost for unsigned_load which may have stall forward issue. - - - - --- 2022-03-16 liuhongt New
[i386] Don't fold __builtin_ia32_blendvpd w/o sse4.2. [i386] Don't fold __builtin_ia32_blendvpd w/o sse4.2. - - - - --- 2022-03-16 liuhongt New
[avx512fp16] Refine HImode movement for "v" to "v". [avx512fp16] Refine HImode movement for "v" to "v". - - - - --- 2022-03-18 liuhongt New
[i386] Extend splitter pattern to reversed condition by swapping then and else rtx. [PR target/1049… [i386] Extend splitter pattern to reversed condition by swapping then and else rtx. [PR target/1049… - - - - --- 2022-03-21 liuhongt New
Fix ICE caused by NULL_RTX returned by lowpart_subreg. Fix ICE caused by NULL_RTX returned by lowpart_subreg. - - - - --- 2022-03-23 liuhongt New
[i386] Fix typo in vec_setv8hi_0. [i386] Fix typo in vec_setv8hi_0. - - - - --- 2022-03-28 liuhongt New
Split vector load from parm_del to elemental loads to avoid STLF stalls. Split vector load from parm_del to elemental loads to avoid STLF stalls. - - - - --- 2022-03-31 liuhongt New
Split vector load from parm_del to elemental loads to avoid STLF stalls. Split vector load from parm_del to elemental loads to avoid STLF stalls. - - - - --- 2022-04-01 liuhongt New
[V3] Split vector load from parm_del to elemental loads to avoid STLF stalls. [V3] Split vector load from parm_del to elemental loads to avoid STLF stalls. - - - - --- 2022-04-01 liuhongt New
Refine and/ior/xor/andn masked patterns for V*HFmode. Refine and/ior/xor/andn masked patterns for V*HFmode. - - - - --- 2022-04-06 liuhongt New
Strip of a vector load which is only used partially. Strip of a vector load which is only used partially. - - - - --- 2022-05-05 liuhongt New
Expand __builtin_memcmp_eq with ptest for OI/TImode. Expand __builtin_memcmp_eq with ptest for OI/TImode. - - - - --- 2022-05-05 liuhongt New
Expand __builtin_memcmp_eq with ptest for OImode. Expand __builtin_memcmp_eq with ptest for OImode. - - - - --- 2022-05-07 liuhongt New
[i386] Optimize movzwl + vmovd/vmovq to vmovw. [i386] Optimize movzwl + vmovd/vmovq to vmovw. - - - - --- 2022-05-09 liuhongt New
[v2] Strip of a vector load which is only used partially. [v2] Strip of a vector load which is only used partially. - - - - --- 2022-05-09 liuhongt New
[Middle-end] Enhance final_value_replacement_loop to handle bitwise induction. [Middle-end] Enhance final_value_replacement_loop to handle bitwise induction. - - - - --- 2022-05-09 liuhongt New
[i386] Implement permutation with pslldq + psrldq + por when pshufb is not available. [i386] Implement permutation with pslldq + psrldq + por when pshufb is not available. - - - - --- 2022-05-09 liuhongt New
Optimize vec_setv8{hi,hf}_0 + pmovzxbq to pmovzxbq. Optimize vec_setv8{hi,hf}_0 + pmovzxbq to pmovzxbq. - - - - --- 2022-05-09 liuhongt New
Optimize vpermtiw/b to vpunpcklqdq for certain cases. Optimize vpermtiw/b to vpunpcklqdq for certain cases. - - - - --- 2022-05-13 liuhongt New
[v2] Optimize vpermtiw/b to vpunpcklqdq for certain cases. [v2] Optimize vpermtiw/b to vpunpcklqdq for certain cases. - - - - --- 2022-05-13 liuhongt New
[i386] Fix ICE caused by wrong condition. [i386] Fix ICE caused by wrong condition. - - - - --- 2022-05-13 liuhongt New
Clamp vec_perm_expr index in simplify_bitfield_ref to avoid ICE. Clamp vec_perm_expr index in simplify_bitfield_ref to avoid ICE. - - - - --- 2022-05-16 liuhongt New
[i386] recognize bzhi pattern when there's zero_extendsidi. [i386] recognize bzhi pattern when there's zero_extendsidi. - - - - --- 2022-05-17 liuhongt New
Increase move cost between mask and gpr. Increase move cost between mask and gpr. - - - - --- 2022-05-20 liuhongt New
Add a bit dislike for separate mem alternative when op is REG_P. Add a bit dislike for separate mem alternative when op is REG_P. - - - - --- 2022-05-25 liuhongt New
Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. - - - - --- 2022-06-02 liuhongt New
Fix insn does not satisfy its constraints: sse2_lshrv1ti3 Fix insn does not satisfy its constraints: sse2_lshrv1ti3 - - - - --- 2022-06-07 liuhongt New
Disparages SSE_REGS alternatives sligntly with ?v instead of *v in *mov{si, di}_internal. Disparages SSE_REGS alternatives sligntly with ?v instead of *v in *mov{si, di}_internal. - - - - --- 2022-06-07 liuhongt New
Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. - - - - --- 2022-06-07 liuhongt New
Adjust testcase to avoid compile failure under -m32. Adjust testcase to avoid compile failure under -m32. - - - - --- 2022-06-08 liuhongt New
Add optional __Bfloat16 support Add optional __Bfloat16 support - - - - --- 2022-06-10 liuhongt New
Fix ICE in extract_insn, at recog.cc:2791 Fix ICE in extract_insn, at recog.cc:2791 - - - - --- 2022-06-14 liuhongt New
[x86] Replace REGNO with reg_or_subregno in pre_reload splitter. [x86] Replace REGNO with reg_or_subregno in pre_reload splitter. - - - - --- 2022-06-20 liuhongt New
Improve reg_or_subregno to return INVALID_REGNUM when the subreg of memory is processed. Improve reg_or_subregno to return INVALID_REGNUM when the subreg of memory is processed. - - - - --- 2022-06-23 liuhongt New
Allocate general register(memory/immediate) for 16/32/64-bit vector bit_op patterns. Allocate general register(memory/immediate) for 16/32/64-bit vector bit_op patterns. - - - - --- 2022-07-11 liuhongt New
[RFC] Support vectorization for Complex type. [RFC] Support vectorization for Complex type. - - - - --- 2022-07-11 liuhongt New
Extend 64-bit vector bit_op patterns with ?r alternative Extend 64-bit vector bit_op patterns with ?r alternative - - - - --- 2022-07-14 liuhongt New
Extend 16/32-bit vector bit_op patterns with (m, 0, i)(vertical) alternative. Extend 16/32-bit vector bit_op patterns with (m, 0, i)(vertical) alternative. - - - - --- 2022-07-18 liuhongt New
[V2,RFC] Support vectorization for Complex type. [V2,RFC] Support vectorization for Complex type. - - - - --- 2022-07-18 liuhongt New
[V2] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. [V2] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. - - - - --- 2022-07-19 liuhongt New
Move pass_cse_sincos after vectorizer. Move pass_cse_sincos after vectorizer. - - - - --- 2022-07-20 liuhongt New
Lower complex type move to enable vectorization for complex type load&store. Lower complex type move to enable vectorization for complex type load&store. - - - - --- 2022-07-20 liuhongt New
[V3] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. [V3] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. - - - - --- 2022-07-21 liuhongt New
Adjust testcase. Adjust testcase. - - - - --- 2022-07-22 liuhongt New
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. [RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. - - - - --- 2022-08-04 liuhongt New
Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2 Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2 - - - - --- 2022-08-24 liuhongt New
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. [V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. - - - - --- 2022-08-29 liuhongt New
Fix _mm512_cvt_roundps_ph to generate sae instruction. Fix _mm512_cvt_roundps_ph to generate sae instruction. - - - - --- 2022-09-05 liuhongt New
[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction. [ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction. - - - - --- 2022-09-14 liuhongt New
Modernize ix86_builtin_vectorized_function with corresponding expanders. Modernize ix86_builtin_vectorized_function with corresponding expanders. - - - - --- 2022-09-16 liuhongt New
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg [x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg - - - - --- 2022-09-16 liuhongt New
[x86] Adjust issue_rate for latest Intel processors. [x86] Adjust issue_rate for latest Intel processors. - - - - --- 2022-09-16 liuhongt New
Support 64-bit vectorization for single-precision floating rounding operation. Support 64-bit vectorization for single-precision floating rounding operation. - - - - --- 2022-09-20 liuhongt New
Fix incorrect handle in vectorizable_induction for mixed induction type. Fix incorrect handle in vectorizable_induction for mixed induction type. - - - - --- 2022-09-20 liuhongt New
Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant. Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant. - - - - --- 2022-09-20 liuhongt New
[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand. [x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand. - - - - --- 2022-09-22 liuhongt New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. [x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. - - - - --- 2022-09-23 liuhongt New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. [x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. - - - - --- 2022-09-26 liuhongt New
Check nonlinear iv in vect_can_advance_ivs_p. Check nonlinear iv in vect_can_advance_ivs_p. - - - - --- 2022-09-29 liuhongt New
[x86] Fix unrecognizable insn of cvtss2si. [x86] Fix unrecognizable insn of cvtss2si. - - - - --- 2022-10-10 liuhongt New
[x86] Add define_insn_and_split to support general version of "kxnor". [x86] Add define_insn_and_split to support general version of "kxnor". - - - - --- 2022-10-11 liuhongt New
Canonicalize vec_perm index to make the first index come from the first vector. Canonicalize vec_perm index to make the first index come from the first vector. - - - - --- 2022-10-18 liuhongt New
[x86] Enable V4BFmode and V2BFmode. [x86] Enable V4BFmode and V2BFmode. - - - - --- 2022-10-26 liuhongt New
[x86] Fix incorrect digit constraint [x86] Fix incorrect digit constraint - - - - --- 2022-10-27 liuhongt New
[V2,x86] Fix incorrect digit constraint [V2,x86] Fix incorrect digit constraint - - - - --- 2022-10-31 liuhongt New
Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count. Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count. - - - - --- 2022-10-31 liuhongt New
Fix incorrect insn type to avoid ICE in memory attr auto-detection. Fix incorrect insn type to avoid ICE in memory attr auto-detection. - - - - --- 2022-11-08 liuhongt New
[1/2] Implement hwasan target_hook. Support HWASAN with Intel LAM - - - - --- 2022-11-11 liuhongt New
[2/2] Enable hwasan for x86-64. Support HWASAN with Intel LAM - - - - --- 2022-11-11 liuhongt New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. [x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. - - - - --- 2022-11-18 liuhongt New
[x86] Some tidy up for RA related hooks. [x86] Some tidy up for RA related hooks. - - - - --- 2022-11-21 liuhongt New
[x86] Fix incorrect implementation for mm_cvtsbh_ss. [x86] Fix incorrect implementation for mm_cvtsbh_ss. - - - - --- 2022-11-23 liuhongt New
[v2,x86] Fix incorrect _mm_cvtsbh_ss. [v2,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-24 liuhongt New
[V3,x86] Fix incorrect _mm_cvtsbh_ss. [V3,x86] Fix incorrect _mm_cvtsbh_ss. - - - - --- 2022-11-25 liuhongt New
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. [x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. - - - - --- 2022-11-28 liuhongt New
[1/2,V2] Implement hwasan target_hook. [1/2,V2] Implement hwasan target_hook. - - - - --- 2022-11-30 liuhongt New
[x86] Fix ICE due to incorrect insn type. [x86] Fix ICE due to incorrect insn type. - - - - --- 2022-12-01 liuhongt New
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. [x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1. - - - - --- 2022-12-02 liuhongt New
[x86] Fix ICE due to condition mismatch between expander and define_insn. [x86] Fix ICE due to condition mismatch between expander and define_insn. - - - - --- 2022-12-06 liuhongt New
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… [x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ… - - - - --- 2022-12-14 liuhongt New
[V2,1/2] x86: Don't add crtfastmath.o for -shared - - - - --- 2022-12-15 liuhongt New
[V2,2/2,x86] x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR. Untitled series #332816 - - - - --- 2022-12-15 liuhongt New
Don't add crtfastmath.o for -shared. Don't add crtfastmath.o for -shared. - - - - --- 2023-01-13 liuhongt New
Change AVX512FP16 to AVX512-FP16 in the document. Change AVX512FP16 to AVX512-FP16 in the document. - - - - --- 2023-01-29 liuhongt New
Change AVX512FP16 to AVX512-FP16 which is official name. Change AVX512FP16 to AVX512-FP16 which is official name. - - - - --- 2023-01-29 liuhongt New
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. [vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant. - - - - --- 2023-02-02 liuhongt New
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore. - - - - --- 2023-03-22 liuhongt New
Generate vpblendd instead of vpblendw for V4SI under AVX2. Generate vpblendd instead of vpblendw for V4SI under AVX2. - - - - --- 2023-03-29 liuhongt New
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq. - - - - --- 2023-03-30 liuhongt New
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… [V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na… - - - - --- 2023-03-30 liuhongt New
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8. - - - - --- 2023-03-31 liuhongt New
Document signbitm2. Document signbitm2. - - - - --- 2023-03-31 liuhongt New
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla… - - - - --- 2023-04-04 liuhongt New
[i386] Support type _Float16/__bf16 independent of SSE2. [i386] Support type _Float16/__bf16 independent of SSE2. - - - - --- 2023-04-19 liuhongt New
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 liuhongt New
[2/2] Adjust testcases after better RA decision. [1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet. - - - - --- 2023-04-20 liuhongt New
Canonicalize vec_merge when mask is constant. Canonicalize vec_merge when mask is constant. - - - - --- 2023-04-20 liuhongt New
« 1 2 ... 11 12 131269 1270 »