Show patches with: Submitter = Hongtao Liu       |    State = Action Required       |    Archived = No       |   540 patches
« 1 2 3 45 6 »
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
Don't assert for IFN_COND_{MIN, MAX} in vect_transform_reduction Don't assert for IFN_COND_{MIN, MAX} in vect_transform_reduction - - - - --- 2024-04-29 liuhongt New
[x86] Optimize 64-bit vector permutation with punpcklqdq + 128-bit vector pshuf. [x86] Optimize 64-bit vector permutation with punpcklqdq + 128-bit vector pshuf. - - - - --- 2024-04-28 liuhongt New
[2/2] Extend usdot_prodv*qi with vpmaddwd when AVXVNNI/AVX512VNNI is not available. [1/2,x86] Support dot_prod optabs for 64-bit vector. - - - - --- 2024-04-28 liuhongt New
[1/2,x86] Support dot_prod optabs for 64-bit vector. [1/2,x86] Support dot_prod optabs for 64-bit vector. - - - - --- 2024-04-28 liuhongt New
Update libbid according to the latest Intel Decimal Floating-Point Math Library. Update libbid according to the latest Intel Decimal Floating-Point Math Library. - - - - --- 2024-04-28 liuhongt New
[x86] Adjust alternative *k to ?k for avx512 mask in zero_extend patterns [x86] Adjust alternative *k to ?k for avx512 mask in zero_extend patterns - - - - --- 2024-04-28 liuhongt New
[V2] sanitizer: [PR110027] Align asan_vec[0] to MAX (BIGGEST_ALIGNMENT / BITS_PER_UNIT, ASAN_RED_ZO… [V2] sanitizer: [PR110027] Align asan_vec[0] to MAX (BIGGEST_ALIGNMENT / BITS_PER_UNIT, ASAN_RED_ZO… - - - - --- 2024-03-26 liuhongt New
Move pr114396.c from gcc.target/i386 to gcc.c-torture/execute. Move pr114396.c from gcc.target/i386 to gcc.c-torture/execute. - - - - --- 2024-03-22 liuhongt New
Fix runtime error for nonlinear iv vectorization(step_mult). Fix runtime error for nonlinear iv vectorization(step_mult). - - - - --- 2024-03-21 liuhongt New
[V2] Document -fexcess-precision=16. [V2] Document -fexcess-precision=16. - - - - --- 2024-03-20 liuhongt New
Document -fexcess-precision=16. Document -fexcess-precision=16. - - - - --- 2024-03-18 liuhongt New
i386 [stv]: Handle REG_EH_REGION note [pr111822]. i386 [stv]: Handle REG_EH_REGION note [pr111822]. - - - - --- 2024-03-18 liuhongt New
Add missing <any_logic>hf/bf patterns. Add missing <any_logic>hf/bf patterns. - - - - --- 2024-03-18 liuhongt New
i386[stv]: Handle REG_EH_REGION note i386[stv]: Handle REG_EH_REGION note - - - - --- 2024-03-14 liuhongt New
sanitizer: [PR110027] Align asan_vec[0] to MAX (alignb, ASAN_RED_ZONE_SIZE) sanitizer: [PR110027] Align asan_vec[0] to MAX (alignb, ASAN_RED_ZONE_SIZE) - - - - --- 2024-03-12 liuhongt New
Fix testcase for platform without gnu/stubs-x32.h Fix testcase for platform without gnu/stubs-x32.h - - - - --- 2024-02-19 liuhongt New
[wwwdoc] Hardware-assisted AddressSanitizer now works for x86_64 with LAM_U57 [wwwdoc] Hardware-assisted AddressSanitizer now works for x86_64 with LAM_U57 - - - - --- 2024-02-08 liuhongt New
[2/2,x86] Enable -mlam=u57 by default when compiled with -fsanitize=hwaddress. [1/2] Adjust hwasan testcase for x86 target. - - - - --- 2024-01-23 liuhongt New
[1/2] Adjust hwasan testcase for x86 target. [1/2] Adjust hwasan testcase for x86 target. - - - - --- 2024-01-23 liuhongt New
Adjust testcase gcc.target/i386/part-vect-copysignhf.c. Adjust testcase gcc.target/i386/part-vect-copysignhf.c. - - - - --- 2024-01-19 liuhongt New
Fix testcase failure on many platforms which don't support vect_int_max. Fix testcase failure on many platforms which don't support vect_int_max. - - - - --- 2024-01-19 liuhongt New
Document refactoring of the option -fcf-protection=x. Document refactoring of the option -fcf-protection=x. - - - - --- 2024-01-10 liuhongt New
Update documents for fcf-protection= Update documents for fcf-protection= - - - - --- 2024-01-10 liuhongt New
Optimize A < B ? A : B to MIN_EXPR. Optimize A < B ? A : B to MIN_EXPR. - - - - --- 2024-01-09 liuhongt New
Optimize A < B ? A : B to MIN_EXPR. Optimize A < B ? A : B to MIN_EXPR. - - - - --- 2023-12-19 liuhongt New
Force broadcast constant to mem for vec_dup{v4di, v8si, v4df, v8df} when TARGET_AVX2 is not availab… Force broadcast constant to mem for vec_dup{v4di, v8si, v4df, v8df} when TARGET_AVX2 is not availab… - - - - --- 2023-12-13 liuhongt New
Adjust vectorized cost for reduction. Adjust vectorized cost for reduction. - - - - --- 2023-12-12 liuhongt New
[v3] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… [v3] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… - - - - --- 2023-12-11 liuhongt New
[ICE] Support vpcmov for V4HF/V4BF/V2HF/V2BF under TARGET_XOP. [ICE] Support vpcmov for V4HF/V4BF/V2HF/V2BF under TARGET_XOP. - - - - --- 2023-12-08 liuhongt New
Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains … Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains … - - - - --- 2023-12-08 liuhongt New
Support udot_prodv*qi with emulation sdot_prodv*hi Support udot_prodv*qi with emulation sdot_prodv*hi - - - - --- 2023-12-04 liuhongt New
Don't vectorize when vector stmts are only vec_contruct and stores Don't vectorize when vector stmts are only vec_contruct and stores - - - - --- 2023-12-04 liuhongt New
Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa… Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa… - - - - --- 2023-12-01 liuhongt New
Use vec_extact_lo instead of subreg in reduc_<code>_scal_m. Use vec_extact_lo instead of subreg in reduc_<code>_scal_m. - - - - --- 2023-11-29 liuhongt New
[x86] Support sdot_prodv*qi with emulation of sdot_prodv*hi. [x86] Support sdot_prodv*qi with emulation of sdot_prodv*hi. - - - - --- 2023-11-29 liuhongt New
Take register pressure into account for vec_construct when the components are not loaded from memor… Take register pressure into account for vec_construct when the components are not loaded from memor… - - - - --- 2023-11-28 liuhongt New
Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms - - - - --- 2023-11-22 liuhongt New
[x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode [x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode - - - - --- 2023-11-20 liuhongt New
Support cbranchm for Vector HI/QImode. Support cbranchm for Vector HI/QImode. - - - - --- 2023-11-17 liuhongt New
[2/2] Add i?86-*-* and x86_64-*-* to vect_logical_reduc [1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. - - - - --- 2023-11-17 liuhongt New
[1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. [1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. - - - - --- 2023-11-17 liuhongt New
[V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… [V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… - - - - --- 2023-11-16 liuhongt New
Fix ICE of unrecognizable insn. Fix ICE of unrecognizable insn. - - - - --- 2023-11-16 liuhongt New
Fix ICE in vectorizable_nonlinear_induction with bitfield. Fix ICE in vectorizable_nonlinear_induction with bitfield. - - - - --- 2023-11-14 liuhongt New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)). Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)). - - - - --- 2023-11-10 liuhongt New
Support vec_set/vec_extract/vec_init for V4HF/V2HF. Support vec_set/vec_extract/vec_init for V4HF/V2HF. - - - - --- 2023-11-10 liuhongt New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d. Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d. - - - - --- 2023-11-10 liuhongt New
Fix wrong code due to vec_merge + pcmp to blendvb splitter. Fix wrong code due to vec_merge + pcmp to blendvb splitter. - - - - --- 2023-11-09 liuhongt New
[V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. [V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. - - - - --- 2023-11-07 liuhongt New
Avoid generating RTL code when d->testing_p. Avoid generating RTL code when d->testing_p. - - - - --- 2023-11-06 liuhongt New
Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction. Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction. - - - - --- 2023-11-01 liuhongt New
Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. - - - - --- 2023-10-30 liuhongt New
Fix wrong code due to incorrest define_split Fix wrong code due to incorrest define_split - - - - --- 2023-10-30 liuhongt New
Improve memcmpeq for 512-bit vector with vpcmpeq + kortest. Improve memcmpeq for 512-bit vector with vpcmpeq + kortest. - - - - --- 2023-10-27 liuhongt New
[V2,2/2] Support vec_cmpmn/vcondmn for v2hf/v4hf. [V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… - - - - --- 2023-10-26 liuhongt New
[V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… [V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… - - - - --- 2023-10-26 liuhongt New
[GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_… [GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_… - - - - --- 2023-10-24 liuhongt New
Support vec_cmpmn/vcondmn for v2hf/v4hf. Support vec_cmpmn/vcondmn for v2hf/v4hf. - - - - --- 2023-10-23 liuhongt New
[x86] Remove unused mmx_pinsrw. [x86] Remove unused mmx_pinsrw. - - - - --- 2023-10-20 liuhongt New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-20 liuhongt New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-19 liuhongt New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-18 liuhongt New
Support 32/64-bit vectorization for _Float16 fma related operations. Support 32/64-bit vectorization for _Float16 fma related operations. - - - - --- 2023-10-17 liuhongt New
[2/2] Support 32/64-bit vectorization for conversion between _Float16 and integer/float. [1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. - - - - --- 2023-10-12 liuhongt New
[1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. [1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. - - - - --- 2023-10-12 liuhongt New
[x86] Refine predicate of operands[2] in divv4hf3 with register_operand. [x86] Refine predicate of operands[2] in divv4hf3 with register_operand. - - - - --- 2023-10-10 liuhongt New
[2/2] Support signbit/xorsign/copysign/abs/neg/and/xor/ior/andn for V2HF/V4HF. [1/2,x86] Support smin/smax for V2HF/V4HF - - - - --- 2023-10-08 liuhongt New
[1/2,x86] Support smin/smax for V2HF/V4HF [1/2,x86] Support smin/smax for V2HF/V4HF - - - - --- 2023-10-08 liuhongt New
Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative. Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative. - - - - --- 2023-09-11 liuhongt New
Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative. Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative. - - - - --- 2023-09-08 liuhongt New
Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes. Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes. - - - - --- 2023-09-07 liuhongt New
Generate vmovsh instead of vpblendw for specific vec_merge. Generate vmovsh instead of vpblendw for specific vec_merge. - - - - --- 2023-09-04 liuhongt New
Adjust costing of emulated vectorized gather/scatter Adjust costing of emulated vectorized gather/scatter - - - - --- 2023-08-30 liuhongt New
Refactor vector HF/BF mode iterators and patterns. Refactor vector HF/BF mode iterators and patterns. - - - - --- 2023-08-30 liuhongt New
Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. - - - - --- 2023-08-25 liuhongt New
[x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H [x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H - - - - --- 2023-08-24 liuhongt New
Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") - - - - --- 2023-08-23 liuhongt New
[x86] Testcase fix. [x86] Testcase fix. - - - - --- 2023-08-22 liuhongt New
Adjust testcase for Intel GDS. Adjust testcase for Intel GDS. - - - - --- 2023-08-21 liuhongt New
Mention Intel -march=gracemont for Alderlake-N. Mention Intel -march=gracemont for Alderlake-N. - - - - --- 2023-08-21 liuhongt New
Support -march=gracemont Support -march=gracemont - - - - --- 2023-08-18 liuhongt New
Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. - - - - --- 2023-08-14 liuhongt New
[V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… [V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… - - - - --- 2023-08-11 liuhongt New
Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. - - - - --- 2023-08-11 liuhongt New
Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… - - - - --- 2023-08-10 liuhongt New
i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] - - - - --- 2023-08-10 liuhongt New
Rename local variable subleaf_level to max_subleaf_level. Rename local variable subleaf_level to max_subleaf_level. - - - - --- 2023-08-09 liuhongt New
[V2,X86] Workaround possible CPUID bug in Sandy Bridge. [V2,X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-09 liuhongt New
[X86] Workaround possible CPUID bug in Sandy Bridge. [X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-08 liuhongt New
i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] - - - - --- 2023-08-07 liuhongt New
Fix ICE in rtl check when bootstrap. Fix ICE in rtl check when bootstrap. - - - - --- 2023-08-07 liuhongt New
Optimize vlddqu + inserti128 to vbroadcasti128 Optimize vlddqu + inserti128 to vbroadcasti128 - - - - --- 2023-08-02 liuhongt New
Support vec_fmaddsub/vec_fmsubadd for vector HFmode. Support vec_fmaddsub/vec_fmsubadd for vector HFmode. - - - - --- 2023-08-02 liuhongt New
Adjust testcase for more optimal codegen. Adjust testcase for more optimal codegen. - - - - --- 2023-08-01 liuhongt New
[x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. [x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. - - - - --- 2023-07-27 liuhongt New
Optimize vlddqu to vmovdqu for TARGET_AVX Optimize vlddqu to vmovdqu for TARGET_AVX - - - - --- 2023-07-20 liuhongt New
Fix fp16 related testcase failure for i686. Fix fp16 related testcase failure for i686. - - - - --- 2023-07-20 liuhongt New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. - - - - --- 2023-07-17 liuhongt New
Fix typo in the testcase. Fix typo in the testcase. - - - - --- 2023-07-11 liuhongt New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 liuhongt New
« 1 2 3 45 6 »