GNU Compiler Collection

Show patches with: Submitter = liuhongt | State = Action Required | Archived = No | 544 patches

Patch	Series	A/F/R/T	S/W/F	Date	Submitter	State
Fix fp16 related testcase failure for i686.	Fix fp16 related testcase failure for i686.	- - - -	---	2023-07-20	liuhongt	New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.	Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.	- - - -	---	2023-07-17	liuhongt	New
Fix typo in the testcase.	Fix typo in the testcase.	- - - -	---	2023-07-11	liuhongt	New
Add peephole to eliminate redundant comparison after cmpccxadd.	Add peephole to eliminate redundant comparison after cmpccxadd.	- - - -	---	2023-07-11	liuhongt	New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…	[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…	- - - -	---	2023-07-11	liuhongt	New
Add peephole to eliminate redundant comparison after cmpccxadd.	Add peephole to eliminate redundant comparison after cmpccxadd.	- - - -	---	2023-07-11	liuhongt	New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…	Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…	- - - -	---	2023-07-10	liuhongt	New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern.	[V2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-07	liuhongt	New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations.	[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-06	liuhongt	New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-06	liuhongt	New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.	Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.	- - - -	---	2023-07-06	liuhongt	New
Break false dependence for vpternlog by inserting vpxor.	Break false dependence for vpternlog by inserting vpxor.	- - - -	---	2023-07-04	liuhongt	New
[2/2] Make option mvzeroupper independent of optimization level.	[1/2] Don't issue vzeroupper for vzeroupper call_insn.	- - - -	---	2023-06-27	liuhongt	New
[1/2] Don't issue vzeroupper for vzeroupper call_insn.	[1/2] Don't issue vzeroupper for vzeroupper call_insn.	- - - -	---	2023-06-27	liuhongt	New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV.	[x86] Refine maskstore patterns with UNSPEC_MASKMOV.	- - - -	---	2023-06-27	liuhongt	New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.	Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.	- - - -	---	2023-06-26	liuhongt	New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	liuhongt	New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	liuhongt	New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	liuhongt	New
Refine maskloadmn pattern with UNSPEC_MASKLOAD.	Refine maskloadmn pattern with UNSPEC_MASKLOAD.	- - - -	---	2023-06-21	liuhongt	New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	- - - -	---	2023-06-20	liuhongt	New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns.	[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	- - - -	---	2023-06-16	liuhongt	New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	- - - -	---	2023-06-16	liuhongt	New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.	[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.	- - - -	---	2023-06-14	liuhongt	New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	- - - -	---	2023-06-06	liuhongt	New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.	[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.	- - - -	---	2023-06-06	liuhongt	New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.	Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.	- - - -	---	2023-06-06	liuhongt	New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	- - - -	---	2023-06-06	liuhongt	New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.	[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.	- - - -	---	2023-06-05	liuhongt	New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	- - - -	---	2023-06-02	liuhongt	New
i386: Add missing vector truncate patterns [PR92658].	i386: Add missing vector truncate patterns [PR92658].	- - - -	---	2023-06-02	liuhongt	New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.	Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.	- - - -	---	2023-06-01	liuhongt	New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.	Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.	- - - -	---	2023-05-26	liuhongt	New
[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes.	[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes.	- - - -	---	2023-05-26	liuhongt	New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR.	Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR.	- - - -	---	2023-05-22	liuhongt	New
Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode.	Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode.	- - - -	---	2023-05-17	liuhongt	New
[V2] Provide -fcf-protection=branch,return.	[V2] Provide -fcf-protection=branch,return.	- - - -	---	2023-05-13	liuhongt	New
Provide -fcf-protection=branch,return.	Provide -fcf-protection=branch,return.	- - - -	---	2023-05-12	liuhongt	New
x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.	x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.	- - - -	---	2023-05-10	liuhongt	New
Detect bswap + rotate for byte permutation in pass_bswap.	Detect bswap + rotate for byte permutation in pass_bswap.	- - - -	---	2023-05-09	liuhongt	New
[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.	[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.	- - - -	---	2023-05-08	liuhongt	New
[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from …	[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from …	- - - -	---	2023-05-04	liuhongt	New
[v2] Canonicalize vec_merge when mask is constant.	[v2] Canonicalize vec_merge when mask is constant.	- - - -	---	2023-05-04	liuhongt	New
[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.	[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.	- - - -	---	2023-04-26	liuhongt	New
Add testcases for ffs/ctz vectorization.	Add testcases for ffs/ctz vectorization.	- - - -	---	2023-04-23	liuhongt	New
[2/2,i386] def_or_undef __STDCPP_FLOAT16_T__ and __STDCPP_BFLOAT16_T__ for target attribute/pragmas.	[1/2,i386] Support type _Float16/__bf16 independent of SSE2.	- - - -	---	2023-04-21	liuhongt	New
[1/2,i386] Support type _Float16/__bf16 independent of SSE2.	[1/2,i386] Support type _Float16/__bf16 independent of SSE2.	- - - -	---	2023-04-21	liuhongt	New
Canonicalize vec_merge when mask is constant.	Canonicalize vec_merge when mask is constant.	- - - -	---	2023-04-20	liuhongt	New
[2/2] Adjust testcases after better RA decision.	[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.	- - - -	---	2023-04-20	liuhongt	New
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.	[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.	- - - -	---	2023-04-20	liuhongt	New
[i386] Support type _Float16/__bf16 independent of SSE2.	[i386] Support type _Float16/__bf16 independent of SSE2.	- - - -	---	2023-04-19	liuhongt	New
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla…	Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla…	- - - -	---	2023-04-04	liuhongt	New
Document signbitm2.	Document signbitm2.	- - - -	---	2023-03-31	liuhongt	New
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8.	Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8.	- - - -	---	2023-03-31	liuhongt	New
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na…	[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na…	- - - -	---	2023-03-30	liuhongt	New
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq.	Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq.	- - - -	---	2023-03-30	liuhongt	New
Generate vpblendd instead of vpblendw for V4SI under AVX2.	Generate vpblendd instead of vpblendw for V4SI under AVX2.	- - - -	---	2023-03-29	liuhongt	New
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore.	Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore.	- - - -	---	2023-03-22	liuhongt	New
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant.	[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant.	- - - -	---	2023-02-02	liuhongt	New
Change AVX512FP16 to AVX512-FP16 which is official name.	Change AVX512FP16 to AVX512-FP16 which is official name.	- - - -	---	2023-01-29	liuhongt	New
Change AVX512FP16 to AVX512-FP16 in the document.	Change AVX512FP16 to AVX512-FP16 in the document.	- - - -	---	2023-01-29	liuhongt	New
Don't add crtfastmath.o for -shared.	Don't add crtfastmath.o for -shared.	- - - -	---	2023-01-13	liuhongt	New
[V2,2/2,x86] x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.	Untitled series #332816	- - - -	---	2022-12-15	liuhongt	New
[V2,1/2] x86: Don't add crtfastmath.o for -shared		- - - -	---	2022-12-15	liuhongt	New
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ…	[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ…	- - - -	---	2022-12-14	liuhongt	New
[x86] Fix ICE due to condition mismatch between expander and define_insn.	[x86] Fix ICE due to condition mismatch between expander and define_insn.	- - - -	---	2022-12-06	liuhongt	New
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1.	[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1.	- - - -	---	2022-12-02	liuhongt	New
[x86] Fix ICE due to incorrect insn type.	[x86] Fix ICE due to incorrect insn type.	- - - -	---	2022-12-01	liuhongt	New
[1/2,V2] Implement hwasan target_hook.	[1/2,V2] Implement hwasan target_hook.	- - - -	---	2022-11-30	liuhongt	New
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode.	[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode.	- - - -	---	2022-11-28	liuhongt	New
[V3,x86] Fix incorrect _mm_cvtsbh_ss.	[V3,x86] Fix incorrect _mm_cvtsbh_ss.	- - - -	---	2022-11-25	liuhongt	New
[v2,x86] Fix incorrect _mm_cvtsbh_ss.	[v2,x86] Fix incorrect _mm_cvtsbh_ss.	- - - -	---	2022-11-24	liuhongt	New
[x86] Fix incorrect implementation for mm_cvtsbh_ss.	[x86] Fix incorrect implementation for mm_cvtsbh_ss.	- - - -	---	2022-11-23	liuhongt	New
[x86] Some tidy up for RA related hooks.	[x86] Some tidy up for RA related hooks.	- - - -	---	2022-11-21	liuhongt	New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins.	[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins.	- - - -	---	2022-11-18	liuhongt	New
[2/2] Enable hwasan for x86-64.	Support HWASAN with Intel LAM	- - - -	---	2022-11-11	liuhongt	New
[1/2] Implement hwasan target_hook.	Support HWASAN with Intel LAM	- - - -	---	2022-11-11	liuhongt	New
Fix incorrect insn type to avoid ICE in memory attr auto-detection.	Fix incorrect insn type to avoid ICE in memory attr auto-detection.	- - - -	---	2022-11-08	liuhongt	New
Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count.	Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count.	- - - -	---	2022-10-31	liuhongt	New
[V2,x86] Fix incorrect digit constraint	[V2,x86] Fix incorrect digit constraint	- - - -	---	2022-10-31	liuhongt	New
[x86] Fix incorrect digit constraint	[x86] Fix incorrect digit constraint	- - - -	---	2022-10-27	liuhongt	New
[x86] Enable V4BFmode and V2BFmode.	[x86] Enable V4BFmode and V2BFmode.	- - - -	---	2022-10-26	liuhongt	New
Canonicalize vec_perm index to make the first index come from the first vector.	Canonicalize vec_perm index to make the first index come from the first vector.	- - - -	---	2022-10-18	liuhongt	New
[x86] Add define_insn_and_split to support general version of "kxnor".	[x86] Add define_insn_and_split to support general version of "kxnor".	- - - -	---	2022-10-11	liuhongt	New
[x86] Fix unrecognizable insn of cvtss2si.	[x86] Fix unrecognizable insn of cvtss2si.	- - - -	---	2022-10-10	liuhongt	New
Check nonlinear iv in vect_can_advance_ivs_p.	Check nonlinear iv in vect_can_advance_ivs_p.	- - - -	---	2022-09-29	liuhongt	New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.	[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.	- - - -	---	2022-09-26	liuhongt	New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.	[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.	- - - -	---	2022-09-23	liuhongt	New
[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand.	[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand.	- - - -	---	2022-09-22	liuhongt	New
Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant.	Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant.	- - - -	---	2022-09-20	liuhongt	New
Fix incorrect handle in vectorizable_induction for mixed induction type.	Fix incorrect handle in vectorizable_induction for mixed induction type.	- - - -	---	2022-09-20	liuhongt	New
Support 64-bit vectorization for single-precision floating rounding operation.	Support 64-bit vectorization for single-precision floating rounding operation.	- - - -	---	2022-09-20	liuhongt	New
[x86] Adjust issue_rate for latest Intel processors.	[x86] Adjust issue_rate for latest Intel processors.	- - - -	---	2022-09-16	liuhongt	New
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg	[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg	- - - -	---	2022-09-16	liuhongt	New
Modernize ix86_builtin_vectorized_function with corresponding expanders.	Modernize ix86_builtin_vectorized_function with corresponding expanders.	- - - -	---	2022-09-16	liuhongt	New
[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction.	[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction.	- - - -	---	2022-09-14	liuhongt	New
Fix _mm512_cvt_roundps_ph to generate sae instruction.	Fix _mm512_cvt_roundps_ph to generate sae instruction.	- - - -	---	2022-09-05	liuhongt	New
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.	[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.	- - - -	---	2022-08-29	liuhongt	New
Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2	Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2	- - - -	---	2022-08-24	liuhongt	New
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.	[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.	- - - -	---	2022-08-04	liuhongt	New
