Toggle navigation
Patchwork
GNU Compiler Collection
Patches
Bundles
About this project
Login
Register
Mail settings
Show patches with
: Submitter =
liuhongt
| State =
Action Required
| Archived =
No
| 544 patches
Series
Submitter
State
any
Action Required
New
Under Review
Accepted
Rejected
RFC
Not Applicable
Changes Requested
Awaiting Upstream
Superseded
Deferred
Needs Review / ACK
Handled Elsewhere
Search
Archived
No
Yes
Both
Delegate
------
Nobody
jgarzik
arnd
ymano
smfrench
jlayton
tseliot
ogasawara
amitk
awhitcroft
mst
dayangkun
jwboyer
jwboyer
colinking
colinking
azummo
dwmw2
rtg
sconklin
smb
aliguori
bradf
galak
galak
demarchi
ms
bhundven
chbs
kengyu
kadlec
pdp
regit
jabk
laforge
laforge
tonyb
sfr
alai
zecke
zecke
__damien__
luka
luka
prafulla@marvell.com
cyrus
PeterHuewe
kiho
jow
jow
ypwong
nico
dedeckeh
dedeckeh
yousong
yousong
tomcwarren
mb
mrchuck
vineetg76
computersforpeace
Noltari
Noltari
patrick_delaunay
ee07b291
ldir
ldir
stefanct
zhouhan
carldani
blp
ffainelli
ffainelli
regXboi
bbrezillon
pravin
mkp
jpettit
mkresin
mkresin
thess
thess
fbarrat
fbarrat
phil
linville
jesse
tjaalton
esben
abrodkin
abrodkin
diproiettod
tbot
stephenfin
vriera
darball1
sammj
ajd
jogo
jogo
bhelgaas
blogic
blogic
tagr
tagr
tagr
oohal
russellb
ptomsich
agraf
joestringer
davem
davem
davem
mwalle
naveen
pchotard
pepe2k
pepe2k
arj
arj
andmur01
amitay
matttbe
pabeni
istokes
aparcar
Ansuel
goliath
martineau
tytso
danielschwierzeck
hs
mariosix
dcaratti
ovsrobot
ovsrobot
aserdean
XiaoYang
khem
tpetazzoni
mkorpershoek
marex
liwang
robimarko
mmichelson
apritzel
danielhb
groug
npiggin
pareddja
atishp
netdrv
mkubecek
stintel
stintel
jkicinski
cpitchen
maximeh
dsa
jstancek
pm215
bpf
jonhunter
shettyg
lorpie01
acelan
wigyori
wigyori
apopple
dja
alexhung
lynxis
lynxis
brgl
brgl
peda
akodanev
narmstrong
981213
0andriy
chunkeey
snowpatch_ozlabs
snowpatch_ozlabs
snowpatch_ozlabs
aivanov
atishp04
shemminger
blocktrron
monstr
vigneshr
mraynal
horms
stewart
stewart
freenix
rmilecki
rmilecki
rfried
kevery
akumar
jacmet
xypron
wsa
Jaehoon
rsalvaterra
adrianschmutzler
sjg
hegdevasant
hegdevasant
prom
bmeng
jagan
ukleinek
ukleinek
ag
ehristev
metan
kabel
ivanhu
arbab
abelloni
chleroy
pablo
pablo
apconole
svanheule
legoater
legoater
legoater
rw
rw
wbx
trini
Hauke
Hauke
bjonglez
ynezz
aik
sbabic
sbabic
pevik
xback
xback
richiejp
dangole
dangole
forty
next_ghost
anuppatel
anuppatel
echaudron
acer
benh
rgrimm
segher
passgat
pratyush
jms
jms
jms
festevam
mans0n
Andes
ruscur
jmberg
linusw
linusw
ymorin
ymorin
numans
jk
jk
jk
jk
xuyang
matthias_bgg
tambarus
kubu
apalos
dceara
pbrobinson
imaximets
strlen
strlen
spectrum
cazzacarna
neocturne
aldot
TIENFONG
mpe
ktraynor
arnout
nbd
nbd
robh
anguy11
calebccff
paulus
jm
stroese
Apply
«
1
2
3
4
…
5
6
»
Patch
Series
A/F/R/T
S/W/F
Date
Submitter
Delegate
State
Fix fp16 related testcase failure for i686.
Fix fp16 related testcase failure for i686.
- - - -
-
-
-
2023-07-20
liuhongt
New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.
- - - -
-
-
-
2023-07-17
liuhongt
New
Fix typo in the testcase.
Fix typo in the testcase.
- - - -
-
-
-
2023-07-11
liuhongt
New
Add peephole to eliminate redundant comparison after cmpccxadd.
Add peephole to eliminate redundant comparison after cmpccxadd.
- - - -
-
-
-
2023-07-11
liuhongt
New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…
- - - -
-
-
-
2023-07-11
liuhongt
New
Add peephole to eliminate redundant comparison after cmpccxadd.
Add peephole to eliminate redundant comparison after cmpccxadd.
- - - -
-
-
-
2023-07-11
liuhongt
New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…
- - - -
-
-
-
2023-07-10
liuhongt
New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern.
[V2,x86] Add pre_reload splitter to detect fp min/max pattern.
- - - -
-
-
-
2023-07-07
liuhongt
New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations.
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.
- - - -
-
-
-
2023-07-06
liuhongt
New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.
- - - -
-
-
-
2023-07-06
liuhongt
New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.
- - - -
-
-
-
2023-07-06
liuhongt
New
Break false dependence for vpternlog by inserting vpxor.
Break false dependence for vpternlog by inserting vpxor.
- - - -
-
-
-
2023-07-04
liuhongt
New
[2/2] Make option mvzeroupper independent of optimization level.
[1/2] Don't issue vzeroupper for vzeroupper call_insn.
- - - -
-
-
-
2023-06-27
liuhongt
New
[1/2] Don't issue vzeroupper for vzeroupper call_insn.
[1/2] Don't issue vzeroupper for vzeroupper call_insn.
- - - -
-
-
-
2023-06-27
liuhongt
New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV.
[x86] Refine maskstore patterns with UNSPEC_MASKMOV.
- - - -
-
-
-
2023-06-27
liuhongt
New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.
- - - -
-
-
-
2023-06-26
liuhongt
New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007.
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.
- - - -
-
-
-
2023-06-26
liuhongt
New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math.
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.
- - - -
-
-
-
2023-06-26
liuhongt
New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.
- - - -
-
-
-
2023-06-26
liuhongt
New
Refine maskloadmn pattern with UNSPEC_MASKLOAD.
Refine maskloadmn pattern with UNSPEC_MASKLOAD.
- - - -
-
-
-
2023-06-21
liuhongt
New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…
- - - -
-
-
-
2023-06-20
liuhongt
New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns.
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.
- - - -
-
-
-
2023-06-16
liuhongt
New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.
- - - -
-
-
-
2023-06-16
liuhongt
New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.
- - - -
-
-
-
2023-06-14
liuhongt
New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.
- - - -
-
-
-
2023-06-06
liuhongt
New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.
- - - -
-
-
-
2023-06-06
liuhongt
New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.
- - - -
-
-
-
2023-06-06
liuhongt
New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.
- - - -
-
-
-
2023-06-06
liuhongt
New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.
- - - -
-
-
-
2023-06-05
liuhongt
New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…
- - - -
-
-
-
2023-06-02
liuhongt
New
i386: Add missing vector truncate patterns [PR92658].
i386: Add missing vector truncate patterns [PR92658].
- - - -
-
-
-
2023-06-02
liuhongt
New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.
- - - -
-
-
-
2023-06-01
liuhongt
New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.
- - - -
-
-
-
2023-05-26
liuhongt
New
[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes.
[x86] Split notl + pbraodcast + pand to pbroadcast + pandn more modes.
- - - -
-
-
-
2023-05-26
liuhongt
New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR.
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABS_EXPR.
- - - -
-
-
-
2023-05-22
liuhongt
New
Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode.
Only use NO_REGS in cost calculation when !hard_regno_mode_ok for GENERAL_REGS and mode.
- - - -
-
-
-
2023-05-17
liuhongt
New
[V2] Provide -fcf-protection=branch,return.
[V2] Provide -fcf-protection=branch,return.
- - - -
-
-
-
2023-05-13
liuhongt
New
Provide -fcf-protection=branch,return.
Provide -fcf-protection=branch,return.
- - - -
-
-
-
2023-05-12
liuhongt
New
x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.
x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.
- - - -
-
-
-
2023-05-10
liuhongt
New
Detect bswap + rotate for byte permutation in pass_bswap.
Detect bswap + rotate for byte permutation in pass_bswap.
- - - -
-
-
-
2023-05-09
liuhongt
New
[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.
[V2,vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.
- - - -
-
-
-
2023-05-08
liuhongt
New
[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from …
[powerpc] Add a peephole2 to eliminate redundant move from VSX_REGS to GENERAL_REGS when it's from …
- - - -
-
-
-
2023-05-04
liuhongt
New
[v2] Canonicalize vec_merge when mask is constant.
[v2] Canonicalize vec_merge when mask is constant.
- - - -
-
-
-
2023-05-04
liuhongt
New
[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.
[vect] Enhance NARROW FLOAT_EXPR vectorization by truncating integer to lower precision.
- - - -
-
-
-
2023-04-26
liuhongt
New
Add testcases for ffs/ctz vectorization.
Add testcases for ffs/ctz vectorization.
- - - -
-
-
-
2023-04-23
liuhongt
New
[2/2,i386] def_or_undef __STDCPP_FLOAT16_T__ and __STDCPP_BFLOAT16_T__ for target attribute/pragmas.
[1/2,i386] Support type _Float16/__bf16 independent of SSE2.
- - - -
-
-
-
2023-04-21
liuhongt
New
[1/2,i386] Support type _Float16/__bf16 independent of SSE2.
[1/2,i386] Support type _Float16/__bf16 independent of SSE2.
- - - -
-
-
-
2023-04-21
liuhongt
New
Canonicalize vec_merge when mask is constant.
Canonicalize vec_merge when mask is constant.
- - - -
-
-
-
2023-04-20
liuhongt
New
[2/2] Adjust testcases after better RA decision.
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.
- - - -
-
-
-
2023-04-20
liuhongt
New
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.
[1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.
- - - -
-
-
-
2023-04-20
liuhongt
New
[i386] Support type _Float16/__bf16 independent of SSE2.
[i386] Support type _Float16/__bf16 independent of SSE2.
- - - -
-
-
-
2023-04-19
liuhongt
New
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla…
Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg cla…
- - - -
-
-
-
2023-04-04
liuhongt
New
Document signbitm2.
Document signbitm2.
- - - -
-
-
-
2023-03-31
liuhongt
New
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8.
Adjust memory_move_cost for MASK_REGS when MODE_SIZE > 8.
- - - -
-
-
-
2023-03-31
liuhongt
New
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na…
[V2] Rename ufix_trunc/ufloat* patterns to fixuns_trunc/floatuns* to align with standard pattern na…
- - - -
-
-
-
2023-03-30
liuhongt
New
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq.
Support vector conversion for AVX512 vcvtudq2pd/vcvttps2udq/vcvttpd2udq.
- - - -
-
-
-
2023-03-30
liuhongt
New
Generate vpblendd instead of vpblendw for V4SI under AVX2.
Generate vpblendd instead of vpblendw for V4SI under AVX2.
- - - -
-
-
-
2023-03-29
liuhongt
New
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore.
Remove TARGET_GEN_MEMSET_SCRATCH_RTX since it's not used anymore.
- - - -
-
-
-
2023-03-22
liuhongt
New
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant.
[vect] Don't peel nonlinear iv(mult or shift) for epilog when vf is not constant.
- - - -
-
-
-
2023-02-02
liuhongt
New
Change AVX512FP16 to AVX512-FP16 which is official name.
Change AVX512FP16 to AVX512-FP16 which is official name.
- - - -
-
-
-
2023-01-29
liuhongt
New
Change AVX512FP16 to AVX512-FP16 in the document.
Change AVX512FP16 to AVX512-FP16 in the document.
- - - -
-
-
-
2023-01-29
liuhongt
New
Don't add crtfastmath.o for -shared.
Don't add crtfastmath.o for -shared.
- - - -
-
-
-
2023-01-13
liuhongt
New
[V2,2/2,x86] x86: Add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.
Untitled series #332816
- - - -
-
-
-
2022-12-15
liuhongt
New
[V2,1/2] x86: Don't add crtfastmath.o for -shared
- - - -
-
-
-
2022-12-15
liuhongt
New
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ…
[x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ…
- - - -
-
-
-
2022-12-14
liuhongt
New
[x86] Fix ICE due to condition mismatch between expander and define_insn.
[x86] Fix ICE due to condition mismatch between expander and define_insn.
- - - -
-
-
-
2022-12-06
liuhongt
New
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1.
[x86] Improve ix86_expand_fast_convert_bf_to_sf with new extendbfsf2_1.
- - - -
-
-
-
2022-12-02
liuhongt
New
[x86] Fix ICE due to incorrect insn type.
[x86] Fix ICE due to incorrect insn type.
- - - -
-
-
-
2022-12-01
liuhongt
New
[1/2,V2] Implement hwasan target_hook.
[1/2,V2] Implement hwasan target_hook.
- - - -
-
-
-
2022-11-30
liuhongt
New
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode.
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode.
- - - -
-
-
-
2022-11-28
liuhongt
New
[V3,x86] Fix incorrect _mm_cvtsbh_ss.
[V3,x86] Fix incorrect _mm_cvtsbh_ss.
- - - -
-
-
-
2022-11-25
liuhongt
New
[v2,x86] Fix incorrect _mm_cvtsbh_ss.
[v2,x86] Fix incorrect _mm_cvtsbh_ss.
- - - -
-
-
-
2022-11-24
liuhongt
New
[x86] Fix incorrect implementation for mm_cvtsbh_ss.
[x86] Fix incorrect implementation for mm_cvtsbh_ss.
- - - -
-
-
-
2022-11-23
liuhongt
New
[x86] Some tidy up for RA related hooks.
[x86] Some tidy up for RA related hooks.
- - - -
-
-
-
2022-11-21
liuhongt
New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins.
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins.
- - - -
-
-
-
2022-11-18
liuhongt
New
[2/2] Enable hwasan for x86-64.
Support HWASAN with Intel LAM
- - - -
-
-
-
2022-11-11
liuhongt
New
[1/2] Implement hwasan target_hook.
Support HWASAN with Intel LAM
- - - -
-
-
-
2022-11-11
liuhongt
New
Fix incorrect insn type to avoid ICE in memory attr auto-detection.
Fix incorrect insn type to avoid ICE in memory attr auto-detection.
- - - -
-
-
-
2022-11-08
liuhongt
New
Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count.
Enable more optimization for 32-bit/64-bit shrd/shld with imm shift count.
- - - -
-
-
-
2022-10-31
liuhongt
New
[V2,x86] Fix incorrect digit constraint
[V2,x86] Fix incorrect digit constraint
- - - -
-
-
-
2022-10-31
liuhongt
New
[x86] Fix incorrect digit constraint
[x86] Fix incorrect digit constraint
- - - -
-
-
-
2022-10-27
liuhongt
New
[x86] Enable V4BFmode and V2BFmode.
[x86] Enable V4BFmode and V2BFmode.
- - - -
-
-
-
2022-10-26
liuhongt
New
Canonicalize vec_perm index to make the first index come from the first vector.
Canonicalize vec_perm index to make the first index come from the first vector.
- - - -
-
-
-
2022-10-18
liuhongt
New
[x86] Add define_insn_and_split to support general version of "kxnor".
[x86] Add define_insn_and_split to support general version of "kxnor".
- - - -
-
-
-
2022-10-11
liuhongt
New
[x86] Fix unrecognizable insn of cvtss2si.
[x86] Fix unrecognizable insn of cvtss2si.
- - - -
-
-
-
2022-10-10
liuhongt
New
Check nonlinear iv in vect_can_advance_ivs_p.
Check nonlinear iv in vect_can_advance_ivs_p.
- - - -
-
-
-
2022-09-29
liuhongt
New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.
- - - -
-
-
-
2022-09-26
liuhongt
New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1.
- - - -
-
-
-
2022-09-23
liuhongt
New
[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand.
[x86] Fix typo in floorv2sf2, should be register_operand for op1, not vector_operand.
- - - -
-
-
-
2022-09-22
liuhongt
New
Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant.
Don't check can_vec_perm_const_p for nonlinear iv_init when it's constant.
- - - -
-
-
-
2022-09-20
liuhongt
New
Fix incorrect handle in vectorizable_induction for mixed induction type.
Fix incorrect handle in vectorizable_induction for mixed induction type.
- - - -
-
-
-
2022-09-20
liuhongt
New
Support 64-bit vectorization for single-precision floating rounding operation.
Support 64-bit vectorization for single-precision floating rounding operation.
- - - -
-
-
-
2022-09-20
liuhongt
New
[x86] Adjust issue_rate for latest Intel processors.
[x86] Adjust issue_rate for latest Intel processors.
- - - -
-
-
-
2022-09-16
liuhongt
New
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg
- - - -
-
-
-
2022-09-16
liuhongt
New
Modernize ix86_builtin_vectorized_function with corresponding expanders.
Modernize ix86_builtin_vectorized_function with corresponding expanders.
- - - -
-
-
-
2022-09-16
liuhongt
New
[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction.
[ICE] Check another epilog variable peeling case in vectorizable_nonlinear_induction.
- - - -
-
-
-
2022-09-14
liuhongt
New
Fix _mm512_cvt_roundps_ph to generate sae instruction.
Fix _mm512_cvt_roundps_ph to generate sae instruction.
- - - -
-
-
-
2022-09-05
liuhongt
New
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.
- - - -
-
-
-
2022-08-29
liuhongt
New
Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2
Don't gimple fold ymm-version vblendvpd/vblendvps/vpblendvb w/o TARGET_AVX2
- - - -
-
-
-
2022-08-24
liuhongt
New
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant.
- - - -
-
-
-
2022-08-04
liuhongt
New
«
1
2
3
4
…
5
6
»