Message ID | CAAM7YAkxP06_=y1wg4P1JhPDWhZgRM6+wbFQRG5PFkAq8vgsTw@mail.gmail.com |
---|---|
State | RFC, archived |
Delegated to: | David Miller |
Headers | show |
Le vendredi 21 octobre 2011 à 16:28 +0800, Yan, Zheng a écrit : > On Thu, Oct 20, 2011 at 10:22 AM, Kim Phillips > <kim.phillips@freescale.com> wrote: > > Hi, > > > > I'm trying to debug an IPSec forwarding performance slowdown on a > > p2020 dual-core powerpc linux box using s/w crypto (no crypto h/w > > offload enabled) between vanilla kernel versions 2.6.35 and 3.0. > > Using a h/w packet generator set to 64-byte packets, I get the > > following results: > > > > v2.6.35: 48.5kpps > > v3.0.0: 0.2kpps > > v3.0.7: 0.2kpps > > v3.1.0-rc9-01707-gf7ba35d (a recent net-next): 13.6kpps > > > > I was able to bisect the problem down to the following commit: > > > > 7e1dc7b6f709dfc1a9ab4b320dbe723f45992693 is the first bad commit > > commit 7e1dc7b6f709dfc1a9ab4b320dbe723f45992693 > > Author: David S. Miller <davem@davemloft.net> > > Date: Sat Mar 12 02:42:11 2011 -0500 > > > > net: Use flowi4 and flowi6 in xfrm layer. > > > > Signed-off-by: David S. Miller <davem@davemloft.net> > > > > And, indeed, going back one commit (i.e., v2.6.38-rc8-1468-g2032656 > > and manually applying commit 7313714: "xfrm: fix > > __xfrm_route_forward ()"), brings performance back to ~50kpps from > > 0.2kpps. > > > > Tracing shows that the commit breaks the route cache [1], and I > > understand there is major surgery going on in the area [2], so I > > suppose my question is twofold: > > > > (a) was such a large performance drop to be expected for v3.0? > > > > (b) any ideas how to fix? I don't know much about routing > > internals, but in ip_route_input_common(), if I remove the input > > interface comparison (rth->rt_route_iif ^ iif), I get some > > performance back, but the system becomes unstable (it's booted over > > nfs). > > > > Thanks, > > > > Kim > > > > Looks like xfrm4_fill_dst() reset rt->rt_route_iif to 0, it makes the > comparison (rth->rt_route_iif ^ iif) in > ip_route_input_common() return false. > > Please try patch below. It improves the performance of 3.1-rc10 > kernel. (I'm not sure the patch is harmless) > > --- > diff --git a/net/ipv4/xfrm4_policy.c b/net/ipv4/xfrm4_policy.c > index fc5368a..88a0972 100644 > --- a/net/ipv4/xfrm4_policy.c > +++ b/net/ipv4/xfrm4_policy.c > @@ -82,7 +82,6 @@ static int xfrm4_fill_dst(struct xfrm_dst *xdst, > struct net_device *dev, > rt->rt_key_dst = fl4->daddr; > rt->rt_key_src = fl4->saddr; > rt->rt_key_tos = fl4->flowi4_tos; > - rt->rt_route_iif = fl4->flowi4_iif; > rt->rt_iif = fl4->flowi4_iif; > rt->rt_oif = fl4->flowi4_oif; > rt->rt_mark = fl4->flowi4_mark; > -- Hmm, looks like 1b86a58f9d7ce4fe237 (ipv4: Fix "Set rt->rt_iif more sanely on output routes.") assumed xfrm4_fill_dst() was used for input routes. -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 10/21/2011 04:49 PM, Eric Dumazet wrote: > Le vendredi 21 octobre 2011 à 16:28 +0800, Yan, Zheng a écrit : >> On Thu, Oct 20, 2011 at 10:22 AM, Kim Phillips >> <kim.phillips@freescale.com> wrote: >>> Hi, >>> >>> I'm trying to debug an IPSec forwarding performance slowdown on a >>> p2020 dual-core powerpc linux box using s/w crypto (no crypto h/w >>> offload enabled) between vanilla kernel versions 2.6.35 and 3.0. >>> Using a h/w packet generator set to 64-byte packets, I get the >>> following results: >>> >>> v2.6.35: 48.5kpps >>> v3.0.0: 0.2kpps >>> v3.0.7: 0.2kpps >>> v3.1.0-rc9-01707-gf7ba35d (a recent net-next): 13.6kpps >>> >>> I was able to bisect the problem down to the following commit: >>> >>> 7e1dc7b6f709dfc1a9ab4b320dbe723f45992693 is the first bad commit >>> commit 7e1dc7b6f709dfc1a9ab4b320dbe723f45992693 >>> Author: David S. Miller <davem@davemloft.net> >>> Date: Sat Mar 12 02:42:11 2011 -0500 >>> >>> net: Use flowi4 and flowi6 in xfrm layer. >>> >>> Signed-off-by: David S. Miller <davem@davemloft.net> >>> >>> And, indeed, going back one commit (i.e., v2.6.38-rc8-1468-g2032656 >>> and manually applying commit 7313714: "xfrm: fix >>> __xfrm_route_forward ()"), brings performance back to ~50kpps from >>> 0.2kpps. >>> >>> Tracing shows that the commit breaks the route cache [1], and I >>> understand there is major surgery going on in the area [2], so I >>> suppose my question is twofold: >>> >>> (a) was such a large performance drop to be expected for v3.0? >>> >>> (b) any ideas how to fix? I don't know much about routing >>> internals, but in ip_route_input_common(), if I remove the input >>> interface comparison (rth->rt_route_iif ^ iif), I get some >>> performance back, but the system becomes unstable (it's booted over >>> nfs). >>> >>> Thanks, >>> >>> Kim >>> >> >> Looks like xfrm4_fill_dst() reset rt->rt_route_iif to 0, it makes the >> comparison (rth->rt_route_iif ^ iif) in >> ip_route_input_common() return false. >> >> Please try patch below. It improves the performance of 3.1-rc10 >> kernel. (I'm not sure the patch is harmless) >> >> --- >> diff --git a/net/ipv4/xfrm4_policy.c b/net/ipv4/xfrm4_policy.c >> index fc5368a..88a0972 100644 >> --- a/net/ipv4/xfrm4_policy.c >> +++ b/net/ipv4/xfrm4_policy.c >> @@ -82,7 +82,6 @@ static int xfrm4_fill_dst(struct xfrm_dst *xdst, >> struct net_device *dev, >> rt->rt_key_dst = fl4->daddr; >> rt->rt_key_src = fl4->saddr; >> rt->rt_key_tos = fl4->flowi4_tos; >> - rt->rt_route_iif = fl4->flowi4_iif; >> rt->rt_iif = fl4->flowi4_iif; >> rt->rt_oif = fl4->flowi4_oif; >> rt->rt_mark = fl4->flowi4_mark; >> -- > > Hmm, looks like 1b86a58f9d7ce4fe237 (ipv4: Fix "Set rt->rt_iif more > sanely on output routes.") assumed xfrm4_fill_dst() was used for input > routes. > xfrm4_fill_dst() is used for input routes. The problem is fl4->flowi4_iif is zero. I don't know why xfrm_decode_session() does not set fl4->flowi4_iif. Regards Yan, Zheng -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Fri, 21 Oct 2011 16:28:30 +0800 "Yan, Zheng" <zheng.z.yan@linux.intel.com> wrote: > On Thu, Oct 20, 2011 at 10:22 AM, Kim Phillips > <kim.phillips@freescale.com> wrote: > > (b) any ideas how to fix? I don't know much about routing > > internals, but in ip_route_input_common(), if I remove the input > > interface comparison (rth->rt_route_iif ^ iif), I get some > > performance back, but the system becomes unstable (it's booted over > > nfs). > > Looks like xfrm4_fill_dst() reset rt->rt_route_iif to 0, it makes the > comparison (rth->rt_route_iif ^ iif) in > ip_route_input_common() return false. > > Please try patch below. It improves the performance of 3.1-rc10 > kernel. yes, thanks, ~50kpps performance is restored when applying this diff to current net-next. > (I'm not sure the patch is harmless) the system appears to be more stable, but this is still concerning. Kim -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/net/ipv4/xfrm4_policy.c b/net/ipv4/xfrm4_policy.c index fc5368a..88a0972 100644 --- a/net/ipv4/xfrm4_policy.c +++ b/net/ipv4/xfrm4_policy.c @@ -82,7 +82,6 @@ static int xfrm4_fill_dst(struct xfrm_dst *xdst, struct net_device *dev, rt->rt_key_dst = fl4->daddr; rt->rt_key_src = fl4->saddr; rt->rt_key_tos = fl4->flowi4_tos; - rt->rt_route_iif = fl4->flowi4_iif; rt->rt_iif = fl4->flowi4_iif; rt->rt_oif = fl4->flowi4_oif;