From patchwork Fri Mar 13 21:53:47 2009 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Tantilov, Emil S" X-Patchwork-Id: 24394 X-Patchwork-Delegate: davem@davemloft.net Return-Path: X-Original-To: patchwork-incoming@ozlabs.org Delivered-To: patchwork-incoming@ozlabs.org Received: from vger.kernel.org (vger.kernel.org [209.132.176.167]) by ozlabs.org (Postfix) with ESMTP id C457FDDFE1 for ; Sat, 14 Mar 2009 08:53:55 +1100 (EST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756662AbZCMVxw (ORCPT ); Fri, 13 Mar 2009 17:53:52 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755135AbZCMVxw (ORCPT ); Fri, 13 Mar 2009 17:53:52 -0400 Received: from mga09.intel.com ([134.134.136.24]:19076 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753817AbZCMVxv convert rfc822-to-8bit (ORCPT ); Fri, 13 Mar 2009 17:53:51 -0400 Received: from orsmga001.jf.intel.com ([10.7.209.18]) by orsmga102.jf.intel.com with ESMTP; 13 Mar 2009 14:45:57 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.38,360,1233561600"; d="scan'208";a="497654645" Received: from rrsmsx603.amr.corp.intel.com ([10.31.0.57]) by orsmga001.jf.intel.com with ESMTP; 13 Mar 2009 14:53:25 -0700 Received: from rrsmsx501.amr.corp.intel.com ([10.31.0.148]) by rrsmsx603.amr.corp.intel.com ([10.31.0.57]) with mapi; Fri, 13 Mar 2009 15:53:48 -0600 From: "Tantilov, Emil S" To: Herbert Xu CC: "Duyck, Alexander H" , "Brandeburg, Jesse" , "Kirsher, Jeffrey T" , "David S. Miller" , "netdev@vger.kernel.org" Date: Fri, 13 Mar 2009 15:53:47 -0600 Subject: RE: FW: Kernel panic during stress with igb in the upstream kernel Thread-Topic: FW: Kernel panic during stress with igb in the upstream kernel Thread-Index: Acmj0j4c8w96JBvuT7qIv1PbaUhwKAAU6bCg Message-ID: References: <20090313115245.GA31949@gondor.apana.org.au> In-Reply-To: <20090313115245.GA31949@gondor.apana.org.au> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US MIME-Version: 1.0 Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org Thanks Herbert, With this patch the stress test has been going for more than 4 hours now (compared to minutes until panic before). I will leave the test running over the weekend, but at least for now looks like it's fixed. Thanks, Emil -----Original Message----- From: Herbert Xu [mailto:herbert@gondor.apana.org.au] Sent: Friday, March 13, 2009 4:53 AM To: Tantilov, Emil S Cc: Duyck, Alexander H; Brandeburg, Jesse; Kirsher, Jeffrey T; David S. Miller; netdev@vger.kernel.org Subject: Re: FW: Kernel panic during stress with igb in the upstream kernel On Thu, Mar 12, 2009 at 09:31:18AM -0600, Tantilov, Emil S wrote: > > Yeah - that's probably the case. I will restart the test with LRO off to make sure. > > I was also able to reproduce the panic with the master branch, so I think we should notify netdev. I can't figure out how I missed this before ... Something must be different about this configuration. I am running all kinds of stress tests in parallel with igb/igbvf drivers and I don't see this. > > Here's the trace: > > general protection fault: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC > last sysfs file: /sys/devices/pci0000:00/0000:00:04.0/0000:0b:00.0/local_cpus > CPU 3 > Modules linked in: igb nfsd lockd exportfs sunrpc pci_slot [last unloaded: igb] > Pid: 28747, comm: netperf Not tainted 2.6.29-rc7-net-next-igb-master #3 X7DA8 > RIP: 0010:[] [] netpoll_receive_skb+0xe/0x1e Sorry, it seems that I've managed to get the net/net-next patches mixed up. Does this help? GRO: Move netpoll checks to correct location As my netpoll fix for net doesn't really work for net-next, we need this update to move the checks into the right place. As it stands we may pass freed skbs to netpoll_receive_skb. This patch also introduces a netpoll_rx_on function to avoid GRO completely if we're invoked through netpoll. This might seem paranoid but as netpoll may have an external receive hook it's better to be safe than sorry. I don't think we need this for 2.6.29 though since there's nothing immediately broken by it. This patch also moves the GRO_* return values to netdevice.h since VLAN needs them too (I tried to avoid this originally but alas this seems to be the easiest way out). This fixes a bug in VLAN where it continued to use the old return value 2 instead of the correct GRO_DROP. Signed-off-by: Herbert Xu Cheers, diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h index 493b065..be3ebd7 100644 --- a/include/linux/netdevice.h +++ b/include/linux/netdevice.h @@ -330,6 +330,14 @@ enum NAPI_STATE_NPSVC, /* Netpoll - don't dequeue from poll_list */ }; +enum { + GRO_MERGED, + GRO_MERGED_FREE, + GRO_HELD, + GRO_NORMAL, + GRO_DROP, +}; + extern void __napi_schedule(struct napi_struct *n); static inline int napi_disable_pending(struct napi_struct *n) diff --git a/include/linux/netpoll.h b/include/linux/netpoll.h index e38d3c9..5f239dc 100644 --- a/include/linux/netpoll.h +++ b/include/linux/netpoll.h @@ -63,6 +63,13 @@ static inline int netpoll_rx(struct sk_buff *skb) return ret; } +static inline int netpoll_rx_on(struct sk_buff *skb) +{ + struct netpoll_info *npinfo = skb->dev->npinfo; + + return npinfo && (npinfo->rx_np || npinfo->rx_flags); +} + static inline int netpoll_receive_skb(struct sk_buff *skb) { if (!list_empty(&skb->dev->napi_list)) diff --git a/net/8021q/vlan_core.c b/net/8021q/vlan_core.c index 2d6e405..6227248 100644 --- a/net/8021q/vlan_core.c +++ b/net/8021q/vlan_core.c @@ -79,6 +79,9 @@ static int vlan_gro_common(struct napi_struct *napi, struct vlan_group *grp, { struct sk_buff *p; + if (netpoll_rx_on(skb)) + return GRO_NORMAL; + if (skb_bond_should_drop(skb)) goto drop; @@ -98,7 +101,7 @@ static int vlan_gro_common(struct napi_struct *napi, struct vlan_group *grp, return dev_gro_receive(napi, skb); drop: - return 2; + return GRO_DROP; } int vlan_gro_receive(struct napi_struct *napi, struct vlan_group *grp, @@ -106,9 +109,6 @@ int vlan_gro_receive(struct napi_struct *napi, struct vlan_group *grp, { skb_gro_reset_offset(skb); - if (netpoll_receive_skb(skb)) - return NET_RX_DROP; - return napi_skb_finish(vlan_gro_common(napi, grp, vlan_tci, skb), skb); } EXPORT_SYMBOL(vlan_gro_receive); @@ -121,9 +121,6 @@ int vlan_gro_frags(struct napi_struct *napi, struct vlan_group *grp, if (!skb) return NET_RX_DROP; - if (netpoll_receive_skb(skb)) - return NET_RX_DROP; - return napi_frags_finish(napi, skb, vlan_gro_common(napi, grp, vlan_tci, skb)); } diff --git a/net/core/dev.c b/net/core/dev.c index 033d7ca..7bd3c29 100644 --- a/net/core/dev.c +++ b/net/core/dev.c @@ -135,14 +135,6 @@ /* This should be increased if a protocol with a bigger head is added. */ #define GRO_MAX_HEAD (MAX_HEADER + 128) -enum { - GRO_MERGED, - GRO_MERGED_FREE, - GRO_HELD, - GRO_NORMAL, - GRO_DROP, -}; - /* * The list of packet types we will receive (as opposed to discard) * and the routines to invoke. @@ -2474,6 +2466,9 @@ static int __napi_gro_receive(struct napi_struct *napi, struct sk_buff *skb) { struct sk_buff *p; + if (netpoll_rx_on(skb)) + return GRO_NORMAL; + for (p = napi->gro_list; p; p = p->next) { NAPI_GRO_CB(p)->same_flow = !compare_ether_header( skb_mac_header(p), skb_gro_mac_header(skb)); @@ -2487,9 +2482,6 @@ int napi_skb_finish(int ret, struct sk_buff *skb) { int err = NET_RX_SUCCESS; - if (netpoll_receive_skb(skb)) - return NET_RX_DROP; - switch (ret) { case GRO_NORMAL: return netif_receive_skb(skb); @@ -2587,9 +2579,6 @@ int napi_frags_finish(struct napi_struct *napi, struct sk_buff *skb, int ret) { int err = NET_RX_SUCCESS; - if (netpoll_receive_skb(skb)) - return NET_RX_DROP; - switch (ret) { case GRO_NORMAL: case GRO_HELD: