Message ID | 20100513073117.3528.33132.stgit@jf-dev2-dcblab |
---|---|
State | Superseded, archived |
Delegated to: | David Miller |
Headers | show |
Fastabend, John R wrote: > Currently, the accelerated receive path for VLAN's will > drop packets if the real device is an inactive slave and > is not one of the special pkts tested for in > skb_bond_should_drop(). This behavior is different then > the non-accelerated path and for pkts over a bonded vlan. > > For example, > > vlanx -> bond0 -> ethx > > will be dropped in the vlan path and not delivered to any > packet handlers at all. However, > > bond0 -> vlanx -> ethx > > and > > bond0 -> ethx > > will be delivered to handlers that match the exact dev, > because the VLAN path checks the real_dev which is not a > slave and netif_recv_skb() doesn't drop frames but only > delivers them to exact matches. > > This patch adds a sk_buff flag which is used for tagging > skbs that would previously been dropped and allows the > skb to continue to skb_netif_recv(). Here we add > logic to check for the bond_should_drop flag and if it > is set only deliver to handlers that match exactly. This > makes both paths above consistent and gives pkt handlers > a way to identify skbs that come from inactive slaves. > Without this patch in some configurations skbs will be > delivered to handlers with exact matches and in others > be dropped out right in the vlan path. > > I have tested the following 4 configurations in failover modes > and load balancing modes. > > # bond0 -> ethx > > # vlanx -> bond0 -> ethx > > # bond0 -> vlanx -> ethx > > # bond0 -> ethx > | > vlanx -> -- > > Signed-off-by: John Fastabend <john.r.fastabend@intel.com> > --- > Jay, I know you were looking over this series at one point. Did you have any comments? Sorry for the ping just wanted to keep this on my radar. Thanks, John -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
From: John Fastabend <john.r.fastabend@intel.com> Date: Wed, 26 May 2010 21:52:58 -0700 > I know you were looking over this series at one point. Did you have > any comments? Sorry for the ping just wanted to keep this on my > radar. I'll hold this last one until Jay has a chance to comment on it. -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
David Miller <davem@davemloft.net> wrote: >From: John Fastabend <john.r.fastabend@intel.com> >Date: Wed, 26 May 2010 21:52:58 -0700 > >> I know you were looking over this series at one point. Did you have >> any comments? Sorry for the ping just wanted to keep this on my >> radar. > >I'll hold this last one until Jay has a chance to comment on it. I've looked at it, and was initially hoping to combine this with Andy/Neil's vaguely similar changes, but I don't see a reasonable way to do that. I think the functionality is reasonable, i.e., adding a facility to implement "direct bind" delivery of packets on inactive bonding slaves (where direct bind means that the struct packet_type has a non-NULL dev). I have a couple of concerns about the patch itself: >From: John Fastabend <john.r.fastabend@intel.com> >Subject: [PATCH 3/3] net: deliver skbs on inactive slaves to exact matches >To: andy@greyhouse.net, fubar@us.ibm.com, nhorman@tuxdriver.com, > bonding-devel@lists.sourceforge.net, netdev@vger.kernel.org >Cc: john.r.fastabend@intel.com >Date: Thu, 13 May 2010 00:31:17 -0700 > >Currently, the accelerated receive path for VLAN's will >drop packets if the real device is an inactive slave and >is not one of the special pkts tested for in >skb_bond_should_drop(). This behavior is different then >the non-accelerated path and for pkts over a bonded vlan. > >For example, > >vlanx -> bond0 -> ethx > >will be dropped in the vlan path and not delivered to any >packet handlers at all. However, > >bond0 -> vlanx -> ethx > >and > >bond0 -> ethx > >will be delivered to handlers that match the exact dev, >because the VLAN path checks the real_dev which is not a >slave and netif_recv_skb() doesn't drop frames but only >delivers them to exact matches. > >This patch adds a sk_buff flag which is used for tagging >skbs that would previously been dropped and allows the >skb to continue to skb_netif_recv(). Here we add >logic to check for the bond_should_drop flag and if it >is set only deliver to handlers that match exactly. This >makes both paths above consistent and gives pkt handlers >a way to identify skbs that come from inactive slaves. >Without this patch in some configurations skbs will be >delivered to handlers with exact matches and in others >be dropped out right in the vlan path. > >I have tested the following 4 configurations in failover modes >and load balancing modes. > ># bond0 -> ethx > ># vlanx -> bond0 -> ethx > ># bond0 -> vlanx -> ethx > ># bond0 -> ethx > | > vlanx -> -- > >Signed-off-by: John Fastabend <john.r.fastabend@intel.com> >--- > > include/linux/skbuff.h | 8 +++++++- > net/8021q/vlan_core.c | 8 ++++++-- > net/core/dev.c | 23 +++++++++++++++++++---- > 3 files changed, 32 insertions(+), 7 deletions(-) > >diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h >index c9525bc..5ba4fd5 100644 >--- a/include/linux/skbuff.h >+++ b/include/linux/skbuff.h >@@ -379,8 +379,14 @@ struct sk_buff { > > kmemcheck_bitfield_begin(flags2); > __u16 queue_mapping:16; >-#ifdef CONFIG_IPV6_NDISC_NODETYPE >+#if defined(CONFIG_IPV6_NDISC_NODETYPE) && \ >+ (defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE)) >+ __u8 ndisc_nodetype:2, >+ bond_should_drop:1; >+#elif defined(CONFIG_IPV6_NDISC_NODETYPE) > __u8 ndisc_nodetype:2; >+#elif defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >+ __u8 bond_should_drop:1; > #endif First, this looks like too much #ifdef soup here. None of the bonding-specific code in the packet receive processing path has historically been #ifdefed out; there are more examples further down in the patch. Second, should the field be named something generic, e.g., "deliver_no_wcard" to indicate its function? "bond_should_drop" doesn't really describe what the field is used for (which is to specify that the packet should be delivered only to non-wildcard packet handlers). With this change, packets are never dropped outright as the result of what the bonding "should drop" logic says. I'm open to alternatives to using a field in the sk_buff; John's original submission added a PACKET_DROP (to supplant PACKET_LOCAL, PACKET_BROADCAST, etc), but that seemed like a loss of information to me. I haven't thought of a way to efficiently accomplish John's goal without putting state into the skb somewhere. -J > kmemcheck_bitfield_end(flags2); > >diff --git a/net/8021q/vlan_core.c b/net/8021q/vlan_core.c >index c584a0a..57ac2d3 100644 >--- a/net/8021q/vlan_core.c >+++ b/net/8021q/vlan_core.c >@@ -11,8 +11,10 @@ int __vlan_hwaccel_rx(struct sk_buff *skb, struct vlan_group *grp, > if (netpoll_rx(skb)) > return NET_RX_DROP; > >+#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) > if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) >- goto drop; >+ skb->bond_should_drop = 1; >+#endif > skb->skb_iif = skb->dev->ifindex; > __vlan_hwaccel_put_tag(skb, vlan_tci); >@@ -83,8 +85,10 @@ vlan_gro_common(struct napi_struct *napi, struct vlan_group *grp, > { > struct sk_buff *p; > >+#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) > if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) >- goto drop; >+ skb->bond_should_drop = 1; >+#endif > > skb->skb_iif = skb->dev->ifindex; > __vlan_hwaccel_put_tag(skb, vlan_tci); >diff --git a/net/core/dev.c b/net/core/dev.c >index 3dc691d..92fdff4 100644 >--- a/net/core/dev.c >+++ b/net/core/dev.c >@@ -2782,11 +2782,13 @@ static int __netif_receive_skb(struct sk_buff *skb) > { > struct packet_type *ptype, *pt_prev; > struct net_device *orig_dev; >- struct net_device *master; > struct net_device *null_or_orig; > struct net_device *orig_or_bond; > int ret = NET_RX_DROP; > __be16 type; >+#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >+ struct net_device *master; >+#endif > > if (!skb->tstamp.tv64) > net_timestamp(skb); >@@ -2801,15 +2803,28 @@ static int __netif_receive_skb(struct sk_buff *skb) > if (!skb->skb_iif) > skb->skb_iif = skb->dev->ifindex; > >+ /* >+ * bonding note: skbs received on inactive slaves should only >+ * be delivered to pkt handlers that are exact matches. Also >+ * the bond_should_drop flag will be set. If packet handlers >+ * are sensitive to duplicate packets these skbs will need to >+ * be dropped at the handler. The vlan accel path may have >+ * already set the bond_should_drop flag. >+ */ > null_or_orig = NULL; > orig_dev = skb->dev; >+#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) > master = ACCESS_ONCE(orig_dev->master); >- if (master) { >- if (skb_bond_should_drop(skb, master)) >+ if (skb->bond_should_drop) >+ null_or_orig = orig_dev; >+ else if (master) { >+ if (skb_bond_should_drop(skb, master)) { >+ skb->bond_should_drop = 1; > null_or_orig = orig_dev; /* deliver only exact match */ >- else >+ } else > skb->dev = master; > } >+#endif > > __get_cpu_var(softnet_data).processed++; > > --- -Jay Vosburgh, IBM Linux Technology Center, fubar@us.ibm.com -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Jay Vosburgh wrote: > David Miller <davem@davemloft.net> wrote: > >> From: John Fastabend <john.r.fastabend@intel.com> >> Date: Wed, 26 May 2010 21:52:58 -0700 >> >>> I know you were looking over this series at one point. Did you have >>> any comments? Sorry for the ping just wanted to keep this on my >>> radar. >> I'll hold this last one until Jay has a chance to comment on it. > > I've looked at it, and was initially hoping to combine this with > Andy/Neil's vaguely similar changes, but I don't see a reasonable way to > do that. > > I think the functionality is reasonable, i.e., adding a facility > to implement "direct bind" delivery of packets on inactive bonding > slaves (where direct bind means that the struct packet_type has a > non-NULL dev). > > I have a couple of concerns about the patch itself: > >> From: John Fastabend <john.r.fastabend@intel.com> >> Subject: [PATCH 3/3] net: deliver skbs on inactive slaves to exact matches >> To: andy@greyhouse.net, fubar@us.ibm.com, nhorman@tuxdriver.com, >> bonding-devel@lists.sourceforge.net, netdev@vger.kernel.org >> Cc: john.r.fastabend@intel.com >> Date: Thu, 13 May 2010 00:31:17 -0700 >> >> Currently, the accelerated receive path for VLAN's will >> drop packets if the real device is an inactive slave and >> is not one of the special pkts tested for in >> skb_bond_should_drop(). This behavior is different then >> the non-accelerated path and for pkts over a bonded vlan. >> >> For example, >> >> vlanx -> bond0 -> ethx >> >> will be dropped in the vlan path and not delivered to any >> packet handlers at all. However, >> >> bond0 -> vlanx -> ethx >> >> and >> >> bond0 -> ethx >> >> will be delivered to handlers that match the exact dev, >> because the VLAN path checks the real_dev which is not a >> slave and netif_recv_skb() doesn't drop frames but only >> delivers them to exact matches. >> >> This patch adds a sk_buff flag which is used for tagging >> skbs that would previously been dropped and allows the >> skb to continue to skb_netif_recv(). Here we add >> logic to check for the bond_should_drop flag and if it >> is set only deliver to handlers that match exactly. This >> makes both paths above consistent and gives pkt handlers >> a way to identify skbs that come from inactive slaves. >> Without this patch in some configurations skbs will be >> delivered to handlers with exact matches and in others >> be dropped out right in the vlan path. >> >> I have tested the following 4 configurations in failover modes >> and load balancing modes. >> >> # bond0 -> ethx >> >> # vlanx -> bond0 -> ethx >> >> # bond0 -> vlanx -> ethx >> >> # bond0 -> ethx >> | >> vlanx -> -- >> >> Signed-off-by: John Fastabend <john.r.fastabend@intel.com> >> --- >> >> include/linux/skbuff.h | 8 +++++++- >> net/8021q/vlan_core.c | 8 ++++++-- >> net/core/dev.c | 23 +++++++++++++++++++---- >> 3 files changed, 32 insertions(+), 7 deletions(-) >> >> diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h >> index c9525bc..5ba4fd5 100644 >> --- a/include/linux/skbuff.h >> +++ b/include/linux/skbuff.h >> @@ -379,8 +379,14 @@ struct sk_buff { >> >> kmemcheck_bitfield_begin(flags2); >> __u16 queue_mapping:16; >> -#ifdef CONFIG_IPV6_NDISC_NODETYPE >> +#if defined(CONFIG_IPV6_NDISC_NODETYPE) && \ >> + (defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE)) >> + __u8 ndisc_nodetype:2, >> + bond_should_drop:1; >> +#elif defined(CONFIG_IPV6_NDISC_NODETYPE) >> __u8 ndisc_nodetype:2; >> +#elif defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >> + __u8 bond_should_drop:1; >> #endif > > First, this looks like too much #ifdef soup here. None of the > bonding-specific code in the packet receive processing path has > historically been #ifdefed out; there are more examples further down in > the patch. > OK, I will remove the #ifdefs. > Second, should the field be named something generic, e.g., > "deliver_no_wcard" to indicate its function? "bond_should_drop" doesn't > really describe what the field is used for (which is to specify that the > packet should be delivered only to non-wildcard packet handlers). With > this change, packets are never dropped outright as the result of what > the bonding "should drop" logic says. > Agreed "deliver_no_wcard" is better. Updated patch coming soon. Thanks, John > I'm open to alternatives to using a field in the sk_buff; John's > original submission added a PACKET_DROP (to supplant PACKET_LOCAL, > PACKET_BROADCAST, etc), but that seemed like a loss of information to > me. I haven't thought of a way to efficiently accomplish John's goal > without putting state into the skb somewhere. > > -J > >> kmemcheck_bitfield_end(flags2); >> >> diff --git a/net/8021q/vlan_core.c b/net/8021q/vlan_core.c >> index c584a0a..57ac2d3 100644 >> --- a/net/8021q/vlan_core.c >> +++ b/net/8021q/vlan_core.c >> @@ -11,8 +11,10 @@ int __vlan_hwaccel_rx(struct sk_buff *skb, struct vlan_group *grp, >> if (netpoll_rx(skb)) >> return NET_RX_DROP; >> >> +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >> if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) >> - goto drop; >> + skb->bond_should_drop = 1; >> +#endif >> skb->skb_iif = skb->dev->ifindex; >> __vlan_hwaccel_put_tag(skb, vlan_tci); >> @@ -83,8 +85,10 @@ vlan_gro_common(struct napi_struct *napi, struct vlan_group *grp, >> { >> struct sk_buff *p; >> >> +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >> if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) >> - goto drop; >> + skb->bond_should_drop = 1; >> +#endif >> >> skb->skb_iif = skb->dev->ifindex; >> __vlan_hwaccel_put_tag(skb, vlan_tci); >> diff --git a/net/core/dev.c b/net/core/dev.c >> index 3dc691d..92fdff4 100644 >> --- a/net/core/dev.c >> +++ b/net/core/dev.c >> @@ -2782,11 +2782,13 @@ static int __netif_receive_skb(struct sk_buff *skb) >> { >> struct packet_type *ptype, *pt_prev; >> struct net_device *orig_dev; >> - struct net_device *master; >> struct net_device *null_or_orig; >> struct net_device *orig_or_bond; >> int ret = NET_RX_DROP; >> __be16 type; >> +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >> + struct net_device *master; >> +#endif >> >> if (!skb->tstamp.tv64) >> net_timestamp(skb); >> @@ -2801,15 +2803,28 @@ static int __netif_receive_skb(struct sk_buff *skb) >> if (!skb->skb_iif) >> skb->skb_iif = skb->dev->ifindex; >> >> + /* >> + * bonding note: skbs received on inactive slaves should only >> + * be delivered to pkt handlers that are exact matches. Also >> + * the bond_should_drop flag will be set. If packet handlers >> + * are sensitive to duplicate packets these skbs will need to >> + * be dropped at the handler. The vlan accel path may have >> + * already set the bond_should_drop flag. >> + */ >> null_or_orig = NULL; >> orig_dev = skb->dev; >> +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) >> master = ACCESS_ONCE(orig_dev->master); >> - if (master) { >> - if (skb_bond_should_drop(skb, master)) >> + if (skb->bond_should_drop) >> + null_or_orig = orig_dev; >> + else if (master) { >> + if (skb_bond_should_drop(skb, master)) { >> + skb->bond_should_drop = 1; >> null_or_orig = orig_dev; /* deliver only exact match */ >> - else >> + } else >> skb->dev = master; >> } >> +#endif >> >> __get_cpu_var(softnet_data).processed++; >> >> > > --- > -Jay Vosburgh, IBM Linux Technology Center, fubar@us.ibm.com -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h index c9525bc..5ba4fd5 100644 --- a/include/linux/skbuff.h +++ b/include/linux/skbuff.h @@ -379,8 +379,14 @@ struct sk_buff { kmemcheck_bitfield_begin(flags2); __u16 queue_mapping:16; -#ifdef CONFIG_IPV6_NDISC_NODETYPE +#if defined(CONFIG_IPV6_NDISC_NODETYPE) && \ + (defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE)) + __u8 ndisc_nodetype:2, + bond_should_drop:1; +#elif defined(CONFIG_IPV6_NDISC_NODETYPE) __u8 ndisc_nodetype:2; +#elif defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) + __u8 bond_should_drop:1; #endif kmemcheck_bitfield_end(flags2); diff --git a/net/8021q/vlan_core.c b/net/8021q/vlan_core.c index c584a0a..57ac2d3 100644 --- a/net/8021q/vlan_core.c +++ b/net/8021q/vlan_core.c @@ -11,8 +11,10 @@ int __vlan_hwaccel_rx(struct sk_buff *skb, struct vlan_group *grp, if (netpoll_rx(skb)) return NET_RX_DROP; +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) - goto drop; + skb->bond_should_drop = 1; +#endif skb->skb_iif = skb->dev->ifindex; __vlan_hwaccel_put_tag(skb, vlan_tci); @@ -83,8 +85,10 @@ vlan_gro_common(struct napi_struct *napi, struct vlan_group *grp, { struct sk_buff *p; +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) if (skb_bond_should_drop(skb, ACCESS_ONCE(skb->dev->master))) - goto drop; + skb->bond_should_drop = 1; +#endif skb->skb_iif = skb->dev->ifindex; __vlan_hwaccel_put_tag(skb, vlan_tci); diff --git a/net/core/dev.c b/net/core/dev.c index 3dc691d..92fdff4 100644 --- a/net/core/dev.c +++ b/net/core/dev.c @@ -2782,11 +2782,13 @@ static int __netif_receive_skb(struct sk_buff *skb) { struct packet_type *ptype, *pt_prev; struct net_device *orig_dev; - struct net_device *master; struct net_device *null_or_orig; struct net_device *orig_or_bond; int ret = NET_RX_DROP; __be16 type; +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) + struct net_device *master; +#endif if (!skb->tstamp.tv64) net_timestamp(skb); @@ -2801,15 +2803,28 @@ static int __netif_receive_skb(struct sk_buff *skb) if (!skb->skb_iif) skb->skb_iif = skb->dev->ifindex; + /* + * bonding note: skbs received on inactive slaves should only + * be delivered to pkt handlers that are exact matches. Also + * the bond_should_drop flag will be set. If packet handlers + * are sensitive to duplicate packets these skbs will need to + * be dropped at the handler. The vlan accel path may have + * already set the bond_should_drop flag. + */ null_or_orig = NULL; orig_dev = skb->dev; +#if defined(CONFIG_BONDING) || defined(CONFIG_BONDING_MODULE) master = ACCESS_ONCE(orig_dev->master); - if (master) { - if (skb_bond_should_drop(skb, master)) + if (skb->bond_should_drop) + null_or_orig = orig_dev; + else if (master) { + if (skb_bond_should_drop(skb, master)) { + skb->bond_should_drop = 1; null_or_orig = orig_dev; /* deliver only exact match */ - else + } else skb->dev = master; } +#endif __get_cpu_var(softnet_data).processed++;
Currently, the accelerated receive path for VLAN's will drop packets if the real device is an inactive slave and is not one of the special pkts tested for in skb_bond_should_drop(). This behavior is different then the non-accelerated path and for pkts over a bonded vlan. For example, vlanx -> bond0 -> ethx will be dropped in the vlan path and not delivered to any packet handlers at all. However, bond0 -> vlanx -> ethx and bond0 -> ethx will be delivered to handlers that match the exact dev, because the VLAN path checks the real_dev which is not a slave and netif_recv_skb() doesn't drop frames but only delivers them to exact matches. This patch adds a sk_buff flag which is used for tagging skbs that would previously been dropped and allows the skb to continue to skb_netif_recv(). Here we add logic to check for the bond_should_drop flag and if it is set only deliver to handlers that match exactly. This makes both paths above consistent and gives pkt handlers a way to identify skbs that come from inactive slaves. Without this patch in some configurations skbs will be delivered to handlers with exact matches and in others be dropped out right in the vlan path. I have tested the following 4 configurations in failover modes and load balancing modes. # bond0 -> ethx # vlanx -> bond0 -> ethx # bond0 -> vlanx -> ethx # bond0 -> ethx | vlanx -> -- Signed-off-by: John Fastabend <john.r.fastabend@intel.com> --- include/linux/skbuff.h | 8 +++++++- net/8021q/vlan_core.c | 8 ++++++-- net/core/dev.c | 23 +++++++++++++++++++---- 3 files changed, 32 insertions(+), 7 deletions(-) -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html