From patchwork Wed Aug 1 17:09:27 2012 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Or Gerlitz X-Patchwork-Id: 174544 X-Patchwork-Delegate: davem@davemloft.net Return-Path: X-Original-To: patchwork-incoming@ozlabs.org Delivered-To: patchwork-incoming@ozlabs.org Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by ozlabs.org (Postfix) with ESMTP id 3CB852C009B for ; Thu, 2 Aug 2012 03:11:01 +1000 (EST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753633Ab2HARKy (ORCPT ); Wed, 1 Aug 2012 13:10:54 -0400 Received: from eu1sys200aog115.obsmtp.com ([207.126.144.139]:60268 "HELO eu1sys200aog115.obsmtp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1753478Ab2HARJ7 (ORCPT ); Wed, 1 Aug 2012 13:09:59 -0400 Received: from mtlsws123.lab.mtl.com ([82.166.227.17]) (using TLSv1) by eu1sys200aob115.postini.com ([207.126.147.11]) with SMTP ID DSNKUBli32UoiUA+sscnIR+H8cHqhvqQyh1A@postini.com; Wed, 01 Aug 2012 17:09:58 UTC Received: from r-vnc04.lab.mtl.com (r-vnc04.lab.mtl.com [10.208.0.116]) by mtlsws123.lab.mtl.com (8.13.8/8.13.8) with ESMTP id q71H9mFe004604; Wed, 1 Aug 2012 20:09:51 +0300 From: Or Gerlitz To: davem@davemloft.net Cc: roland@kernel.org, netdev@vger.kernel.org, ali@mellanox.com, sean.hefty@intel.com, Erez Shitrit , Or Gerlitz Subject: [PATCH V2 04/12] IB/ipoib: Add support for acting as VIF Date: Wed, 1 Aug 2012 20:09:27 +0300 Message-Id: <1343840975-3252-5-git-send-email-ogerlitz@mellanox.com> X-Mailer: git-send-email 1.7.8.2 In-Reply-To: <1343840975-3252-1-git-send-email-ogerlitz@mellanox.com> References: <1343840975-3252-1-git-send-email-ogerlitz@mellanox.com> Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org From: Erez Shitrit When IPoIB interface acts as a VIF for an eIPoIB interface, it uses the skb cb storage area on the RX flow, to place information which can be of use to the upper layer device. One such usage example, is when an eIPoIB inteface needs to generate a source mac for incoming Ethernet frames. The IPoIB code checks the VIF private flag on the RX path, and according to the value of the flag prepares the skb CB data, etc. Signed-off-by: Erez Shitrit Signed-off-by: Or Gerlitz --- drivers/infiniband/ulp/ipoib/ipoib.h | 5 +++ drivers/infiniband/ulp/ipoib/ipoib_cm.c | 9 +++++ drivers/infiniband/ulp/ipoib/ipoib_ib.c | 8 ++++- drivers/infiniband/ulp/ipoib/ipoib_main.c | 21 +++++++++++ include/rdma/e_ipoib.h | 54 +++++++++++++++++++++++++++++ 5 files changed, 96 insertions(+), 1 deletions(-) create mode 100644 include/rdma/e_ipoib.h diff --git a/drivers/infiniband/ulp/ipoib/ipoib.h b/drivers/infiniband/ulp/ipoib/ipoib.h index 0df6668..3235033 100644 --- a/drivers/infiniband/ulp/ipoib/ipoib.h +++ b/drivers/infiniband/ulp/ipoib/ipoib.h @@ -52,6 +52,7 @@ #include #include #include +#include /* constants */ @@ -209,6 +210,7 @@ struct ipoib_cm_rx { unsigned long jiffies; enum ipoib_cm_state state; int recv_count; + u32 qpn; }; struct ipoib_cm_tx { @@ -703,6 +705,9 @@ extern int ipoib_recvq_size; extern struct ib_sa_client ipoib_sa_client; +void set_skb_oob_cb_data(struct sk_buff *skb, struct ib_wc *wc, + struct napi_struct *napi); + #ifdef CONFIG_INFINIBAND_IPOIB_DEBUG extern int ipoib_debug_level; diff --git a/drivers/infiniband/ulp/ipoib/ipoib_cm.c b/drivers/infiniband/ulp/ipoib/ipoib_cm.c index 6d66ab0..c561956 100644 --- a/drivers/infiniband/ulp/ipoib/ipoib_cm.c +++ b/drivers/infiniband/ulp/ipoib/ipoib_cm.c @@ -440,6 +440,7 @@ static int ipoib_cm_req_handler(struct ib_cm_id *cm_id, struct ib_cm_event *even struct net_device *dev = cm_id->context; struct ipoib_dev_priv *priv = netdev_priv(dev); struct ipoib_cm_rx *p; + struct ipoib_cm_data *data = event->private_data; unsigned psn; int ret; @@ -452,6 +453,10 @@ static int ipoib_cm_req_handler(struct ib_cm_id *cm_id, struct ib_cm_event *even cm_id->context = p; p->state = IPOIB_CM_RX_LIVE; p->jiffies = jiffies; + + /* used to keep track of base qpn in CM mode */ + p->qpn = be32_to_cpu(data->qpn); + INIT_LIST_HEAD(&p->list); p->qp = ipoib_cm_create_rx_qp(dev, p); @@ -669,6 +674,10 @@ copied: skb->dev = dev; /* XXX get correct PACKET_ type here */ skb->pkt_type = PACKET_HOST; + /* if handler is registered on top of ipoib, set skb oob data. */ + if (skb->dev->priv_flags & IFF_EIPOIB_VIF) + set_skb_oob_cb_data(skb, wc, NULL); + netif_receive_skb(skb); repost: diff --git a/drivers/infiniband/ulp/ipoib/ipoib_ib.c b/drivers/infiniband/ulp/ipoib/ipoib_ib.c index f10221f..f248e6e 100644 --- a/drivers/infiniband/ulp/ipoib/ipoib_ib.c +++ b/drivers/infiniband/ulp/ipoib/ipoib_ib.c @@ -304,7 +304,13 @@ static void ipoib_ib_handle_rx_wc(struct net_device *dev, struct ib_wc *wc) likely(wc->wc_flags & IB_WC_IP_CSUM_OK)) skb->ip_summed = CHECKSUM_UNNECESSARY; - napi_gro_receive(&priv->napi, skb); + /* if handler is registered on top of ipoib, set skb oob data */ + if (dev->priv_flags & IFF_EIPOIB_VIF) { + set_skb_oob_cb_data(skb, wc, &priv->napi); + /* the registered handler will take care of the skb */ + netif_receive_skb(skb); + } else + napi_gro_receive(&priv->napi, skb); repost: if (unlikely(ipoib_ib_post_receive(dev, wr_id))) diff --git a/drivers/infiniband/ulp/ipoib/ipoib_main.c b/drivers/infiniband/ulp/ipoib/ipoib_main.c index 637980b..47034b4 100644 --- a/drivers/infiniband/ulp/ipoib/ipoib_main.c +++ b/drivers/infiniband/ulp/ipoib/ipoib_main.c @@ -91,6 +91,24 @@ static struct ib_client ipoib_client = { .remove = ipoib_remove_one }; +void set_skb_oob_cb_data(struct sk_buff *skb, struct ib_wc *wc, + struct napi_struct *napi) +{ + struct ipoib_cm_rx *p_cm_ctx = NULL; + struct eipoib_cb_data *data = NULL; + + p_cm_ctx = wc->qp->qp_context; + data = IPOIB_HANDLER_CB(skb); + + data->rx.slid = wc->slid; + data->rx.sqpn = wc->src_qp; + data->rx.napi = napi; + + /* in CM mode, use the "base" qpn as sqpn */ + if (p_cm_ctx) + data->rx.sqpn = p_cm_ctx->qpn; +} + int ipoib_open(struct net_device *dev) { struct ipoib_dev_priv *priv = netdev_priv(dev); @@ -1253,6 +1271,9 @@ static struct net_device *ipoib_add_port(const char *format, goto event_failed; } + /* indicates pif port */ + priv->dev->priv_flags |= IFF_EIPOIB_PIF; + result = register_netdev(priv->dev); if (result) { printk(KERN_WARNING "%s: couldn't register ipoib port %d; error %d\n", diff --git a/include/rdma/e_ipoib.h b/include/rdma/e_ipoib.h new file mode 100644 index 0000000..7249334 --- /dev/null +++ b/include/rdma/e_ipoib.h @@ -0,0 +1,54 @@ +/* + * Copyright (c) 2012 Mellanox Technologies. All rights reserved + * + * This software is available to you under a choice of one of two + * licenses. You may choose to be licensed under the terms of the GNU + * General Public License (GPL) Version 2, available from the file + * COPYING in the main directory of this source tree, or the + * openfabric.org BSD license below: + * + * Redistribution and use in source and binary forms, with or + * without modification, are permitted provided that the following + * conditions are met: + * + * - Redistributions of source code must retain the above + * copyright notice, this list of conditions and the following + * disclaimer. + * + * - Redistributions in binary form must reproduce the above + * copyright notice, this list of conditions and the following + * disclaimer in the documentation and/or other materials + * provided with the distribution. + * + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, + * EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF + * MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND + * NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS + * BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN + * ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN + * CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE + * SOFTWARE. + */ + +#ifndef _LINUX_ETH_IB_IPOIB_H +#define _LINUX_ETH_IB_IPOIB_H + +#include + +struct eipoib_cb_data { + /* + * extra care taken not to collide with the usage done + * by the qdisc layer in struct skb cb data. + */ + struct qdisc_skb_cb qdisc_cb; + struct { /* must be <= 20 bytes */ + u32 sqpn; + struct napi_struct *napi; + u16 slid; + u8 data[6]; + } __packed rx; +}; + +#define IPOIB_HANDLER_CB(skb) ((struct eipoib_cb_data *)(skb)->cb) + +#endif /* _LINUX_ETH_IB_IPOIB_H */