From patchwork Mon Jan 16 17:08:32 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Marcin Wojtas X-Patchwork-Id: 715831 X-Patchwork-Delegate: davem@davemloft.net Return-Path: X-Original-To: patchwork-incoming@ozlabs.org Delivered-To: patchwork-incoming@ozlabs.org Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by ozlabs.org (Postfix) with ESMTP id 3v2KQz4pvmz9s2G for ; Tue, 17 Jan 2017 04:07:07 +1100 (AEDT) Authentication-Results: ozlabs.org; dkim=pass (2048-bit key; unprotected) header.d=semihalf-com.20150623.gappssmtp.com header.i=@semihalf-com.20150623.gappssmtp.com header.b="rSTnnUne"; dkim-atps=neutral Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751474AbdAPRHF (ORCPT ); Mon, 16 Jan 2017 12:07:05 -0500 Received: from mail-lf0-f46.google.com ([209.85.215.46]:36320 "EHLO mail-lf0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751324AbdAPRGr (ORCPT ); Mon, 16 Jan 2017 12:06:47 -0500 Received: by mail-lf0-f46.google.com with SMTP id z134so85044422lff.3 for ; Mon, 16 Jan 2017 09:06:46 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=semihalf-com.20150623.gappssmtp.com; s=20150623; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=NqthK5eQW+4ysuQKjYXqRgn+L6E6R7/gGxCK33L1cvo=; b=rSTnnUnev3cLqqUf9tOwC1WRjA3cl2CMIHZcM3ObiLu3HeE46SCbuLkBvZ2ME6MurL 2t6iD/N6jnOZwMiMgS+i21dBGzn2/LMQPkHAIu6syM1aG+YrLkGdTv1eTjpQazsaIjin /DzIJ/+kXqjhTRCGySpdF3Kq3+YaKE0pzR/KCTgS5NtaNpTENDFgoTLAbAtSY0sMyJCZ Z4kIHW2vyvuA+sxlJljvsj/JEnuEtojMB4zEUB2Nn+oxwtPUfoKlC7P6vTT5rQTvWBDe LwcoXN0PUm4kn4E/LqUs61qGELvyHfu9rjtzvdN1OhkEhv3NBf/5+gdlaQbPEKDQAfBz WWTQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=NqthK5eQW+4ysuQKjYXqRgn+L6E6R7/gGxCK33L1cvo=; b=t7MPyK0WOyp612tev9veLBClNnEiGBNSKd2V6dNspysm5Wpz3aiD+SMpeqCNak9ooZ JWo0wPh3q/zgwB7g/+fpQ6QDJWANlesD5eoSFrhWJzr8zImVf8ayCUX2IgOTp2nwwsLs YvaPtgpJsiQdsfqYXNX6j9ovLCyjf+LdJY70S1ooevk1aovOYPoJgfk+8gCdl5FnBIfp gCj6gpk9QG7ZMjRxxn1CklVoeASSawZZdCggKyUVUXax5GCyN2wvIewGo8/haaWC7Sm2 6FkjJRdxEMSOO6yXN9/V6amjrUof6sMMeACOu5WkTy7Rzt912jnAtx/QKF2beDCdJ/5A uOkg== X-Gm-Message-State: AIkVDXL1CfDDyKS9maHfN4PRBhB3M47Dr6WkT+0kPhUGtDDtx/j2DQrzcVO7ibsLa2xOZw== X-Received: by 10.25.219.69 with SMTP id s66mr10991425lfg.116.1484586405596; Mon, 16 Jan 2017 09:06:45 -0800 (PST) Received: from enkidu.semihalf.local (31-172-191-173.noc.fibertech.net.pl. [31.172.191.173]) by smtp.gmail.com with ESMTPSA id 206sm7630733ljf.35.2017.01.16.09.06.44 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 16 Jan 2017 09:06:44 -0800 (PST) From: Marcin Wojtas To: linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, netdev@vger.kernel.org Cc: davem@davemloft.net, linux@arm.linux.org.uk, sebastian.hesselbarth@gmail.com, andrew@lunn.ch, jason@lakedaemon.net, thomas.petazzoni@free-electrons.com, gregory.clement@free-electrons.com, mw@semihalf.com, jaz@semihalf.com Subject: [PATCH v2 net-next 2/2] net: mvneta: add BQL support Date: Mon, 16 Jan 2017 18:08:32 +0100 Message-Id: <1484586512-16412-3-git-send-email-mw@semihalf.com> X-Mailer: git-send-email 1.8.3.1 In-Reply-To: <1484586512-16412-1-git-send-email-mw@semihalf.com> References: <1484586512-16412-1-git-send-email-mw@semihalf.com> Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org Tests showed that when whole bandwidth is consumed, the latency for various kind of traffic can reach high values. With saturated link (e.g. with iperf from target to host) simple ping could take significant amount of time. BQL proved to improve this situation when implemented in mvneta driver. Measurements of ping latency for 3 link speeds: Speed | Latency w/o BQL | Latency with BQL 10 | 7-14 ms | 3.5 ms 100 | 2-12 ms | 0.6 ms 1000 | often timeout | up to 2ms Decreasing latency as above result in sligt performance cost - 4kpps (-1.4%) when pushing 64B packets via two bridged interfaces of Armada 38x. For 1500B packets in the same setup, the mpstat tool showed +8% of CPU occupation (default affinity, second CPU idle). Even though this cost seems reasonable to take, considering other improvements. This commit adds byte queue limit mechanism for the mvneta driver. Signed-off-by: Marcin Wojtas --- drivers/net/ethernet/marvell/mvneta.c | 22 +++++++++++++++++++--- 1 file changed, 19 insertions(+), 3 deletions(-) diff --git a/drivers/net/ethernet/marvell/mvneta.c b/drivers/net/ethernet/marvell/mvneta.c index 9624537..6dcc951 100644 --- a/drivers/net/ethernet/marvell/mvneta.c +++ b/drivers/net/ethernet/marvell/mvneta.c @@ -1759,8 +1759,10 @@ static struct mvneta_tx_queue *mvneta_tx_done_policy(struct mvneta_port *pp, /* Free tx queue skbuffs */ static void mvneta_txq_bufs_free(struct mvneta_port *pp, - struct mvneta_tx_queue *txq, int num) + struct mvneta_tx_queue *txq, int num, + struct netdev_queue *nq) { + unsigned int bytes_compl = 0, pkts_compl = 0; int i; for (i = 0; i < num; i++) { @@ -1768,6 +1770,11 @@ static void mvneta_txq_bufs_free(struct mvneta_port *pp, txq->txq_get_index; struct sk_buff *skb = txq->tx_skb[txq->txq_get_index]; + if (skb) { + bytes_compl += skb->len; + pkts_compl++; + } + mvneta_txq_inc_get(txq); if (!IS_TSO_HEADER(txq, tx_desc->buf_phys_addr)) @@ -1778,6 +1785,8 @@ static void mvneta_txq_bufs_free(struct mvneta_port *pp, continue; dev_kfree_skb_any(skb); } + + netdev_tx_completed_queue(nq, pkts_compl, bytes_compl); } /* Handle end of transmission */ @@ -1791,7 +1800,7 @@ static void mvneta_txq_done(struct mvneta_port *pp, if (!tx_done) return; - mvneta_txq_bufs_free(pp, txq, tx_done); + mvneta_txq_bufs_free(pp, txq, tx_done, nq); txq->count -= tx_done; @@ -2401,6 +2410,8 @@ static int mvneta_tx(struct sk_buff *skb, struct net_device *dev) struct mvneta_pcpu_stats *stats = this_cpu_ptr(pp->stats); struct netdev_queue *nq = netdev_get_tx_queue(dev, txq_id); + netdev_tx_sent_queue(nq, len); + txq->count += frags; if (txq->count >= txq->tx_stop_threshold) netif_tx_stop_queue(nq); @@ -2429,9 +2440,10 @@ static void mvneta_txq_done_force(struct mvneta_port *pp, struct mvneta_tx_queue *txq) { + struct netdev_queue *nq = netdev_get_tx_queue(pp->dev, txq->id); int tx_done = txq->count; - mvneta_txq_bufs_free(pp, txq, tx_done); + mvneta_txq_bufs_free(pp, txq, tx_done, nq); /* reset txq */ txq->count = 0; @@ -2957,6 +2969,8 @@ static int mvneta_txq_init(struct mvneta_port *pp, static void mvneta_txq_deinit(struct mvneta_port *pp, struct mvneta_tx_queue *txq) { + struct netdev_queue *nq = netdev_get_tx_queue(pp->dev, txq->id); + kfree(txq->tx_skb); if (txq->tso_hdrs) @@ -2968,6 +2982,8 @@ static void mvneta_txq_deinit(struct mvneta_port *pp, txq->size * MVNETA_DESC_ALIGNED_SIZE, txq->descs, txq->descs_phys); + netdev_tx_reset_queue(nq); + txq->descs = NULL; txq->last_desc = 0; txq->next_desc_to_proc = 0;