[net-next] net: gro: add a per device gro flush timer

Message ID 1415235320.13896.51.camel@edumazet-glaptop2.roam.corp.google.com
State Superseded, archived
Delegated to: David Miller

Commit Message

Eric Dumazet Nov. 6, 2014, 12:55 a.m. UTC
From: Eric Dumazet <edumazet@google.com>

Tuning coalescing parameters on a NIC can be really hard.

Servers can handle both bulk and RPC-like traffic, with conflicting
goals: bulk flows want GRO packets as big as possible, RPC traffic
wants minimal latency.

To reach big GRO packets on a 10GbE NIC, one can use:

ethtool -C eth0 rx-usecs 4 rx-frames 44

But this penalizes RPC sessions, increasing latencies by up to 50% in
some cases, as NICs generally do not force an interrupt when a packet
with the TCP Push flag is received.

Some NICs do not have an absolute timer, only a timer rearmed for every
incoming packet.

This patch uses a different strategy: let the GRO stack decide what to
do, based on traffic patterns.

Packets with the Push flag won't be delayed.
Packets without the Push flag might be held in the GRO engine, if we
keep receiving data.

This new mechanism is off by default, and can be enabled by setting
/sys/class/net/eth0/gro_flush_timeout to a value in nanoseconds.
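Push packets are never delayed because the existing TCP GRO receive
path already completes a held packet as soon as PSH is seen; an
abridged sketch of that check in tcp_gro_receive()
(net/ipv4/tcp_offload.c), for illustration only:

	/* force a final flush for short segments, or for any segment
	 * carrying URG/PSH/RST/SYN/FIN
	 */
	flush = len < mss;
	flush |= (__force int)(flags & (TCP_FLAG_URG | TCP_FLAG_PSH |
					TCP_FLAG_RST | TCP_FLAG_SYN |
					TCP_FLAG_FIN));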

Tested:
 Ran 200 netperf TCP_STREAM from A to B (10GbE link, 8 RX queues)

Without this feature, we send back about 305,000 ACKs per second.

GRO aggregation ratio is low (811/305 = 2.65 segments per GRO packet)

Setting a timer of 2000 nsec is enough to increase GRO packet sizes
and reduce the number of ACK packets. (811/19.2 = 42)

The receiver performs fewer calls to upper stacks and fewer wakeups.
This also reduces CPU usage on the sender, as it receives fewer ACK
packets.

Note that reducing the number of wakeups increases CPU efficiency, but
can decrease QPS, as applications won't have the chance to warm up CPU
caches by doing a partial read of RPC requests/answers if they fit in
one skb.

B:~# sar -n DEV 1 10 | grep eth0 | tail -1
Average:         eth0 811269.80 305732.30 1199462.57  19705.72      0.00      0.00      0.50

B:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout

lpaa6:~# sar -n DEV 1 10 | grep eth0 | tail -1
Average:         eth0 811577.30  19230.80 1199916.51   1239.80      0.00      0.00      0.50

Signed-off-by: Eric Dumazet <edumazet@google.com>
---
 include/linux/netdevice.h |   12 +++------
 net/core/dev.c            |   44 ++++++++++++++++++++++++++++++++++--
 net/core/net-sysfs.c      |   18 ++++++++++++++
 3 files changed, 64 insertions(+), 10 deletions(-)




Comments

Rick Jones Nov. 6, 2014, 1:38 a.m. UTC | #1
On 11/05/2014 04:55 PM, Eric Dumazet wrote:
> Tested:
>   Ran 200 netperf TCP_STREAM from A to B (10GbE link, 8 RX queues)
>
> Without this feature, we send back about 305,000 ACKs per second.
>
> GRO aggregation ratio is low (811/305 = 2.65 segments per GRO packet)
>
> Setting a timer of 2000 nsec is enough to increase GRO packet sizes
> and reduce the number of ACK packets. (811/19.2 = 42)
>
> The receiver performs fewer calls to upper stacks and fewer wakeups.
> This also reduces CPU usage on the sender, as it receives fewer ACK
> packets.
>
> Note that reducing the number of wakeups increases CPU efficiency, but
> can decrease QPS, as applications won't have the chance to warm up CPU
> caches by doing a partial read of RPC requests/answers if they fit in
> one skb.

Speaking of QPS, what happens to 200 TCP_RR tests when the feature is 
enabled?

rick jones
Eric Dumazet Nov. 6, 2014, 2:14 a.m. UTC | #2
On Wed, 2014-11-05 at 17:38 -0800, Rick Jones wrote:

> Speaking of QPS, what happens to 200 TCP_RR tests when the feature is 
> enabled?

Nothing at all (but the usual noise, I guess).

The 200 TCP_RR flows send packets with 1 byte of payload and the Push
flag set, so no packet ever sits in napi->gro_list

lpaa5:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# time ./super_netperf 200 -H lpaa6 -t TCP_RR -l 20
3.13827e+06

real	0m32.170s
user	0m32.885s
sys	7m38.868s

lpaa5:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# time ./super_netperf 200 -H lpaa6 -t TCP_RR -l 20
3.19013e+06

real	0m37.152s
user	0m33.477s
sys	7m30.586s

Now let's try TCP_RR with -- -r 4000,4000 ;)

Reducing ACK packets allows us to better use the 10GbE bandwidth for
payload, so QPS actually increases.
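(Back-of-envelope, using the TCP_STREAM numbers above and assuming a
bare ACK costs roughly 80 bytes on the wire: 305,000 ACKs/s is about
0.2 Gbit/s, plus per-packet interrupt and DMA overhead on both hosts.)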

lpaa5:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# time ./super_netperf 200 -H lpaa6 -t TCP_RR -l 20 -- -r 4000,4000
379645

real	0m32.201s
user	0m4.390s
sys	0m59.501s

lpaa5:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# time ./super_netperf 200 -H lpaa6 -t TCP_RR -l 20 -- -r 4000,4000
400610

real	0m37.159s
user	0m4.501s
sys	0m59.665s




Eric Dumazet Nov. 6, 2014, 2:39 a.m. UTC | #3
On Wed, 2014-11-05 at 18:14 -0800, Eric Dumazet wrote:
> On Wed, 2014-11-05 at 17:38 -0800, Rick Jones wrote:
> 
> > Speaking of QPS, what happens to 200 TCP_RR tests when the feature is 
> > enabled?

The possible reduction of QPS happens when you have a single flow, like
TCP_RR  -- -r 40000,40000

(Because we have one single TCP packet with 40000 bytes of payload, the
application is woken up once, when the Push flag is received.)

So CPU efficiency is way better, but the application has to copy 40000
bytes in one go _after_ the Push flag, instead of being able to copy
part of the data _before_ receiving the Push flag.

lpaa5:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 0 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# ./netperf -H lpaa6 -t TCP_RR -l 20 -Cc -- -r 40000,40000
MIGRATED TCP REQUEST/RESPONSE TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to lpaa6.prod.google.com () port 0 AF_INET : first burst 0
Local /Remote
Socket Size   Request Resp.  Elapsed Trans.   CPU    CPU    S.dem   S.dem
Send   Recv   Size    Size   Time    Rate     local  remote local   remote
bytes  bytes  bytes   bytes  secs.   per sec  % S    % S    us/Tr   us/Tr

16384  87380  40000   40000  20.00   9023.86  2.02   1.70   107.513  90.561 
16384  87380 

lpaa5:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa6:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
lpaa5:~# ./netperf -H lpaa6 -t TCP_RR -l 20 -Cc -- -r 40000,40000
MIGRATED TCP REQUEST/RESPONSE TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to lpaa6.prod.google.com () port 0 AF_INET : first burst 0
Local /Remote
Socket Size   Request Resp.  Elapsed Trans.   CPU    CPU    S.dem   S.dem
Send   Recv   Size    Size   Time    Rate     local  remote local   remote
bytes  bytes  bytes   bytes  secs.   per sec  % S    % S    us/Tr   us/Tr

16384  87380  40000   40000  20.00   8651.26  0.66   1.02   36.502  56.710 
16384  87380 



Rick Jones Nov. 6, 2014, 4:42 p.m. UTC | #4
On 11/05/2014 06:39 PM, Eric Dumazet wrote:
> On Wed, 2014-11-05 at 18:14 -0800, Eric Dumazet wrote:
>> On Wed, 2014-11-05 at 17:38 -0800, Rick Jones wrote:
>>
>>> Speaking of QPS, what happens to 200 TCP_RR tests when the feature is
>>> enabled?
>
> The possible reduction of QPS happens when you have a single flow, like
> TCP_RR  -- -r 40000,40000
>
> (Because we have one single TCP packet with 40000 bytes of payload, the
> application is woken up once, when the Push flag is received.)
>
> So CPU efficiency is way better, but the application has to copy 40000
> bytes in one go _after_ the Push flag, instead of being able to copy
> part of the data _before_ receiving the Push flag.

Thanks.  That isn't too unlike what I've seen happen in the past with,
say, an 8K request size and switching back and forth between a 1500 and
9000 byte MTU.

happy benchmarking,

rick
David Miller Nov. 6, 2014, 9:25 p.m. UTC | #5
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Wed, 05 Nov 2014 16:55:20 -0800

> @@ -4430,8 +4432,19 @@ void napi_complete(struct napi_struct *n)
>  	if (unlikely(test_bit(NAPI_STATE_NPSVC, &n->state)))
>  		return;
>  
> -	napi_gro_flush(n, false);
> +	if (n->gro_list) {
> +		unsigned long timeout = 0;
> +
> +		if (n->napi_rx_count)
> +			timeout = n->dev->gro_flush_timeout;

Under what circumstances would we see n->gro_list non-NULL yet
n->napi_rx_count == 0?

I'm not so sure it can happen.

Said another way, it looks to me like you could implement this
using less state.
Eric Dumazet Nov. 6, 2014, 10:11 p.m. UTC | #6
On Thu, 2014-11-06 at 16:25 -0500, David Miller wrote:
> From: Eric Dumazet <eric.dumazet@gmail.com>
> Date: Wed, 05 Nov 2014 16:55:20 -0800
> 
> > @@ -4430,8 +4432,19 @@ void napi_complete(struct napi_struct *n)
> >  	if (unlikely(test_bit(NAPI_STATE_NPSVC, &n->state)))
> >  		return;
> >  
> > -	napi_gro_flush(n, false);
> > +	if (n->gro_list) {
> > +		unsigned long timeout = 0;
> > +
> > +		if (n->napi_rx_count)
> > +			timeout = n->dev->gro_flush_timeout;
> 
> Under what circumstances would we see n->gro_list non-NULL yet
> n->napi_rx_count == 0?
> 
> I'm not so sure it can happen.
> 
> Said another way, it looks to me like you could implement this
> using less state.

My goal was not to change any driver, keeping this a generic change.

Drivers call napi_complete() in their RX napi handler without giving us
the 'work_done' value, which tells us whether a packet was processed.

So I added a counter that is incremented for every packet given to the
GRO engine (napi_rx_count), so that napi_complete() has a clue whether a
packet was received in _this_ NAPI run.

If at least one packet was received (and we still have packets in
gro_list) -> we rearm the timer.
If not, we flush packets in the GRO engine.

In order to avoid this state, I would have to add a new method, like
napi_complete_done(napi, work_done), and change drivers. I am not sure
it's worth the effort?
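A sketch of what that helper could look like, simply reusing the logic
from this patch (illustrative only, not submitted code):

void napi_complete_done(struct napi_struct *n, int work_done)
{
	if (n->gro_list) {
		unsigned long timeout = 0;

		/* A packet seen in this poll means the flow is still
		 * active: rearm the timer instead of flushing now.
		 */
		if (work_done)
			timeout = n->dev->gro_flush_timeout;

		if (timeout)
			hrtimer_start(&n->timer, ns_to_ktime(timeout),
				      HRTIMER_MODE_REL_PINNED);
		else
			napi_gro_flush(n, false);
	}
	/* ... then the same completion steps as napi_complete() ... */
}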




David Miller Nov. 7, 2014, 3:36 a.m. UTC | #7
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Thu, 06 Nov 2014 14:11:20 -0800

> My goal was not to change any driver, keeping this a generic change.
> 
> Drivers call napi_complete() in their RX napi handler without giving us
> the 'work_done' value, which tells us whether a packet was processed.
> 
> So I added a counter that is incremented for every packet given to the
> GRO engine (napi_rx_count), so that napi_complete() has a clue whether a
> packet was received in _this_ NAPI run.
> 
> If at least one packet was received (and we still have packets in
> gro_list) -> we rearm the timer.
> If not, we flush packets in the GRO engine.
> 
> In order to avoid this state, I would have to add a new method, like
> napi_complete_done(napi, work_done), and change drivers. I am not sure
> it's worth the effort?

I think for such a critical path in the kernel it's worth it to avoid
these increments for every packet, just to compute a value that's
sitting in a register already in the driver's poll routine.

Eric, you've been trimming CPU IRQ disables from these exact code
paths, you should know better than me :-)

I'm willing to do some of the monkey work of converting as many
drivers as can be trivially done if you want.  Almost all of the
ones I looked at have the work_done variable right there at the
napi_complete() call site.

The rest can stay unconverted and not get access to this new facility.
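The driver-side change would typically be a one-liner in the poll
routine, something like this (hypothetical driver, assuming the
napi_complete_done() helper sketched above):

-	if (work_done < budget)
-		napi_complete(napi);
+	if (work_done < budget)
+		napi_complete_done(napi, work_done);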
Eric Dumazet Nov. 7, 2014, 4:15 a.m. UTC | #8
On Thu, 2014-11-06 at 22:36 -0500, David Miller wrote:
> From: Eric Dumazet <eric.dumazet@gmail.com>
> Date: Thu, 06 Nov 2014 14:11:20 -0800
> 
> > My goal was not to change any driver, keeping this a generic change.
> > 
> > Drivers call napi_complete() in their RX napi handler without giving us
> > the 'work_done' value, which tells us whether a packet was processed.
> > 
> > So I added a counter that is incremented for every packet given to the
> > GRO engine (napi_rx_count), so that napi_complete() has a clue whether a
> > packet was received in _this_ NAPI run.
> > 
> > If at least one packet was received (and we still have packets in
> > gro_list) -> we rearm the timer.
> > If not, we flush packets in the GRO engine.
> > 
> > In order to avoid this state, I would have to add a new method, like
> > napi_complete_done(napi, work_done), and change drivers. I am not sure
> > it's worth the effort?
> 
> I think for such a critical path in the kernel it's worth it to avoid
> these increments for every packet, just to compute a value that's
> sitting in a register already in the driver's poll routine.
> 
> Eric, you've been trimming CPU IRQ disables from these exact code
> paths, you should know better than me :-)
> 
> I'm willing to do some of the monkey work of converting as many
> drivers as can be trivially done if you want.  Almost all of the
> ones I looked at have the work_done variable right there at the
> napi_complete() call site.
> 
> The rest can stay unconverted and not get access to this new facility.

This is your call. I actually did this in my first implementation,
because I was not using a timer but rescheduling NAPI and not
re-enabling interrupts.

http://www.spinics.net/lists/netdev/msg302474.html

(This is why I had to cook "d75b1ade567 net: less interrupt masking in
NAPI")

Given the cost of GRO processing, setting a bool (this is all I need,
actually) is really pure noise.
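Something like this, against the napi_struct hunk below (the field name
is made up for illustration):

-	unsigned long		napi_rx_count;
+	bool			rx_seen;	/* packet fed to GRO in this poll */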




Patch

diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h
index 4767f546d7c0..8474fcfadc7c 100644
--- a/include/linux/netdevice.h
+++ b/include/linux/netdevice.h
@@ -314,6 +314,8 @@  struct napi_struct {
 	struct net_device	*dev;
 	struct sk_buff		*gro_list;
 	struct sk_buff		*skb;
+	unsigned long		napi_rx_count;
+	struct hrtimer		timer;
 	struct list_head	dev_list;
 	struct hlist_node	napi_hash_node;
 	unsigned int		napi_id;
@@ -485,14 +487,7 @@  void napi_hash_del(struct napi_struct *napi);
  * Stop NAPI from being scheduled on this context.
  * Waits till any outstanding processing completes.
  */
-static inline void napi_disable(struct napi_struct *n)
-{
-	might_sleep();
-	set_bit(NAPI_STATE_DISABLE, &n->state);
-	while (test_and_set_bit(NAPI_STATE_SCHED, &n->state))
-		msleep(1);
-	clear_bit(NAPI_STATE_DISABLE, &n->state);
-}
+void napi_disable(struct napi_struct *n);
 
 /**
  *	napi_enable - enable NAPI scheduling
@@ -1603,6 +1598,7 @@  struct net_device {
 
 #endif
 
+	unsigned long		gro_flush_timeout;
 	rx_handler_func_t __rcu	*rx_handler;
 	void __rcu		*rx_handler_data;
 
diff --git a/net/core/dev.c b/net/core/dev.c
index 40be481268de..c88651bd8ada 100644
--- a/net/core/dev.c
+++ b/net/core/dev.c
@@ -133,6 +133,7 @@ 
 #include <linux/vmalloc.h>
 #include <linux/if_macvlan.h>
 #include <linux/errqueue.h>
+#include <linux/hrtimer.h>
 
 #include "net-sysfs.h"
 
@@ -4000,6 +4001,8 @@  static enum gro_result dev_gro_receive(struct napi_struct *napi, struct sk_buff
 	if (skb_is_gso(skb) || skb_has_frag_list(skb) || skb->csum_bad)
 		goto normal;
 
+	napi->napi_rx_count++;
+
 	gro_list_prepare(napi, skb);
 
 	rcu_read_lock();
@@ -4411,7 +4414,6 @@  EXPORT_SYMBOL(__napi_schedule_irqoff);
 void __napi_complete(struct napi_struct *n)
 {
 	BUG_ON(!test_bit(NAPI_STATE_SCHED, &n->state));
-	BUG_ON(n->gro_list);
 
 	list_del_init(&n->poll_list);
 	smp_mb__before_atomic();
@@ -4430,8 +4432,19 @@  void napi_complete(struct napi_struct *n)
 	if (unlikely(test_bit(NAPI_STATE_NPSVC, &n->state)))
 		return;
 
-	napi_gro_flush(n, false);
+	if (n->gro_list) {
+		unsigned long timeout = 0;
+
+		if (n->napi_rx_count)
+			timeout = n->dev->gro_flush_timeout;
 
+		if (timeout)
+			hrtimer_start(&n->timer, ns_to_ktime(timeout),
+				      HRTIMER_MODE_REL_PINNED);
+		else
+			napi_gro_flush(n, false);
+	}
+	n->napi_rx_count = 0;
 	if (likely(list_empty(&n->poll_list))) {
 		WARN_ON_ONCE(!test_and_clear_bit(NAPI_STATE_SCHED, &n->state));
 	} else {
@@ -4495,10 +4508,23 @@  void napi_hash_del(struct napi_struct *napi)
 }
 EXPORT_SYMBOL_GPL(napi_hash_del);
 
+static enum hrtimer_restart napi_watchdog(struct hrtimer *timer)
+{
+	struct napi_struct *napi;
+
+	napi = container_of(timer, struct napi_struct, timer);
+	if (napi->gro_list)
+		napi_schedule(napi);
+
+	return HRTIMER_NORESTART;
+}
+
 void netif_napi_add(struct net_device *dev, struct napi_struct *napi,
 		    int (*poll)(struct napi_struct *, int), int weight)
 {
 	INIT_LIST_HEAD(&napi->poll_list);
+	hrtimer_init(&napi->timer, CLOCK_MONOTONIC, HRTIMER_MODE_REL_PINNED);
+	napi->timer.function = napi_watchdog;
 	napi->gro_count = 0;
 	napi->gro_list = NULL;
 	napi->skb = NULL;
@@ -4517,6 +4543,20 @@  void netif_napi_add(struct net_device *dev, struct napi_struct *napi,
 }
 EXPORT_SYMBOL(netif_napi_add);
 
+void napi_disable(struct napi_struct *n)
+{
+	might_sleep();
+	set_bit(NAPI_STATE_DISABLE, &n->state);
+
+	while (test_and_set_bit(NAPI_STATE_SCHED, &n->state))
+		msleep(1);
+
+	hrtimer_cancel(&n->timer);
+
+	clear_bit(NAPI_STATE_DISABLE, &n->state);
+}
+EXPORT_SYMBOL(napi_disable);
+
 void netif_napi_del(struct napi_struct *napi)
 {
 	list_del_init(&napi->dev_list);
diff --git a/net/core/net-sysfs.c b/net/core/net-sysfs.c
index 9dd06699b09c..1a24602cd54e 100644
--- a/net/core/net-sysfs.c
+++ b/net/core/net-sysfs.c
@@ -325,6 +325,23 @@  static ssize_t tx_queue_len_store(struct device *dev,
 }
 NETDEVICE_SHOW_RW(tx_queue_len, fmt_ulong);
 
+static int change_gro_flush_timeout(struct net_device *dev, unsigned long val)
+{
+	dev->gro_flush_timeout = val;
+	return 0;
+}
+
+static ssize_t gro_flush_timeout_store(struct device *dev,
+				  struct device_attribute *attr,
+				  const char *buf, size_t len)
+{
+	if (!capable(CAP_NET_ADMIN))
+		return -EPERM;
+
+	return netdev_store(dev, attr, buf, len, change_gro_flush_timeout);
+}
+NETDEVICE_SHOW_RW(gro_flush_timeout, fmt_ulong);
+
 static ssize_t ifalias_store(struct device *dev, struct device_attribute *attr,
 			     const char *buf, size_t len)
 {
@@ -422,6 +439,7 @@  static struct attribute *net_class_attrs[] = {
 	&dev_attr_mtu.attr,
 	&dev_attr_flags.attr,
 	&dev_attr_tx_queue_len.attr,
+	&dev_attr_gro_flush_timeout.attr,
 	&dev_attr_phys_port_id.attr,
 	NULL,
 };