Date: Sun, 08 May 2011 15:27:07 +0100
From: Alex Bligh
To: paulmck@linux.vnet.ibm.com
Cc: Eric Dumazet, netdev@vger.kernel.org, Alex Bligh
Subject: Re: Scalability of interface creation and deletion
In-Reply-To: <20110508134425.GL2641@linux.vnet.ibm.com>

Paul,

>> Yes, really 20-49us and 50-99us, not ms. Raw data attached :-)
>>
>> I'm guessing there are circumstances where there is an early exit.
>
> Well, if you were onlining and offlining CPUs, then if there was only
> one CPU online, this could happen.

No, I wasn't doing that.

> And there really is only one CPU
> online during boot, so if your measurements included early boot time,
> this could easily explain these very short timings.

No, I waited a few minutes after boot for the system to stabilize, and
all CPUs were definitely online. The patch to the kernel I am running
is below.

>> There is nothing much going on on these systems (idle, no other
>> users, just normal system daemons).
>
> And normal system daemons might cause this, right?

Yes. Everything is normal, except that I did

  service udev stop
  unshare -n bash

which together stop the system from running interface scripts when
interfaces are created (because upstart and upstart-udev-bridge are now
integrated, you can't simply kill upstart; you have to rely on
unshare -n to stop the events propagating). That's just to avoid
measuring the time it takes to execute the scripts.
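For concreteness, the measurement itself is just "create an interface,
delete it, time the pair". A minimal userspace sketch of that sort of
harness (illustrative only, not the exact code I ran; it assumes root,
iproute2 and the dummy module, and shells out to ip rather than
speaking rtnetlink directly):

#include <stdio.h>
#include <stdlib.h>
#include <time.h>

/* Monotonic time in microseconds; link with -lrt on older glibc. */
static long long now_us(void)
{
	struct timespec ts;

	clock_gettime(CLOCK_MONOTONIC, &ts);
	return ts.tv_sec * 1000000LL + ts.tv_nsec / 1000;
}

int main(void)
{
	int i;

	for (i = 0; i < 100; i++) {
		long long t0 = now_us();

		/* One create/delete cycle; the delete path is where the
		 * rcu_barrier()/synchronize_sched() cost shows up. */
		system("ip link add dummy-test type dummy");
		system("ip link del dummy-test");
		printf("cycle %d: %lld us\n", i, now_us() - t0);
	}
	return 0;
}
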
diff --git a/kernel/rcutree.c b/kernel/rcutree.c
index dd4aea8..e401018 100644
--- a/kernel/rcutree.c
+++ b/kernel/rcutree.c
@@ -1518,6 +1518,7 @@ EXPORT_SYMBOL_GPL(call_rcu_bh);
 void synchronize_sched(void)
 {
 	struct rcu_synchronize rcu;
+	ktime_t time_start = ktime_get();
 
 	if (rcu_blocking_is_gp())
 		return;
@@ -1529,6 +1530,7 @@ void synchronize_sched(void)
 	/* Wait for it. */
 	wait_for_completion(&rcu.completion);
 	destroy_rcu_head_on_stack(&rcu.head);
+	pr_err("synchronize_sched() in %lld us\n", ktime_us_delta(ktime_get(), time_start));
 }
 EXPORT_SYMBOL_GPL(synchronize_sched);

diff --git a/net/core/dev.c b/net/core/dev.c
index 856b6ee..013f627 100644
--- a/net/core/dev.c
+++ b/net/core/dev.c
@@ -5164,7 +5164,9 @@ static void rollback_registered_many(struct list_head *head)
 	dev = list_first_entry(head, struct net_device, unreg_list);
 	call_netdevice_notifiers(NETDEV_UNREGISTER_BATCH, dev);
 
+	pr_err("begin rcu_barrier()\n");
 	rcu_barrier();
+	pr_err("end rcu_barrier()\n");
 
 	list_for_each_entry(dev, head, unreg_list)
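
For anyone following the "early exit" point above: the fast path Paul
mentions is the rcu_blocking_is_gp() test visible in the first hunk.
As I read it in this tree the check is essentially the following
(paraphrased, not a verbatim quote; the real one may carry extra debug
checks):

static inline int rcu_blocking_is_gp(void)
{
	/* With one online CPU, a grace period is trivially complete. */
	return num_online_cpus() == 1;
}

With all CPUs online that path should never be taken, which is why the
sub-100us samples look odd.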