[v1,0/4] Context domains

Message ID 20170725134515.GI26577@zareason
State RFC
Delegated to: David Miller

Commit Message

Bob Picco July 25, 2017, 1:45 p.m. UTC
David Miller wrote:	[Thu Jul 20 2017, 11:06:55PM EDT]
> From: David Miller <davem@davemloft.net>
> Date: Fri, 21 Jul 2017 03:50:05 +0100 (WEST)
> 
> > Having to allocate a full trap frame just to TLB flush one page or an
> > MM is a serious regression.
> > 
> > Next, allocating a whole new data structure and clearing it out on
> > every new address creation is going to be a significant new cost as
> > well.
None of this work has had any performance analysis yet; we rushed to post it
because of forthcoming job reassignments.
> 
> So, just thinking out loud:
> 
> 1) You can retain the cross call TLB flush assembler by passing in the
>    appropriate context value for each individual cpu from the cross
>    call dispatcher.
Agreed and reinstated.
> 
> 2) If you have some constant bounds on the upper number of context
>    domains, you can simply inline them into the existing mmu_context
>    structure.  This avoids the memory allocation per mm creation.
Yes, that is the direction I was moving toward on Friday (7/14). I have
something similar on top of what Pasha posted; it still has to be debugged
and tested. Pasha made some nice identifier renames and cleanups before
posting.
> 
> You can also make the context domain salting extremely cheap.
> Perhaps something like "(cpuid>>x) & y".
> 
> No, you won't map cores to context domains so precisely like the
> code does now, but you will make up for it in code simplicity and
> overall new costs added by these changes for the more common
> cases.
> 
> I suggest "(cpuid>>x) & y" and a very small number of context domains
> (which determines 'y') because we don't need something perfect, we
> need something which divides the problem by some order of magnitude.
> 
> The hash of locks caught my eye as well.  I think you don't need that
> and we really steer clear of hashed spinlock tables in the Linux
> kernel because they never scale properly.
Agreed, and witnessed during experimentation (7/7 - 7/11) with the
CTX_NR_BITS reduction.
> 
> Instead, I think you can use something like RCU to provide the
> necessary synchronization.  So you could first make sure X isn't
> referenced on the local cpu any more, and then do call_rcu() to do the
> actual clearing of the bitmap which allows X to be allocated again.
Making the context id array a compile-time member of mm_context_t eliminates
the need for the hashed spin lock. That elimination, plus the code
simplicity, is why I changed direction on 7/14. I am not sure we will need
anything RCU-like. This should also make mm_cd_destroy() more scalable, with
fewer context domains to inspect and, possibly, context ids to release. The
context id release aspect will decrease wrap events.
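
To make this concrete, here is a rough sketch, not the posted code, of how
the compile-time per-domain array might be looked up; cds, NR_CD and
CPU_TO_CD come from the snippet in the Patch section below, while
get_mm_ctx() is a made-up helper shown purely for illustration:

/* Sketch only: cds[] is the compile-time per-domain context id array from
 * the snippet below; get_mm_ctx() is a hypothetical helper used just to
 * illustrate the lookup.
 */
static inline unsigned long get_mm_ctx(mm_context_t *ctx, int cpu)
{
	/* The cheap "(cpuid>>x) & y" salting picks the domain; no hashed
	 * lock is needed because the array lives in the mm itself.
	 */
	return ctx->cds[CPU_TO_CD(cpu)];
}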

It is difficult to satisfy all topologies; to me it seems all but
impossible.

This is a snippet of the patch series (the diff is shown in the Patch
section below). I just booted it on my t5-2. We may want to reduce
CPU_TO_CD_SHIFT to, say, 7; I will have to do that just for testing on my
t5-2, since the current value of 8 will likely leave Debian in a single
context domain. Please allow a couple more days for testing.
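
For concreteness, a quick userspace illustration of the shift choice,
assuming 256 cpuids (0-255), roughly what a two-socket t5-2 exposes; the
two-argument CPU_TO_CD() below is only a local variant for this example:

#include <stdio.h>

/* Userspace illustration of the context domain salting; not kernel code. */
#define CD_MASK			0x0fu
#define CPU_TO_CD(cpuid, shift)	(((cpuid) >> (shift)) & CD_MASK)

int main(void)
{
	/* Shift 8: cpuids 0..255 all land in domain 0 (a single domain). */
	printf("shift 8: cpu 0 -> %u, cpu 255 -> %u\n",
	       CPU_TO_CD(0u, 8), CPU_TO_CD(255u, 8));

	/* Shift 7: cpuids split across domains 0 and 1, better for testing. */
	printf("shift 7: cpu 127 -> %u, cpu 128 -> %u\n",
	       CPU_TO_CD(127u, 7), CPU_TO_CD(128u, 7));

	return 0;
}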

To summarize: cross calls are reinstated, mm_cd_destroy() will now release
valid context ids (mitigating wrap), and mm.context.cds is a compile-time
array.
> 
> Just some ideas...
thanx,

bob

Comments

Pavel Tatashin July 25, 2017, 2:09 p.m. UTC | #1
On 07/25/2017 09:45 AM, Bob Picco wrote:
> David Miller wrote:	[Thu Jul 20 2017, 11:06:55PM EDT]
>> From: David Miller <davem@davemloft.net>
>> Date: Fri, 21 Jul 2017 03:50:05 +0100 (WEST)
>>
>>> Having to allocate a full trap frame just to TLB flush one page or an
>>> MM is a serious regression.
>>>
>>> Next, allocating a whole new data structure and clearing it out on
>>> every new address creation is going to be a significant new cost as
>>> well.
> None of this work has had any performance analysis yet; we rushed to post
> it because of forthcoming job reassignments.
>>
>> So, just thinking out loud:
>>
>> 1) You can retain the cross call TLB flush assembler by passing in the
>>     appropriate context value for each individual cpu from the cross
>>     call dispatcher.
> Agreed and reinstated.
>>
>> 2) If you have some constant bounds on the upper number of context
>>     domains, you can simply inline them into the existing mmu_context
>>     structure.  This avoids the memory allocation per mm creation.
> Yes, that is the direction I was moving toward on Friday (7/14). I have
> something similar on top of what Pasha posted; it still has to be debugged
> and tested. Pasha made some nice identifier renames and cleanups before
> posting.
>>
>> You can also make the context domain salting extremely cheap.
>> Perhaps something like "(cpuid>>x) & y".
>>
>> No, you won't map cores to context domains so precisely like the
>> code does now, but you will make up for it in code simplicity and
>> overall new costs added by these changes for the more common
>> cases.
>>
>> I suggest "(cpuid>>x) & y" and a very small number of context domains
>> (which determines 'y') because we don't need something perfect, we
>> need something which divides the problem by some order of magnitude.
>>
>> The hash of locks caught my eye as well.  I think you don't need that
>> and we really steer clear of hashed spinlock tables in the Linux
>> kernel because they never scale properly.
> Agreed, and witnessed during experimentation (7/7 - 7/11) with the
> CTX_NR_BITS reduction.
>>
>> Instead, I think you can use something like RCU to provide the
>> necessary synchronization.  So you could first make sure X isn't
>> referenced on the local cpu any more, and then do call_rcu() to do the
>> actual clearing of the bitmap which allows X to be allocated again.
> Making the context id array a compile-time member of mm_context_t
> eliminates the need for the hashed spin lock. That elimination, plus the
> code simplicity, is why I changed direction on 7/14. I am not sure we will
> need anything RCU-like. This should also make mm_cd_destroy() more
> scalable, with fewer context domains to inspect and, possibly, context ids
> to release. The context id release aspect will decrease wrap events.
> 
> It is difficult to satisfy all topologies; to me it seems all but
> impossible.

Another approach to simplify the initial putback is to remove all hash
locks entirely and keep only one global wrap lock. We would not regress
compared to the baseline, as a global lock is already taken during mm
destroy. Later, if this area is found to be a bottleneck, the locking can
be scaled, either with hash locks or some other scheme.
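
A minimal sketch of what that could look like, assuming the existing global
context-allocation lock (ctx_alloc_lock) is reused and mm_cd_destroy() walks
the cds[] array from the snippet below; free_context_id() is a hypothetical
helper, not something in the posted patch:

/* Sketch only: release all of an mm's per-domain context ids under the one
 * global context-allocation lock, instead of a hash of locks.
 */
void mm_cd_destroy(struct mm_struct *mm)
{
	unsigned long flags;
	int cd;

	spin_lock_irqsave(&ctx_alloc_lock, flags);
	for (cd = 0; cd < NR_CD; cd++) {
		unsigned long ctx = mm->context.cds[cd];

		/* Only a still-valid id needs to go back to the allocator;
		 * releasing it early helps delay the next wrap.
		 */
		if (ctx)
			free_context_id(cd, ctx);	/* hypothetical helper */
	}
	spin_unlock_irqrestore(&ctx_alloc_lock, flags);
}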

> 
> This is a snippet of the patch series. I just booted it on my t5-2:
> diff --git a/arch/sparc/include/asm/mmu_64.h b/arch/sparc/include/asm/mmu_64.h
> index 83b36a5..11f2c07 100644
> --- a/arch/sparc/include/asm/mmu_64.h
> +++ b/arch/sparc/include/asm/mmu_64.h
> @@ -53,10 +53,6 @@
>   #define CTX_HW_MASK		(CTX_NR_MASK | CTX_PGSZ_MASK)
>   
>   #define CTX_FIRST_VERSION	BIT(CTX_VERSION_SHIFT)
> -#define CTX_VALID(__ctx)	\
> -	 (!(((__ctx.sparc64_ctx_val) ^ tlb_context_cache) & CTX_VERSION_MASK))
> -#define CTX_HWBITS(__ctx)	((__ctx.sparc64_ctx_val) & CTX_HW_MASK)
> -#define CTX_NRBITS(__ctx)	((__ctx.sparc64_ctx_val) & CTX_NR_MASK)
>   
>   #ifndef __ASSEMBLY__
>   
> @@ -89,9 +85,21 @@ struct tsb_config {
>   #define MM_NUM_TSBS	1
>   #endif
>   
> +int alloc_context_domain(int cpu);
> +void cd_cpu_online(int cpu);
> +void cd_cpu_offline(int cpu);
> +void mm_cd_destroy(struct mm_struct *mm);
> +
> +#define CPU_TO_CD_SHIFT		(8)
> +#define CD_MAX			(0x10u)
> +#define CD_MASK			(0x0fu)
> +#define CPU_TO_CD(CPUID)	(((CPUID) >> CPU_TO_CD_SHIFT) & CD_MASK)
> +
> +#define NR_CD	(CPU_TO_CD((CONFIG_NR_CPUS - 1)) + 1)
> +
>   typedef struct {
>   	spinlock_t		lock;
> -	unsigned long		sparc64_ctx_val;
> +	unsigned long		cds[NR_CD];
>   	unsigned long		hugetlb_pte_count;
>   	unsigned long		thp_pte_count;
>   	struct tsb_config	tsb_block[MM_NUM_TSBS];
> We may want to reduce CPU_TO_CD_SHIFT to, say, 7; I will have to do that
> just for testing on my t5-2, since the current value of 8 will likely
> leave Debian in a single context domain. Please allow a couple more days
> for testing.
> 
> To summarize: cross calls are reinstated, mm_cd_destroy() will now release
> valid context ids (mitigating wrap), and mm.context.cds is a compile-time
> array.
>>
>> Just some ideas...
> thanx,
> 
> bob

Patch

diff --git a/arch/sparc/include/asm/mmu_64.h b/arch/sparc/include/asm/mmu_64.h
index 83b36a5..11f2c07 100644
--- a/arch/sparc/include/asm/mmu_64.h
+++ b/arch/sparc/include/asm/mmu_64.h
@@ -53,10 +53,6 @@ 
 #define CTX_HW_MASK		(CTX_NR_MASK | CTX_PGSZ_MASK)
 
 #define CTX_FIRST_VERSION	BIT(CTX_VERSION_SHIFT)
-#define CTX_VALID(__ctx)	\
-	 (!(((__ctx.sparc64_ctx_val) ^ tlb_context_cache) & CTX_VERSION_MASK))
-#define CTX_HWBITS(__ctx)	((__ctx.sparc64_ctx_val) & CTX_HW_MASK)
-#define CTX_NRBITS(__ctx)	((__ctx.sparc64_ctx_val) & CTX_NR_MASK)
 
 #ifndef __ASSEMBLY__
 
@@ -89,9 +85,21 @@  struct tsb_config {
 #define MM_NUM_TSBS	1
 #endif
 
+int alloc_context_domain(int cpu);
+void cd_cpu_online(int cpu);
+void cd_cpu_offline(int cpu);
+void mm_cd_destroy(struct mm_struct *mm);
+
+#define CPU_TO_CD_SHIFT		(8)
+#define CD_MAX			(0x10u)
+#define CD_MASK			(0x0fu)
+#define CPU_TO_CD(CPUID)	(((CPUID) >> CPU_TO_CD_SHIFT) & CD_MASK)
+
+#define NR_CD	(CPU_TO_CD((CONFIG_NR_CPUS - 1)) + 1)
+
 typedef struct {
 	spinlock_t		lock;
-	unsigned long		sparc64_ctx_val;
+	unsigned long		cds[NR_CD];
 	unsigned long		hugetlb_pte_count;
 	unsigned long		thp_pte_count;
 	struct tsb_config	tsb_block[MM_NUM_TSBS];
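
The removed CTX_VALID/CTX_HWBITS/CTX_NRBITS macros presumably gain
per-domain replacements elsewhere in the series; a rough sketch of what
indexed forms over cds[] might look like (names and details are guesses,
not taken from the series):

/* Sketch only: per-context-domain variants of the removed macros, indexing
 * the new cds[] array instead of sparc64_ctx_val.  The version cache
 * (tlb_context_cache) may itself become per-domain in the full series.
 */
#define CTX_VALID_CD(__ctx, __cd)	\
	(!(((__ctx).cds[__cd] ^ tlb_context_cache) & CTX_VERSION_MASK))
#define CTX_HWBITS_CD(__ctx, __cd)	((__ctx).cds[__cd] & CTX_HW_MASK)
#define CTX_NRBITS_CD(__ctx, __cd)	((__ctx).cds[__cd] & CTX_NR_MASK)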