From patchwork Sat Dec 1 01:38:31 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: kemi X-Patchwork-Id: 1006284 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Authentication-Results: ozlabs.org; spf=pass (mailfrom) smtp.mailfrom=sourceware.org (client-ip=209.132.180.131; helo=sourceware.org; envelope-from=libc-alpha-return-97835-incoming=patchwork.ozlabs.org@sourceware.org; receiver=) Authentication-Results: ozlabs.org; dmarc=fail (p=none dis=none) header.from=intel.com Authentication-Results: ozlabs.org; dkim=pass (1024-bit key; secure) header.d=sourceware.org header.i=@sourceware.org header.b="ArX/q9/C"; dkim-atps=neutral Received: from sourceware.org (server1.sourceware.org [209.132.180.131]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 436DXj5hf1z9s8r for ; Sat, 1 Dec 2018 12:42:45 +1100 (AEDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:from:to:cc:subject:date:message-id :mime-version:content-type:content-transfer-encoding; q=dns; s= default; b=Cvsqsn4F1DgaTiXnBIXnnTZ8PZCinGDGJ77h5s18SICz5tag4qsxm W/9LTrA5lsEZzvYVCHv2zcgCOiOnCXoqGziE8iVJTjc8/SE2u2OFFJtMKg33y3mf oEb0A6GIBhjqmQhBwVX0N9DCiKZqnjVgohxVYYYfZtZ/eioPgfnP88= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:from:to:cc:subject:date:message-id :mime-version:content-type:content-transfer-encoding; s=default; bh=GTAth4T8p70aP0s9o4yZd9lbNEE=; b=ArX/q9/CKLHmYVUwnhA/G4UoK6oY /hProIEQyBGoH7TP+rR+Uo8H99XmzKKzcDSXSWB9Qybm6ivZoyf50WAPQI/lznRs a9C6po+29yQQsZJ4UU2yI+jT4tPvcij+UIFeVLIF/hzinjt7vJzrdBEFIjA3+1jc lodn75UszmII9ag= Received: (qmail 69856 invoked by alias); 1 Dec 2018 01:42:37 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: libc-alpha-owner@sourceware.org Delivered-To: mailing list libc-alpha@sourceware.org Received: (qmail 69843 invoked by uid 89); 1 Dec 2018 01:42:36 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-26.9 required=5.0 tests=BAYES_00, GIT_PATCH_0, GIT_PATCH_1, GIT_PATCH_2, GIT_PATCH_3, KAM_SHORT, SPF_PASS autolearn=ham version=3.3.2 spammy= X-HELO: mga14.intel.com From: Kemi Wang To: Carlos , Glibc alpha Cc: Kemi Wang Subject: [PATCH V11] Mutex: Add pthread mutex tunables Date: Sat, 1 Dec 2018 09:38:31 +0800 Message-Id: <1543628311-25239-1-git-send-email-kemi.wang@intel.com> MIME-Version: 1.0 This patch does not have any functionality change, we only provide a spin count tunes for pthread adaptive spin mutex. The tunable glibc.pthread.mutex_spin_count tunes can be used by system administrator to squeeze system performance according to different hardware capabilities and workload characteristics. The maximum value of spin count is limited to 32767 to avoid the overflow of mutex->__data.__spins variable with the possible type of short in pthread_mutex_lock (). The default value of spin count is set to 100 with the reference to the previous number of times of spinning via trylock. This value would be architecture-specific and can be tuned with kinds of benchmarks to fit most cases in future. * sysdeps/nptl/dl-tunables.list: Add glibc.pthread.mutex_spin_count entry. * manual/tunables.texi: Add glibc.pthread.mutex_spin_count description. * nptl/pthread_mutex_conf.h: New file. * nptl/pthread_mutex_conf.c: New file. * nptl/Makefile: Add pthread_mutex_conf.c for compilation. * nptl/nptl-init.c: Put pthread mutex tunable initialization in pthread initialization. * nptl/pthreadP.h: Add a new function max_adaptive_count() to get the maximum adaptive spin value. * sysdeps/generic/adaptive_spin_count.h: Move DEFAULT_ADAPTIVE_COUNT macro to a generic header. * nptl/pthread_mutex_lock.c: Replace MAX_ADAPTIVE_COUNT by max_adaptive_count(). * nptl/pthread_mutex_timedlock.c: Replace MAX_ADAPTIVE_COUNT by max_adaptive_count(). I would extend my appreciation sincerely to H.J.Lu for his help to refine this patch series. ChangeLog: V10->V11: a) Fix comment style to use GNU style, and fixing indent in adaptive_spin_count.h. b) Update Changelog. V9->V10: a) Remove superfluous comments, as suggested by Adhemerval Zanella b) Use max_adaptive_count() to get the maximum spin count, as suggested by Adhemerval Zanella c) Move DEFAULT_ADAPTIVE_COUNT definition to a generic header and override this header by the architecture in future, as suggested by Adhemerval Zanella d) Use macro DEFAULT_ADAPTIVE_COUNT to replace magic number 100, as suggested by Carlos e) Other minor change on tunable description, as suggested by Carlos f) Change maximum value of adaptive spin count from 30000 to 32767(full short range), as suggested by Adhemerval Zanella. g) Move "struct mutex_config" definition back to pthread_mutex_conf.h, thus, the header pthreadP.h does not have to be included in pthread_mutex_conf.c V8->V9: a) Add the "unistd.h" header file back, or it will cause build regression on 32 bits system. b) Rebase on the latest master branch c) Tested on x86_64 with build-many-glibcs.py V7->V8: a) Refine the pthread tunables description in manual/tunables.texi accordingly to Carlos O'Donell and Rical Jason. V6->V7: a) Patch is refined by H.J.Lu V5->V6: a) Missing "pthread mutex tunables" entry in the menu of tunables.texi, add it. V4->V5 a) Put mutex tunable (glibc.mutex.spin_count) initialization as part of overall pthread initialization, that would avoid the extra relocation, as suggested by Florian Weimer. Thanks for pointing it out! b) Move the READ_ONLY_SPIN macro definition from the third patch to this patch V3->V4 a) Add comments in elf/dl-tunables.list V2->V3 a) Polish the description of glibc.mutex.spin_count tunable with the help from Rical Jasan. b) Get rid of the TUNABLE_CALLBACK_FNDECL macros in pthread_mutex_conf.c, as suggested by Florian Weimer. c) Adjust the default value of spin count to 100 with the reference of the previous spinning way via trylock. V1->V2 a) Renamed nptl/mutex-conf.h -> nptl/pthread_mutex_conf.h b) Renamed nptl/mutex-conf.c -> nptl/pthread_mutex_conf.c c) Change the Makefile to compile pthread_mutex_conf.c d) Modify the copyright "2013-2018" -> "2018" for new added files e) Fix the indentation issue (tab -> double space) in elf/dl-tunables.list f) Remove the env alias LD_SPIN_COUNT in elf/dl-tunables.list g) Fix the typo errors and refresh glibc.mutex.spin_count tunable description in manual/tunables.texi. h) Fix the indentation issue in nptl/pthread_mutex_conf.c i) Fix the indentation issue for nested preprocessor (add one space for each level) Suggested-by: Andi Kleen Reviewed-by: Carlos O'Donell Signed-off-by: Kemi.wang --- ChangeLog | 18 ++++++++++++++ manual/tunables.texi | 27 +++++++++++++++++++++ nptl/Makefile | 2 +- nptl/nptl-init.c | 5 ++++ nptl/pthreadP.h | 11 ++++++--- nptl/pthread_mutex_conf.c | 45 +++++++++++++++++++++++++++++++++++ nptl/pthread_mutex_conf.h | 34 ++++++++++++++++++++++++++ nptl/pthread_mutex_lock.c | 2 +- nptl/pthread_mutex_timedlock.c | 2 +- sysdeps/generic/adaptive_spin_count.h | 22 +++++++++++++++++ sysdeps/nptl/dl-tunables.list | 27 +++++++++++++++++++++ 11 files changed, 189 insertions(+), 6 deletions(-) create mode 100644 nptl/pthread_mutex_conf.c create mode 100644 nptl/pthread_mutex_conf.h create mode 100644 sysdeps/generic/adaptive_spin_count.h create mode 100644 sysdeps/nptl/dl-tunables.list diff --git a/ChangeLog b/ChangeLog index 87d3863..773a7ec 100644 --- a/ChangeLog +++ b/ChangeLog @@ -1,3 +1,21 @@ +2018-12-1 Kemi Wang + + * sysdeps/nptl/dl-tunables.list: Add glibc.pthread.mutex_spin_count entry. + * manual/tunables.texi: Add glibc.pthread.mutex_spin_count description. + * nptl/pthread_mutex_conf.h: New file. + * nptl/pthread_mutex_conf.c: New file. + * nptl/Makefile: Add pthread_mutex_conf.c for compilation. + * nptl/nptl-init.c: Put pthread mutex tunable initialization in pthread + initialization. + * nptl/pthreadP.h: Add a new function max_adaptive_count() to get the + maximum adaptive spin value. + * sysdeps/generic/adaptive_spin_count.h: Move DEFAULT_ADAPTIVE_COUNT + macro to a generic header. + * nptl/pthread_mutex_lock.c: Replace MAX_ADAPTIVE_COUNT by + max_adaptive_count(). + * nptl/pthread_mutex_timedlock.c: Replace MAX_ADAPTIVE_COUNT by + max_adaptive_count(). + 2018-11-30 Rafael Ávila de Espíndola [BZ #19767] diff --git a/manual/tunables.texi b/manual/tunables.texi index 3345a23..09a2565 100644 --- a/manual/tunables.texi +++ b/manual/tunables.texi @@ -32,6 +32,7 @@ their own namespace. * Tunable names:: The structure of a tunable name * Memory Allocation Tunables:: Tunables in the memory allocation subsystem * Elision Tunables:: Tunables in elision subsystem +* POSIX Thread Tunables:: Tunables in the POSIX thread subsystem * Hardware Capability Tunables:: Tunables that modify the hardware capabilities seen by @theglibc{} @end menu @@ -281,6 +282,32 @@ of try lock attempts. The default value of this tunable is @samp{3}. @end deftp +@node POSIX Thread Tunables +@section POSIX Thread Tunables +@cindex pthread mutex tunables +@cindex thread mutex tunables +@cindex mutex tunables +@cindex tunables thread mutex + +@deftp {Tunable namespace} glibc.pthread +The behavior of POSIX threads can be tuned to gain performance improvements +according to specific hardware capabilities and workload characteristics by +setting the following tunables in the @code{pthread} namespace: +@end deftp + +@deftp Tunable glibc.pthread.mutex_spin_count +The @code{glibc.pthread.mutex_spin_count} tunable sets the maximum number of times +a thread should spin on the lock before calling into the kernel to block. +Adaptive spin is used for mutexes initialized with the +@code{PTHREAD_MUTEX_ADAPTIVE_NP} GNU extension. It affects both +@code{pthread_mutex_lock} and @code{pthread_mutex_timedlock}. + +The thread spins until either the maximum spin count is reached or the lock +is acquired. + +The default value of this tunable is @samp{100}. +@end deftp + @node Hardware Capability Tunables @section Hardware Capability Tunables @cindex hardware capability tunables diff --git a/nptl/Makefile b/nptl/Makefile index 98b0aa0..34ae830 100644 --- a/nptl/Makefile +++ b/nptl/Makefile @@ -145,7 +145,7 @@ libpthread-routines = nptl-init nptlfreeres vars events version pt-interp \ mtx_destroy mtx_init mtx_lock mtx_timedlock \ mtx_trylock mtx_unlock call_once cnd_broadcast \ cnd_destroy cnd_init cnd_signal cnd_timedwait cnd_wait \ - tss_create tss_delete tss_get tss_set + tss_create tss_delete tss_get tss_set pthread_mutex_conf # pthread_setuid pthread_seteuid pthread_setreuid \ # pthread_setresuid \ # pthread_setgid pthread_setegid pthread_setregid \ diff --git a/nptl/nptl-init.c b/nptl/nptl-init.c index 907411d..adf99f1 100644 --- a/nptl/nptl-init.c +++ b/nptl/nptl-init.c @@ -38,6 +38,7 @@ #include #include #include +#include #ifndef TLS_MULTIPLE_THREADS_IN_TCB /* Pointer to the corresponding variable in libc. */ @@ -431,6 +432,10 @@ __pthread_initialize_minimal_internal (void) /* Determine whether the machine is SMP or not. */ __is_smp = is_smp_system (); + +#if HAVE_TUNABLES + pthread_tunables_init (); +#endif } strong_alias (__pthread_initialize_minimal_internal, __pthread_initialize_minimal) diff --git a/nptl/pthreadP.h b/nptl/pthreadP.h index 19efe1e..7f16ba9 100644 --- a/nptl/pthreadP.h +++ b/nptl/pthreadP.h @@ -33,6 +33,7 @@ #include #include #include +#include "pthread_mutex_conf.h" /* Atomic operations on TLS memory. */ @@ -47,10 +48,14 @@ #endif -/* Adaptive mutex definitions. */ -#ifndef MAX_ADAPTIVE_COUNT -# define MAX_ADAPTIVE_COUNT 100 +static inline short max_adaptive_count (void) +{ +#if HAVE_TUNABLES + return __mutex_aconf.spin_count; +#else + return DEFAULT_ADAPTIVE_COUNT; #endif +} /* Magic cookie representing robust mutex with dead owner. */ diff --git a/nptl/pthread_mutex_conf.c b/nptl/pthread_mutex_conf.c new file mode 100644 index 0000000..612981a --- /dev/null +++ b/nptl/pthread_mutex_conf.c @@ -0,0 +1,45 @@ +/* Pthread mutex tunable parameters. + Copyright (C) 2018 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#if HAVE_TUNABLES +# define TUNABLE_NAMESPACE pthread +#include +#include +#include +#include /* Get STDOUT_FILENO for _dl_printf. */ +#include + +struct mutex_config __mutex_aconf = +{ + /* The maximum number of times a thread should spin on the lock before + calling into kernel to block. */ + .spin_count = DEFAULT_ADAPTIVE_COUNT, +}; + +static void +TUNABLE_CALLBACK (set_mutex_spin_count) (tunable_val_t *valp) +{ + __mutex_aconf.spin_count = (int32_t) (valp)->numval; +} + +void pthread_tunables_init (void) +{ + TUNABLE_GET (mutex_spin_count, int32_t, + TUNABLE_CALLBACK (set_mutex_spin_count)); +} +#endif diff --git a/nptl/pthread_mutex_conf.h b/nptl/pthread_mutex_conf.h new file mode 100644 index 0000000..945eff8 --- /dev/null +++ b/nptl/pthread_mutex_conf.h @@ -0,0 +1,34 @@ +/* Pthread mutex tunable parameters. + Copyright (C) 2018 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ +#ifndef _PTHREAD_MUTEX_CONF_H +#define _PTHREAD_MUTEX_CONF_H 1 + +#include + +#if HAVE_TUNABLES +struct mutex_config +{ + int spin_count; +}; + +extern struct mutex_config __mutex_aconf attribute_hidden; + +extern void pthread_tunables_init (void); +#endif + +#endif diff --git a/nptl/pthread_mutex_lock.c b/nptl/pthread_mutex_lock.c index 29cc143..474b4df 100644 --- a/nptl/pthread_mutex_lock.c +++ b/nptl/pthread_mutex_lock.c @@ -126,7 +126,7 @@ __pthread_mutex_lock (pthread_mutex_t *mutex) if (LLL_MUTEX_TRYLOCK (mutex) != 0) { int cnt = 0; - int max_cnt = MIN (MAX_ADAPTIVE_COUNT, + int max_cnt = MIN (max_adaptive_count (), mutex->__data.__spins * 2 + 10); do { diff --git a/nptl/pthread_mutex_timedlock.c b/nptl/pthread_mutex_timedlock.c index 888c12f..453b824 100644 --- a/nptl/pthread_mutex_timedlock.c +++ b/nptl/pthread_mutex_timedlock.c @@ -118,7 +118,7 @@ __pthread_mutex_timedlock (pthread_mutex_t *mutex, if (lll_trylock (mutex->__data.__lock) != 0) { int cnt = 0; - int max_cnt = MIN (MAX_ADAPTIVE_COUNT, + int max_cnt = MIN (max_adaptive_count (), mutex->__data.__spins * 2 + 10); do { diff --git a/sysdeps/generic/adaptive_spin_count.h b/sysdeps/generic/adaptive_spin_count.h new file mode 100644 index 0000000..6b30a2a --- /dev/null +++ b/sysdeps/generic/adaptive_spin_count.h @@ -0,0 +1,22 @@ +/* Maximum adaptive spin count by default + Copyright (C) 2018 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +/* The choice of 100 spins for the default spin count for an adaptive spin + is a completely arbitrary choice that has not been evaluated thoroughly + using modern hardware. */ +#define DEFAULT_ADAPTIVE_COUNT 100 diff --git a/sysdeps/nptl/dl-tunables.list b/sysdeps/nptl/dl-tunables.list new file mode 100644 index 0000000..beebd5a --- /dev/null +++ b/sysdeps/nptl/dl-tunables.list @@ -0,0 +1,27 @@ +# Copyright (C) 2018 Free Software Foundation, Inc. +# This file is part of the GNU C Library. + +# The GNU C Library is free software; you can redistribute it and/or +# modify it under the terms of the GNU Lesser General Public +# License as published by the Free Software Foundation; either +# version 2.1 of the License, or (at your option) any later version. + +# The GNU C Library is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU +# Lesser General Public License for more details. + +# You should have received a copy of the GNU Lesser General Public +# License along with the GNU C Library; if not, see +# . + +glibc { + pthread { + mutex_spin_count { + type: INT_32 + minval: 0 + maxval: 32767 + default: 100 + } + } +}