
[RFC] sparc64: Meaning of /sys/**/core_siblings on newer platforms.

Message ID 029d528e-27bf-b55c-5dfb-335d202dc1ce@oracle.com
State Superseded
Delegated to: David Miller

Commit Message

chris hyser June 6, 2016, 10:23 p.m. UTC
Before the SPARC T7, the notion of core_siblings covered both the set of CPUs
that share a common highest-level cache and the set of CPUs within a
particular socket (i.e. sharing the same package_id). This was also true on
older x86 CPUs, and perhaps recent ones as well, though my knowledge of x86 is
dated.

The idea of same package_id is stated in Documentation/cputopology.txt, and
programs such as lscpu have relied upon it to find the number of sockets by
counting the number of unique core_siblings_list entries. I suspect the
reliance on that algorithm predates the ability to read package IDs directly,
which is simpler, more straightforward, and preserves the platform-assigned
package ID rather than an ID that is just an incremented index based on order
of discovery.
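
For illustration, here is a minimal user-space sketch of both counting
approaches (this is not lscpu's actual code; it only reads the standard sysfs
topology attributes and assumes CPUs are numbered contiguously from 0):

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define MAX_CPUS 4096

static int read_topology(int cpu, const char *attr, char *buf, size_t len)
{
	char path[128];
	FILE *f;

	snprintf(path, sizeof(path),
		 "/sys/devices/system/cpu/cpu%d/topology/%s", cpu, attr);
	f = fopen(path, "r");
	if (!f)
		return -1;
	if (!fgets(buf, len, f)) {
		fclose(f);
		return -1;
	}
	fclose(f);
	buf[strcspn(buf, "\n")] = '\0';
	return 0;
}

int main(void)
{
	/* illustration-sized buffers; a real tool would size these dynamically */
	static char seen[MAX_CPUS][64];	/* unique core_siblings_list strings */
	static long pkg_seen[MAX_CPUS];	/* unique physical_package_id values */
	int nsib = 0, npkg = 0;
	char buf[64];
	int cpu;

	for (cpu = 0; cpu < MAX_CPUS; cpu++) {
		int i;

		/* old style: count distinct core_siblings_list entries */
		if (read_topology(cpu, "core_siblings_list", buf, sizeof(buf)))
			break;			/* no more CPUs */
		for (i = 0; i < nsib; i++)
			if (!strcmp(seen[i], buf))
				break;
		if (i == nsib)
			strcpy(seen[nsib++], buf);

		/* direct style: count distinct physical_package_id values */
		if (!read_topology(cpu, "physical_package_id", buf, sizeof(buf))) {
			long id = strtol(buf, NULL, 0);

			for (i = 0; i < npkg; i++)
				if (pkg_seen[i] == id)
					break;
			if (i == npkg)
				pkg_seen[npkg++] = id;
		}
	}

	printf("sockets by core_siblings_list:  %d\n", nsib);
	printf("sockets by physical_package_id: %d\n", npkg);
	return 0;
}

With the patch below applied on a T7-class machine, the first count would
effectively report highest-shared-cache groups rather than sockets, while the
second would still report sockets.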

The idea that it needs to represent the shared common highest-level cache
comes from irqbalance, an important run-time performance-enhancing daemon.

irqbalance uses the following hierarchy of locality goodness (a small sketch
of checking it follows the list):

          - shared common core (thread_siblings)
          - shared common cache (core_siblings)
          - shared common socket (CPUs with same physical_package_id)
          - shared common node (CPUs in same node)
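
For reference, a minimal sketch (not irqbalance's actual code) of walking
that hierarchy for two CPUs via the standard sysfs topology attributes; the
CPU numbers are arbitrary and NUMA node membership is left as a note:

#include <stdio.h>
#include <string.h>

static int read_attr(int cpu, const char *attr, char *buf, size_t len)
{
	char path[128];
	FILE *f;

	snprintf(path, sizeof(path),
		 "/sys/devices/system/cpu/cpu%d/topology/%s", cpu, attr);
	f = fopen(path, "r");
	if (!f || !fgets(buf, len, f)) {
		if (f)
			fclose(f);
		return -1;
	}
	fclose(f);
	buf[strcspn(buf, "\n")] = '\0';
	return 0;
}

/* returns 1 when both CPUs report the same value for @attr */
static int same(int a, int b, const char *attr)
{
	char va[256], vb[256];

	if (read_attr(a, attr, va, sizeof(va)) ||
	    read_attr(b, attr, vb, sizeof(vb)))
		return 0;
	return !strcmp(va, vb);
}

int main(void)
{
	int a = 0, b = 1;	/* any two CPUs to compare */

	if (same(a, b, "thread_siblings_list"))
		puts("same core");
	else if (same(a, b, "core_siblings_list"))
		puts("same shared cache");
	else if (same(a, b, "physical_package_id"))
		puts("same socket");
	else
		puts("at best same node (compare the NUMA node directories)");
	return 0;
}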

This layout describes the T7 exactly, and it interestingly suggests that one
or more other architectures have reached the point where enough cores can be
packed into the same package that a shared highest-level cache is either not
desirable or not worth the die area/effort. Said differently, "socket" will
likely become less synonymous with shared cache in the future and more
synonymous with node. I'm still digging to find out whether that is so and
which architectures those are.

The issue is that on newer SPARC HW both definitions can no longer be true at
the same time, and choosing one over the other will break different sets of
code. The choice can be illustrated as one between an unmodified lscpu
printing nonsensical answers (although it can already do that for unrelated
reasons) and an unmodified irqbalance making cache-thrashing placement
decisions. The number of important programs in each class is unknown, but
either way some things will have to be fixed. As I believe the whole point of
large SPARC servers is performance, and the goal of the people on the SPARC
mailing list is to maximize SPARC Linux performance, I would argue for not
breaking what I would call the performance class of programs rather than the
topology-description class.

Rationale:

- Performance-class breakage is harder to diagnose: it shows up as lost
performance, and tracing that back to a root cause is incredibly difficult.
Topology-description programs, on the other hand, print easily identified
nonsense and can be modified in a manner that is actually more
straightforward than the current algorithm, while preserving architecturally
neutral functional correctness (i.e. not hacks/workarounds).

Attached is a working sparc64 patch redefining core_siblings in favor of
"shared highest-level cache" (not intended in its current form for actual
upstream submission, but to clarify the proposal and allow actual testing).
I'm seeking feedback on how to proceed here, to prevent wasted effort fixing
the wrong set of userland programs and related in-progress patches for SPARC
sysfs.

Example results of patch:

Before:
          [root@ca-sparc30 topology]# cat core_siblings_list
          32-63,128-223

After:
          [root@ca-sparc30 topology]# cat core_siblings_list
          32-63


Comments

Julian Calaby June 7, 2016, 12:14 a.m. UTC | #1
Hi Chris,

On Tue, Jun 7, 2016 at 8:23 AM, chris hyser <chris.hyser@oracle.com> wrote:
> Before SPARC T7, the notion of core_siblings was both those CPUs that share
> a
> common highest level cache and the set of CPUs within a particular socket
> (share same package_id). This was also true on older x86 CPUs and perhaps
> most
> recent though my knowledge of x86 is dated.
>
> The idea of same package_id is stated in Documentation/cputopology.txt and
> programs such as lscpu have relied upon this to find the number of sockets
> by
> counting the number of unique core_siblings_list entries. I suspect the
> reliance
> on that algorithm predates the ability to read package IDs directly which is
> simpler, more straightforward and preserves the platform assigned package ID
> versus an ID that is just an incremented index based on order of discovery.
>
> The idea that it needs to represent shared common highest level cache comes
> from irqbalance, an important run-time performance enhancing daemon.
>
> irqbalance uses the following hierarchy of locality goodness:
>
>          - shared common core (thread_siblings)
>          - shared common cache (core_siblings)
>          - shared common socket (CPUs with same physical_package_id)
>          - shared common node (CPUS in same node)
>
> This layout perfectly describes the T7 and interestingly suggests that there
> are
> one or more other architectures that have reached the point where enough
> cores
> can be jammed into the same package that a shared high level cache is either
> not
> desirable or not worth the real estate/effort. Said differently, socket in
> the
> future will likely become less synonymous with shared cache and instead more
> synonymous with node. I'm still digging to see if and what those
> architectures
> are.
>
> The issue is that on newer SPARC HW both definitions can no longer be true
> and
> that choosing one versus the other will break differing sets of code. This
> can
> be illustrated as a choice between an unmodified lscpu spitting out
> nonsensical
> answers (although it currently can do that for different unrelated reasons)
> or
> an unmodified irqbalance incorrectly making cache-thrashing decisions. The
> number of important programs in each class is unknown, but either way some
> things will have to be fixed. As I believe the whole point of large SPARC
> servers is performance and the goal of the people on the SPARC mailing list
> is
> to maximize SPARC linux performance, I would argue for not breaking what I
> would call the performance class of programs versus the topology description
> class.
>
> Rationale:
>
> - performance class breakage is harder to diagnose as it results in lost
> performance and tracing back to root cause is incredibly difficult. Topology
> description programs on the other hand spit out easily identified nonsense
> and can be modified in a manner that is actually more straight forward than
> the current algorithm while preserving architecturally neutral functional
> correctness (i.e. not hacks/workarounds)
>
> Attached is a working sparc64 patch for redefinition in favor of "shared
> highest level cache" (not intended in its current form for actual upstream
> submission but to clarify the proposal and allow actual testing). I'm
> seeking
> feedback on how to proceed here to prevent wasted effort fixing the wrong
> set
> of user land programs and related in-progress patches for SPARC sysfs.
>
> Example results of patch:
>
> Before:
>          [root@ca-sparc30 topology]# cat core_siblings_list
>          32-63,128-223
>
> After:
>          [root@ca-sparc30 topology]# cat core_siblings_list
>          32-63
>
> diff --git a/arch/sparc/kernel/mdesc.c b/arch/sparc/kernel/mdesc.c
> index 1122886..e1b3893 100644
> --- a/arch/sparc/kernel/mdesc.c
> +++ b/arch/sparc/kernel/mdesc.c
>   @@ -597,20 +598,21 @@ static void fill_in_one_cache(cpuinfo_sparc *c,
> struct mdesc_handle *hp, u64 mp)
>                 c->ecache_line_size = *line_size;
>                 break;
>   +     case 3:

Is your patch mangled?

> +               c->l3_cache_size = *size;
> +               c->l3_cache_line_size = *line_size;
> +               break;
> +
>         default:
>                 break;
>         }
>   -     if (*level == 1) {
> -               u64 a;
> -
> -               mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
> -                       u64 target = mdesc_arc_target(hp, a);
> -                       const char *name = mdesc_node_name(hp, target);
> +       mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
> +               u64 target = mdesc_arc_target(hp, a);
> +               const char *name = mdesc_node_name(hp, target);
>   -                     if (!strcmp(name, "cache"))
> -                               fill_in_one_cache(c, hp, target);
> -               }
> +               if (!strcmp(name, "cache"))
> +                       fill_in_one_cache(c, hp, target);
>         }
>   }
>   @@ -645,13 +647,19 @@ static void __mark_core_id(struct mdesc_handle *hp,
> u64 node,
>                 cpu_data(*id).core_id = core_id;
>   }
>   -static void __mark_sock_id(struct mdesc_handle *hp, u64 node,
> -                          int sock_id)
> +static void __mark_max_cache_id(struct mdesc_handle *hp, u64 node,
> +                               int max_cache_id)
>   {
>         const u64 *id = mdesc_get_property(hp, node, "id", NULL);
>   -     if (*id < num_possible_cpus())
> -               cpu_data(*id).sock_id = sock_id;
> +       if (*id < num_possible_cpus()) {
> +               cpu_data(*id).max_cache_id = max_cache_id;
> +
> +               /* On systems without explicit socket descriptions, socket
> +                * is max_cache_id
> +                */
> +               cpu_data(*id).sock_id = max_cache_id;
> +       }
>   }
>    static void mark_core_ids(struct mdesc_handle *hp, u64 mp,
> @@ -660,10 +668,11 @@ static void mark_core_ids(struct mdesc_handle *hp, u64
> mp,
>         find_back_node_value(hp, mp, "cpu", __mark_core_id, core_id, 10);
>   }
>   -static void mark_sock_ids(struct mdesc_handle *hp, u64 mp,
> -                         int sock_id)
> +static void mark_max_cache_ids(struct mdesc_handle *hp, u64 mp,
> +                              int max_cache_id)
>   {
> -       find_back_node_value(hp, mp, "cpu", __mark_sock_id, sock_id, 10);
> +       find_back_node_value(hp, mp, "cpu", __mark_max_cache_id,
> +                            max_cache_id, 10);
>   }
>    static void set_core_ids(struct mdesc_handle *hp)
> @@ -694,14 +703,15 @@ static void set_core_ids(struct mdesc_handle *hp)
>         }
>   }
>   -static int set_sock_ids_by_cache(struct mdesc_handle *hp, int level)
> +static int set_max_cache_ids_by_cache(struct mdesc_handle *hp,
> +                                     int level)
>   {
>         u64 mp;
>         int idx = 1;
>         int fnd = 0;
>   -     /* Identify unique sockets by looking for cpus backpointed to by
> -        * shared level n caches.
> +       /* Identify unique highest level of shared cache by looking for cpus
> +        * backpointed to by shared level N caches.
>          */
>         mdesc_for_each_node_by_name(hp, mp, "cache") {
>                 const u64 *cur_lvl;
> @@ -710,7 +720,7 @@ static int set_sock_ids_by_cache(struct mdesc_handle
> *hp, int level)
>                 if (*cur_lvl != level)
>                         continue;
>   -             mark_sock_ids(hp, mp, idx);
> +               mark_max_cache_ids(hp, mp, idx);
>                 idx++;
>                 fnd = 1;
>         }
> @@ -745,15 +755,17 @@ static void set_sock_ids(struct mdesc_handle *hp)
>   {
>         u64 mp;
>   +     /* Find the highest level of shared cache which on pre-T7 is also
> +        * the socket.
> +        */
> +       if (!set_max_cache_ids_by_cache(hp, 3))
> +               set_max_cache_ids_by_cache(hp, 2);
> +
>         /* If machine description exposes sockets data use it.
> -        * Otherwise fallback to use shared L3 or L2 caches.
>          */
>         mp = mdesc_node_by_name(hp, MDESC_NODE_NULL, "sockets");
>         if (mp != MDESC_NODE_NULL)
> -               return set_sock_ids_by_socket(hp, mp);
> -
> -       if (!set_sock_ids_by_cache(hp, 3))
> -               set_sock_ids_by_cache(hp, 2);
> +               set_sock_ids_by_socket(hp, mp);
>   }
>    static void mark_proc_ids(struct mdesc_handle *hp, u64 mp, int proc_id)
> diff --git a/arch/sparc/kernel/smp_64.c b/arch/sparc/kernel/smp_64.c
> index 8a6151a..bbe27a4 100644
> --- a/arch/sparc/kernel/smp_64.c
> +++ b/arch/sparc/kernel/smp_64.c
> @@ -1250,8 +1250,11 @@ void smp_fill_in_sib_core_maps(void)
>         for_each_present_cpu(i)  {
>                 unsigned int j;
>   +             cpumask_clear(&cpu_core_sib_map[i]);
> +
>                 for_each_present_cpu(j)  {
> -                       if (cpu_data(i).sock_id == cpu_data(j).sock_id)
> +                       if (cpu_data(i).max_cache_id ==
> +                           cpu_data(j).max_cache_id)
>                                 cpumask_set_cpu(j, &cpu_core_sib_map[i]);
>                 }
>         }
> --
> To unsubscribe from this list: send the line "unsubscribe sparclinux" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
chris hyser June 7, 2016, 2:11 a.m. UTC | #2
On 6/6/2016 8:14 PM, Julian Calaby wrote:
> Hi Chris,
>
> On Tue, Jun 7, 2016 at 8:23 AM, chris hyser <chris.hyser@oracle.com> wrote:
>> Before SPARC T7, the notion of core_siblings was both those CPUs that share
>> a
>> common highest level cache and the set of CPUs within a particular socket
>> (share same package_id). This was also true on older x86 CPUs and perhaps
>> most
>> recent though my knowledge of x86 is dated.
>>
>> The idea of same package_id is stated in Documentation/cputopology.txt and
>> programs such as lscpu have relied upon this to find the number of sockets
>> by
>> counting the number of unique core_siblings_list entries. I suspect the
>> reliance
>> on that algorithm predates the ability to read package IDs directly which is
>> simpler, more straightforward and preserves the platform assigned package ID
>> versus an ID that is just an incremented index based on order of discovery.
>>
>> The idea that it needs to represent shared common highest level cache comes
>> from irqbalance, an important run-time performance enhancing daemon.
>>
>> irqbalance uses the following hierarchy of locality goodness:
>>
>>          - shared common core (thread_siblings)
>>          - shared common cache (core_siblings)
>>          - shared common socket (CPUs with same physical_package_id)
>>          - shared common node (CPUS in same node)
>>
>> This layout perfectly describes the T7 and interestingly suggests that there
>> are
>> one or more other architectures that have reached the point where enough
>> cores
>> can be jammed into the same package that a shared high level cache is either
>> not
>> desirable or not worth the real estate/effort. Said differently, socket in
>> the
>> future will likely become less synonymous with shared cache and instead more
>> synonymous with node. I'm still digging to see if and what those
>> architectures
>> are.
>>
>> The issue is that on newer SPARC HW both definitions can no longer be true
>> and
>> that choosing one versus the other will break differing sets of code. This
>> can
>> be illustrated as a choice between an unmodified lscpu spitting out
>> nonsensical
>> answers (although it currently can do that for different unrelated reasons)
>> or
>> an unmodified irqbalance incorrectly making cache-thrashing decisions. The
>> number of important programs in each class is unknown, but either way some
>> things will have to be fixed. As I believe the whole point of large SPARC
>> servers is performance and the goal of the people on the SPARC mailing list
>> is
>> to maximize SPARC linux performance, I would argue for not breaking what I
>> would call the performance class of programs versus the topology description
>> class.
>>
>> Rationale:
>>
>> - performance class breakage is harder to diagnose as it results in lost
>> performance and tracing back to root cause is incredibly difficult. Topology
>> description programs on the other hand spit out easily identified nonsense
>> and can be modified in a manner that is actually more straight forward than
>> the current algorithm while preserving architecturally neutral functional
>> correctness (i.e. not hacks/workarounds)
>>
>> Attached is a working sparc64 patch for redefinition in favor of "shared
>> highest level cache" (not intended in its current form for actual upstream
>> submission but to clarify the proposal and allow actual testing). I'm
>> seeking
>> feedback on how to proceed here to prevent wasted effort fixing the wrong
>> set
>> of user land programs and related in-progress patches for SPARC sysfs.
>>
>> Example results of patch:
>>
>> Before:
>>          [root@ca-sparc30 topology]# cat core_siblings_list
>>          32-63,128-223
>>
>> After:
>>          [root@ca-sparc30 topology]# cat core_siblings_list
>>          32-63
>>
>> diff --git a/arch/sparc/kernel/mdesc.c b/arch/sparc/kernel/mdesc.c
>> index 1122886..e1b3893 100644
>> --- a/arch/sparc/kernel/mdesc.c
>> +++ b/arch/sparc/kernel/mdesc.c
>>   @@ -597,20 +598,21 @@ static void fill_in_one_cache(cpuinfo_sparc *c,
>> struct mdesc_handle *hp, u64 mp)
>>                 c->ecache_line_size = *line_size;
>>                 break;
>>   +     case 3:
>
> Is your patch mangled?

Apparently. I had to do this in a weird way. I tried to be extra careful. Let me try again. Apologies.

>
>> +               c->l3_cache_size = *size;
>> +               c->l3_cache_line_size = *line_size;
>> +               break;
>> +
>>         default:
>>                 break;
>>         }
>>   -     if (*level == 1) {
>> -               u64 a;
>> -
>> -               mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
>> -                       u64 target = mdesc_arc_target(hp, a);
>> -                       const char *name = mdesc_node_name(hp, target);
>> +       mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
>> +               u64 target = mdesc_arc_target(hp, a);
>> +               const char *name = mdesc_node_name(hp, target);
>>   -                     if (!strcmp(name, "cache"))
>> -                               fill_in_one_cache(c, hp, target);
>> -               }
>> +               if (!strcmp(name, "cache"))
>> +                       fill_in_one_cache(c, hp, target);
>>         }
>>   }
>>   @@ -645,13 +647,19 @@ static void __mark_core_id(struct mdesc_handle *hp,
>> u64 node,
>>                 cpu_data(*id).core_id = core_id;
>>   }
>>   -static void __mark_sock_id(struct mdesc_handle *hp, u64 node,
>> -                          int sock_id)
>> +static void __mark_max_cache_id(struct mdesc_handle *hp, u64 node,
>> +                               int max_cache_id)
>>   {
>>         const u64 *id = mdesc_get_property(hp, node, "id", NULL);
>>   -     if (*id < num_possible_cpus())
>> -               cpu_data(*id).sock_id = sock_id;
>> +       if (*id < num_possible_cpus()) {
>> +               cpu_data(*id).max_cache_id = max_cache_id;
>> +
>> +               /* On systems without explicit socket descriptions, socket
>> +                * is max_cache_id
>> +                */
>> +               cpu_data(*id).sock_id = max_cache_id;
>> +       }
>>   }
>>    static void mark_core_ids(struct mdesc_handle *hp, u64 mp,
>> @@ -660,10 +668,11 @@ static void mark_core_ids(struct mdesc_handle *hp, u64
>> mp,
>>         find_back_node_value(hp, mp, "cpu", __mark_core_id, core_id, 10);
>>   }
>>   -static void mark_sock_ids(struct mdesc_handle *hp, u64 mp,
>> -                         int sock_id)
>> +static void mark_max_cache_ids(struct mdesc_handle *hp, u64 mp,
>> +                              int max_cache_id)
>>   {
>> -       find_back_node_value(hp, mp, "cpu", __mark_sock_id, sock_id, 10);
>> +       find_back_node_value(hp, mp, "cpu", __mark_max_cache_id,
>> +                            max_cache_id, 10);
>>   }
>>    static void set_core_ids(struct mdesc_handle *hp)
>> @@ -694,14 +703,15 @@ static void set_core_ids(struct mdesc_handle *hp)
>>         }
>>   }
>>   -static int set_sock_ids_by_cache(struct mdesc_handle *hp, int level)
>> +static int set_max_cache_ids_by_cache(struct mdesc_handle *hp,
>> +                                     int level)
>>   {
>>         u64 mp;
>>         int idx = 1;
>>         int fnd = 0;
>>   -     /* Identify unique sockets by looking for cpus backpointed to by
>> -        * shared level n caches.
>> +       /* Identify unique highest level of shared cache by looking for cpus
>> +        * backpointed to by shared level N caches.
>>          */
>>         mdesc_for_each_node_by_name(hp, mp, "cache") {
>>                 const u64 *cur_lvl;
>> @@ -710,7 +720,7 @@ static int set_sock_ids_by_cache(struct mdesc_handle
>> *hp, int level)
>>                 if (*cur_lvl != level)
>>                         continue;
>>   -             mark_sock_ids(hp, mp, idx);
>> +               mark_max_cache_ids(hp, mp, idx);
>>                 idx++;
>>                 fnd = 1;
>>         }
>> @@ -745,15 +755,17 @@ static void set_sock_ids(struct mdesc_handle *hp)
>>   {
>>         u64 mp;
>>   +     /* Find the highest level of shared cache which on pre-T7 is also
>> +        * the socket.
>> +        */
>> +       if (!set_max_cache_ids_by_cache(hp, 3))
>> +               set_max_cache_ids_by_cache(hp, 2);
>> +
>>         /* If machine description exposes sockets data use it.
>> -        * Otherwise fallback to use shared L3 or L2 caches.
>>          */
>>         mp = mdesc_node_by_name(hp, MDESC_NODE_NULL, "sockets");
>>         if (mp != MDESC_NODE_NULL)
>> -               return set_sock_ids_by_socket(hp, mp);
>> -
>> -       if (!set_sock_ids_by_cache(hp, 3))
>> -               set_sock_ids_by_cache(hp, 2);
>> +               set_sock_ids_by_socket(hp, mp);
>>   }
>>    static void mark_proc_ids(struct mdesc_handle *hp, u64 mp, int proc_id)
>> diff --git a/arch/sparc/kernel/smp_64.c b/arch/sparc/kernel/smp_64.c
>> index 8a6151a..bbe27a4 100644
>> --- a/arch/sparc/kernel/smp_64.c
>> +++ b/arch/sparc/kernel/smp_64.c
>> @@ -1250,8 +1250,11 @@ void smp_fill_in_sib_core_maps(void)
>>         for_each_present_cpu(i)  {
>>                 unsigned int j;
>>   +             cpumask_clear(&cpu_core_sib_map[i]);
>> +
>>                 for_each_present_cpu(j)  {
>> -                       if (cpu_data(i).sock_id == cpu_data(j).sock_id)
>> +                       if (cpu_data(i).max_cache_id ==
>> +                           cpu_data(j).max_cache_id)
>>                                 cpumask_set_cpu(j, &cpu_core_sib_map[i]);
>>                 }
>>         }
>> --
>> To unsubscribe from this list: send the line "unsubscribe sparclinux" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
>
>
Julian Calaby June 7, 2016, 3:52 a.m. UTC | #3
Hi All,

Resending without HTML. Thanks Gmail Android!

On Tue, Jun 7, 2016 at 1:07 PM, Julian Calaby <julian.calaby@gmail.com> wrote:
> Hi Chris,
>
> On 7 Jun 2016 12:11, "chris hyser" <chris.hyser@oracle.com> wrote:
>>
>>
>>
>> On 6/6/2016 8:14 PM, Julian Calaby wrote:
>>>
>>> Hi Chris,
>>>
>>> On Tue, Jun 7, 2016 at 8:23 AM, chris hyser <chris.hyser@oracle.com>
>>> wrote:
>>>>
>>>> Before SPARC T7, the notion of core_siblings was both those CPUs that
>>>> share
>>>> a
>>>> common highest level cache and the set of CPUs within a particular
>>>> socket
>>>> (share same package_id). This was also true on older x86 CPUs and
>>>> perhaps
>>>> most
>>>> recent though my knowledge of x86 is dated.
>>>>
>>>> The idea of same package_id is stated in Documentation/cputopology.txt
>>>> and
>>>> programs such as lscpu have relied upon this to find the number of
>>>> sockets
>>>> by
>>>> counting the number of unique core_siblings_list entries. I suspect the
>>>> reliance
>>>> on that algorithm predates the ability to read package IDs directly
>>>> which is
>>>> simpler, more straightforward and preserves the platform assigned
>>>> package ID
>>>> versus an ID that is just an incremented index based on order of
>>>> discovery.
>>>>
>>>> The idea that it needs to represent shared common highest level cache
>>>> comes
>>>> from irqbalance, an important run-time performance enhancing daemon.
>>>>
>>>> irqbalance uses the following hierarchy of locality goodness:
>>>>
>>>>          - shared common core (thread_siblings)
>>>>          - shared common cache (core_siblings)
>>>>          - shared common socket (CPUs with same physical_package_id)
>>>>          - shared common node (CPUS in same node)
>>>>
>>>> This layout perfectly describes the T7 and interestingly suggests that
>>>> there
>>>> are
>>>> one or more other architectures that have reached the point where enough
>>>> cores
>>>> can be jammed into the same package that a shared high level cache is
>>>> either
>>>> not
>>>> desirable or not worth the real estate/effort. Said differently, socket
>>>> in
>>>> the
>>>> future will likely become less synonymous with shared cache and instead
>>>> more
>>>> synonymous with node. I'm still digging to see if and what those
>>>> architectures
>>>> are.
>>>>
>>>> The issue is that on newer SPARC HW both definitions can no longer be
>>>> true
>>>> and
>>>> that choosing one versus the other will break differing sets of code.
>>>> This
>>>> can
>>>> be illustrated as a choice between an unmodified lscpu spitting out
>>>> nonsensical
>>>> answers (although it currently can do that for different unrelated
>>>> reasons)
>>>> or
>>>> an unmodified irqbalance incorrectly making cache-thrashing decisions.
>>>> The
>>>> number of important programs in each class is unknown, but either way
>>>> some
>>>> things will have to be fixed. As I believe the whole point of large
>>>> SPARC
>>>> servers is performance and the goal of the people on the SPARC mailing
>>>> list
>>>> is
>>>> to maximize SPARC linux performance, I would argue for not breaking what
>>>> I
>>>> would call the performance class of programs versus the topology
>>>> description
>>>> class.
>>>>
>>>> Rationale:
>>>>
>>>> - performance class breakage is harder to diagnose as it results in lost
>>>> performance and tracing back to root cause is incredibly difficult.
>>>> Topology
>>>> description programs on the other hand spit out easily identified
>>>> nonsense
>>>> and can be modified in a manner that is actually more straight forward
>>>> than
>>>> the current algorithm while preserving architecturally neutral
>>>> functional
>>>> correctness (i.e. not hacks/workarounds)
>>>>
>>>> Attached is a working sparc64 patch for redefinition in favor of "shared
>>>> highest level cache" (not intended in its current form for actual
>>>> upstream
>>>> submission but to clarify the proposal and allow actual testing). I'm
>>>> seeking
>>>> feedback on how to proceed here to prevent wasted effort fixing the
>>>> wrong
>>>> set
>>>> of user land programs and related in-progress patches for SPARC sysfs.
>>>>
>>>> Example results of patch:
>>>>
>>>> Before:
>>>>          [root@ca-sparc30 topology]# cat core_siblings_list
>>>>          32-63,128-223
>>>>
>>>> After:
>>>>          [root@ca-sparc30 topology]# cat core_siblings_list
>>>>          32-63
>>>>
>>>> diff --git a/arch/sparc/kernel/mdesc.c b/arch/sparc/kernel/mdesc.c
>>>> index 1122886..e1b3893 100644
>>>> --- a/arch/sparc/kernel/mdesc.c
>>>> +++ b/arch/sparc/kernel/mdesc.c
>>>>   @@ -597,20 +598,21 @@ static void fill_in_one_cache(cpuinfo_sparc *c,
>>>> struct mdesc_handle *hp, u64 mp)
>>>>                 c->ecache_line_size = *line_size;
>>>>                 break;
>>>>   +     case 3:
>>>
>>>
>>> Is your patch mangled?
>>
>>
>> Apparently. I had to do this is in a weird way. I tried to be extra
>> careful. Let me try again. Apologies.
>
> You should be using git-send-email unless there's something weird about your
> email setup.
>
> There's documentation on how to set up most common email clients in the
> Documentation directory of the kernel tree.
>
> Thanks,
>
> Julian Calaby

Patch

diff --git a/arch/sparc/include/asm/cpudata_64.h b/arch/sparc/include/asm/cpudata_64.h
index a6cfdab..2b4e384 100644
--- a/arch/sparc/include/asm/cpudata_64.h
+++ b/arch/sparc/include/asm/cpudata_64.h
@@ -19,14 +19,19 @@  typedef struct {
    	/* Dcache line 2, rarely used */
   	unsigned int	dcache_size;
-	unsigned int	dcache_line_size;
   	unsigned int	icache_size;
-	unsigned int	icache_line_size;
   	unsigned int	ecache_size;
-	unsigned int	ecache_line_size;
-	unsigned short	sock_id;
+	unsigned int	l3_cache_size;
+
+	unsigned short	icache_line_size;
+	unsigned short	dcache_line_size;
+	unsigned short	ecache_line_size;
+	unsigned short	l3_cache_line_size;
+
+	unsigned short	sock_id;	/* physical package */
   	unsigned short	core_id;
-	int		proc_id;
+	unsigned short	max_cache_id;	/* groupings of highest shared cache */
+	unsigned short	proc_id;	/* strand (aka HW thread) id */
   } cpuinfo_sparc;
    DECLARE_PER_CPU(cpuinfo_sparc, __cpu_data);
diff --git a/arch/sparc/include/asm/topology_64.h b/arch/sparc/include/asm/topology_64.h
index bec481a..6f98d4e 100644
--- a/arch/sparc/include/asm/topology_64.h
+++ b/arch/sparc/include/asm/topology_64.h
@@ -41,7 +41,7 @@  int __node_distance(int, int);
   #endif /* !(CONFIG_NUMA) */
    #ifdef CONFIG_SMP
-#define topology_physical_package_id(cpu)	(cpu_data(cpu).proc_id)
+#define topology_physical_package_id(cpu)	(cpu_data(cpu).sock_id)
   #define topology_core_id(cpu)			(cpu_data(cpu).core_id)
   #define topology_core_cpumask(cpu)		(&cpu_core_sib_map[cpu])
   #define topology_sibling_cpumask(cpu)		(&per_cpu(cpu_sibling_map, cpu))
diff --git a/arch/sparc/kernel/mdesc.c b/arch/sparc/kernel/mdesc.c
index 1122886..e1b3893 100644
--- a/arch/sparc/kernel/mdesc.c
+++ b/arch/sparc/kernel/mdesc.c
@@ -578,6 +578,7 @@  static void fill_in_one_cache(cpuinfo_sparc *c, struct mdesc_handle *hp, u64 mp)
   	const u64 *line_size = mdesc_get_property(hp, mp, "line-size", NULL);
   	const char *type;
   	int type_len;
+	u64 a;
    	type = mdesc_get_property(hp, mp, "type", &type_len);
   @@ -597,20 +598,21 @@ static void fill_in_one_cache(cpuinfo_sparc *c, struct mdesc_handle *hp, u64 mp)
   		c->ecache_line_size = *line_size;
   		break;
   +	case 3:
+		c->l3_cache_size = *size;
+		c->l3_cache_line_size = *line_size;
+		break;
+
   	default:
   		break;
   	}
   -	if (*level == 1) {
-		u64 a;
-
-		mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
-			u64 target = mdesc_arc_target(hp, a);
-			const char *name = mdesc_node_name(hp, target);
+	mdesc_for_each_arc(a, hp, mp, MDESC_ARC_TYPE_FWD) {
+		u64 target = mdesc_arc_target(hp, a);
+		const char *name = mdesc_node_name(hp, target);
   -			if (!strcmp(name, "cache"))
-				fill_in_one_cache(c, hp, target);
-		}
+		if (!strcmp(name, "cache"))
+			fill_in_one_cache(c, hp, target);
   	}
   }
   @@ -645,13 +647,19 @@ static void __mark_core_id(struct mdesc_handle *hp, u64 node,
   		cpu_data(*id).core_id = core_id;
   }
   -static void __mark_sock_id(struct mdesc_handle *hp, u64 node,
-			   int sock_id)
+static void __mark_max_cache_id(struct mdesc_handle *hp, u64 node,
+				int max_cache_id)
   {
   	const u64 *id = mdesc_get_property(hp, node, "id", NULL);
   -	if (*id < num_possible_cpus())
-		cpu_data(*id).sock_id = sock_id;
+	if (*id < num_possible_cpus()) {
+		cpu_data(*id).max_cache_id = max_cache_id;
+
+		/* On systems without explicit socket descriptions, socket
+		 * is max_cache_id
+		 */
+		cpu_data(*id).sock_id = max_cache_id;
+	}
   }
    static void mark_core_ids(struct mdesc_handle *hp, u64 mp,
@@ -660,10 +668,11 @@  static void mark_core_ids(struct mdesc_handle *hp, u64 mp,
   	find_back_node_value(hp, mp, "cpu", __mark_core_id, core_id, 10);
   }
   -static void mark_sock_ids(struct mdesc_handle *hp, u64 mp,
-			  int sock_id)
+static void mark_max_cache_ids(struct mdesc_handle *hp, u64 mp,
+			       int max_cache_id)
   {
-	find_back_node_value(hp, mp, "cpu", __mark_sock_id, sock_id, 10);
+	find_back_node_value(hp, mp, "cpu", __mark_max_cache_id,
+			     max_cache_id, 10);
   }
    static void set_core_ids(struct mdesc_handle *hp)
@@ -694,14 +703,15 @@  static void set_core_ids(struct mdesc_handle *hp)
   	}
   }
   -static int set_sock_ids_by_cache(struct mdesc_handle *hp, int level)
+static int set_max_cache_ids_by_cache(struct mdesc_handle *hp,
+				      int level)
   {
   	u64 mp;
   	int idx = 1;
   	int fnd = 0;
   -	/* Identify unique sockets by looking for cpus backpointed to by
-	 * shared level n caches.
+	/* Identify unique highest level of shared cache by looking for cpus
+	 * backpointed to by shared level N caches.
   	 */
   	mdesc_for_each_node_by_name(hp, mp, "cache") {
   		const u64 *cur_lvl;
@@ -710,7 +720,7 @@  static int set_sock_ids_by_cache(struct mdesc_handle *hp, int level)
   		if (*cur_lvl != level)
   			continue;
   -		mark_sock_ids(hp, mp, idx);
+		mark_max_cache_ids(hp, mp, idx);
   		idx++;
   		fnd = 1;
   	}
@@ -745,15 +755,17 @@  static void set_sock_ids(struct mdesc_handle *hp)
   {
   	u64 mp;
   +	/* Find the highest level of shared cache which on pre-T7 is also
+	 * the socket.
+	 */
+	if (!set_max_cache_ids_by_cache(hp, 3))
+		set_max_cache_ids_by_cache(hp, 2);
+
   	/* If machine description exposes sockets data use it.
-	 * Otherwise fallback to use shared L3 or L2 caches.
   	 */
   	mp = mdesc_node_by_name(hp, MDESC_NODE_NULL, "sockets");
   	if (mp != MDESC_NODE_NULL)
-		return set_sock_ids_by_socket(hp, mp);
-
-	if (!set_sock_ids_by_cache(hp, 3))
-		set_sock_ids_by_cache(hp, 2);
+		set_sock_ids_by_socket(hp, mp);
   }
    static void mark_proc_ids(struct mdesc_handle *hp, u64 mp, int proc_id)
diff --git a/arch/sparc/kernel/smp_64.c b/arch/sparc/kernel/smp_64.c
index 8a6151a..bbe27a4 100644
--- a/arch/sparc/kernel/smp_64.c
+++ b/arch/sparc/kernel/smp_64.c
@@ -1250,8 +1250,11 @@  void smp_fill_in_sib_core_maps(void)
   	for_each_present_cpu(i)  {
   		unsigned int j;
   +		cpumask_clear(&cpu_core_sib_map[i]);
+
   		for_each_present_cpu(j)  {
-			if (cpu_data(i).sock_id == cpu_data(j).sock_id)
+			if (cpu_data(i).max_cache_id ==
+			    cpu_data(j).max_cache_id)
   				cpumask_set_cpu(j, &cpu_core_sib_map[i]);
   		}
   	}