From patchwork Wed Oct 11 18:50:01 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Vijay Kumar X-Patchwork-Id: 824512 Return-Path: X-Original-To: patchwork-incoming@ozlabs.org Delivered-To: patchwork-incoming@ozlabs.org Authentication-Results: ozlabs.org; spf=none (mailfrom) smtp.mailfrom=vger.kernel.org (client-ip=209.132.180.67; helo=vger.kernel.org; envelope-from=sparclinux-owner@vger.kernel.org; receiver=) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by ozlabs.org (Postfix) with ESMTP id 3yC33k2WsXz9sBd for ; Thu, 12 Oct 2017 05:51:30 +1100 (AEDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752714AbdJKSv3 (ORCPT ); Wed, 11 Oct 2017 14:51:29 -0400 Received: from aserp1040.oracle.com ([141.146.126.69]:49033 "EHLO aserp1040.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753334AbdJKSuZ (ORCPT ); Wed, 11 Oct 2017 14:50:25 -0400 Received: from aserv0022.oracle.com (aserv0022.oracle.com [141.146.126.234]) by aserp1040.oracle.com (Sentrion-MTA-4.3.2/Sentrion-MTA-4.3.2) with ESMTP id v9BIoG2k021063 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 11 Oct 2017 18:50:16 GMT Received: from aserv0121.oracle.com (aserv0121.oracle.com [141.146.126.235]) by aserv0022.oracle.com (8.14.4/8.14.4) with ESMTP id v9BIoGgJ001736 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 11 Oct 2017 18:50:16 GMT Received: from abhmp0017.oracle.com (abhmp0017.oracle.com [141.146.116.23]) by aserv0121.oracle.com (8.14.4/8.13.8) with ESMTP id v9BIoFO6019015; Wed, 11 Oct 2017 18:50:16 GMT Received: from ca-sparc60.us.oracle.com (/10.147.24.150) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Wed, 11 Oct 2017 11:50:15 -0700 From: Vijay Kumar To: davem@davemloft.net Cc: linux-kernel@vger.kernel.org, sparclinux@vger.kernel.org, babu.moger@oracle.com, rob.gardner@oracle.com, sam@ravnborg.org, anthony.yznaga@oracle.com Subject: [PATCH v4 0/5] sparc64: Optimize fls and __fls Date: Wed, 11 Oct 2017 12:50:01 -0600 Message-Id: <1507747806-10251-1-git-send-email-vijay.ac.kumar@oracle.com> X-Mailer: git-send-email 1.7.1 X-Source-IP: aserv0022.oracle.com [141.146.126.234] Sender: sparclinux-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: sparclinux@vger.kernel.org SPARC provides lzcnt instruction (with VIS3) which can be used to optimize fls, __fls and fls64 functions. For the systems that supports lzcnt instruction, we now do boot time patching to use sparc optimized fls, __fls and fls64 functions. v3->v4: - Fixed a typo. v2->v3: - Using ENTRY(), ENDPROC() for assembler functions. - Removed BITS_PER_LONG from __fls. - Using generic fls64(). - Replaced lzcnt instruction with .word directive. v1->v2: - Fixed delay slot issue. Vijay Kumar (5): sparc64: Define SPARC default fls function sparc64: Define SPARC default __fls function sparc64: SPARC optimized fls function sparc64: SPARC optimized __fls function sparc64: Use sparc optimized fls and __fls for T4 and above arch/sparc/include/asm/bitops_64.h | 5 ++- arch/sparc/kernel/head_64.S | 2 + arch/sparc/lib/Makefile | 3 ++ arch/sparc/lib/NG4fls.S | 30 ++++++++++++++++ arch/sparc/lib/NG4patch.S | 9 +++++ arch/sparc/lib/fls.S | 67 ++++++++++++++++++++++++++++++++++++ arch/sparc/lib/fls64.S | 61 ++++++++++++++++++++++++++++++++ 7 files changed, 175 insertions(+), 2 deletions(-) create mode 100644 arch/sparc/lib/NG4fls.S create mode 100644 arch/sparc/lib/fls.S create mode 100644 arch/sparc/lib/fls64.S Reviewed-by: Babu Moger --- To unsubscribe from this list: send the line "unsubscribe sparclinux" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html