From patchwork Mon Sep 10 15:22:35 2012
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Tejas Belagod
X-Patchwork-Id: 182910
Message-ID: <504E05BB.20101@arm.com>
Date: Mon, 10 Sep 2012 16:22:35 +0100
From: Tejas Belagod
User-Agent: Thunderbird 2.0.0.18 (X11/20081120)
To: "gcc-patches@gcc.gnu.org"
Subject: [Patch][AArch64] Fix Narrowing high shifts.
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: gcc-patches-owner@gcc.gnu.org
Delivered-To: mailing list gcc-patches@gcc.gnu.org

Hi,

The attached patch fixes the assembler templates for rshrn2 and shrn2,
which referenced the wrong operand numbers.

OK?

Thanks,
Tejas Belagod.
ARM.

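For readers less familiar with these instructions, here is a rough portable C sketch of what the vshrn_high_n_u16 macro is meant to compute. It is only an illustration under stated assumptions, not code from arm_neon.h: the names model_shrn2_16to8 and model_vshrn_high_n_u16 are hypothetical, and arrays stand in for vector registers. It also shows why the asm template has exactly three operands. In GCC extended asm, operands are numbered from %0 across outputs and then inputs, so with one "+w" output (result) and two inputs (b_, c) only %0, %1 and %2 exist.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical model of "shrn2 Vd.16b, Vn.8h, #n".
   The low 8 bytes of dst are left untouched (this is why the real asm
   uses a read-write "+w" constraint on operand 0); the high 8 bytes
   receive each 16-bit source lane shifted right by n and truncated. */
static void model_shrn2_16to8(uint8_t dst[16], const uint16_t src[8], int n)
{
    assert(n >= 1 && n <= 8);              /* valid immediate range for 16-bit lanes */
    for (int i = 0; i < 8; i++)
        dst[8 + i] = (uint8_t)(src[i] >> n);
}

/* Hypothetical model of the vshrn_high_n_u16 (a, b, c) macro. */
static void model_vshrn_high_n_u16(uint8_t result[16], const uint8_t a[8],
                                   const uint16_t b[8], int c)
{
    /* result = vcombine_u8 (a_, vcreate_u8 (0)) */
    memcpy(result, a, 8);
    memset(result + 8, 0, 8);
    /* operand %0 = result, %1 = b_, %2 = c */
    model_shrn2_16to8(result, b, c);
}
```

The old template's %3 pointed past the last operand, and %2 named the immediate where a vector register was expected.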
Changelog:

2012-09-10  Tejas Belagod

gcc/
	* config/aarch64/arm_neon.h (vrshrn_high_n_s16, vrshrn_high_n_s32,
	vrshrn_high_n_s64, vrshrn_high_n_u16, vrshrn_high_n_u32,
	vrshrn_high_n_u64, vshrn_high_n_s16, vshrn_high_n_s32,
	vshrn_high_n_s64, vshrn_high_n_u16, vshrn_high_n_u32,
	vshrn_high_n_u64): Fix template to reference correct operands.

diff --git a/gcc/config/aarch64/arm_neon.h b/gcc/config/aarch64/arm_neon.h
index 46abaf6..a4b2e78 100644
--- a/gcc/config/aarch64/arm_neon.h
+++ b/gcc/config/aarch64/arm_neon.h
@@ -15334,7 +15334,7 @@ vrndqp_f64 (float64x2_t a)
        int8x8_t a_ = (a); \
        int8x16_t result = vcombine_s8 \
                             (a_, vcreate_s8 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.16b,%2.8h,#%3" \
+       __asm__ ("rshrn2 %0.16b,%1.8h,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -15348,7 +15348,7 @@ vrndqp_f64 (float64x2_t a)
        int16x4_t a_ = (a); \
        int16x8_t result = vcombine_s16 \
                             (a_, vcreate_s16 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.8h,%2.4s,#%3" \
+       __asm__ ("rshrn2 %0.8h,%1.4s,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -15362,7 +15362,7 @@ vrndqp_f64 (float64x2_t a)
        int32x2_t a_ = (a); \
        int32x4_t result = vcombine_s32 \
                             (a_, vcreate_s32 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.4s,%2.2d,#%3" \
+       __asm__ ("rshrn2 %0.4s,%1.2d,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -15376,7 +15376,7 @@ vrndqp_f64 (float64x2_t a)
        uint8x8_t a_ = (a); \
        uint8x16_t result = vcombine_u8 \
                              (a_, vcreate_u8 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.16b,%2.8h,#%3" \
+       __asm__ ("rshrn2 %0.16b,%1.8h,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -15390,7 +15390,7 @@ vrndqp_f64 (float64x2_t a)
        uint16x4_t a_ = (a); \
        uint16x8_t result = vcombine_u16 \
                              (a_, vcreate_u16 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.8h,%2.4s,#%3" \
+       __asm__ ("rshrn2 %0.8h,%1.4s,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -15404,7 +15404,7 @@ vrndqp_f64 (float64x2_t a)
        uint32x2_t a_ = (a); \
        uint32x4_t result = vcombine_u32 \
                              (a_, vcreate_u32 (UINT64_C (0x0))); \
-       __asm__ ("rshrn2 %0.4s,%2.2d,#%3" \
+       __asm__ ("rshrn2 %0.4s,%1.2d,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16088,7 +16088,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        int8x8_t a_ = (a); \
        int8x16_t result = vcombine_s8 \
                             (a_, vcreate_s8 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.16b,%2.8h,#%3" \
+       __asm__ ("shrn2 %0.16b,%1.8h,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16102,7 +16102,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        int16x4_t a_ = (a); \
        int16x8_t result = vcombine_s16 \
                             (a_, vcreate_s16 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.8h,%2.4s,#%3" \
+       __asm__ ("shrn2 %0.8h,%1.4s,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16116,7 +16116,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        int32x2_t a_ = (a); \
        int32x4_t result = vcombine_s32 \
                             (a_, vcreate_s32 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.4s,%2.2d,#%3" \
+       __asm__ ("shrn2 %0.4s,%1.2d,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16130,7 +16130,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        uint8x8_t a_ = (a); \
        uint8x16_t result = vcombine_u8 \
                              (a_, vcreate_u8 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.16b,%2.8h,#%3" \
+       __asm__ ("shrn2 %0.16b,%1.8h,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16144,7 +16144,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        uint16x4_t a_ = (a); \
        uint16x8_t result = vcombine_u16 \
                              (a_, vcreate_u16 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.8h,%2.4s,#%3" \
+       __asm__ ("shrn2 %0.8h,%1.4s,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \
@@ -16158,7 +16158,7 @@ vrsubhn_u64 (uint64x2_t a, uint64x2_t b)
        uint32x2_t a_ = (a); \
        uint32x4_t result = vcombine_u32 \
                              (a_, vcreate_u32 (UINT64_C (0x0))); \
-       __asm__ ("shrn2 %0.4s,%2.2d,#%3" \
+       __asm__ ("shrn2 %0.4s,%1.2d,#%2" \
                 : "+w"(result) \
                 : "w"(b_), "i"(c) \
                 : /* No clobbers */); \