From patchwork Thu May 14 17:20:37 2015 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: Paul Eggert X-Patchwork-Id: 472444 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Received: from sourceware.org (server1.sourceware.org [209.132.180.131]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 0A86514027C for ; Fri, 15 May 2015 03:20:49 +1000 (AEST) Authentication-Results: ozlabs.org; dkim=pass (1024-bit key; unprotected) header.d=sourceware.org header.i=@sourceware.org header.b=s9BRw+kr; dkim-atps=neutral DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type; q=dns; s=default; b=cjKGS1 seJ3wFMtQ10ONeFMhLJqlz8yUpG+r2+D+rSuQ+D4Q87GN89ay8NPHGJb6xs/8Z5V +QK1utPmjKgYCmUsmW4Gqmh0dqpx9mIW9bsrdvUaThlpj3PiMXwXIkZeBcBlYHnX ojLKBZwwMe+ovsgUyblwSpZyHzfgvpg27s/uw= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type; s=default; bh=CXu9fYZC6m3S 4yWrWuQsZN4FuD4=; b=s9BRw+krcqeTrURyZTq1l4xvzeO5bZN7EL0ZoiJ8HKX7 CL53650QOUFkYP+leqQBxTy9mCIc4IblD2l8W+99Z1Y0aCsDxJFjuuknHBt3fexp n8AFiJGbOl09REGSk7TphoNqLP9E49OJ8KK3F/njlla0j3kx6v5MSasbPcq8L9g= Received: (qmail 17061 invoked by alias); 14 May 2015 17:20:42 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: libc-alpha-owner@sourceware.org Delivered-To: mailing list libc-alpha@sourceware.org Received: (qmail 17052 invoked by uid 89); 14 May 2015 17:20:41 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-3.1 required=5.0 tests=AWL, BAYES_00, SPF_PASS, T_RP_MATCHES_RCVD autolearn=ham version=3.3.2 X-HELO: smtp.cs.ucla.edu Message-ID: <5554D965.8050100@cs.ucla.edu> Date: Thu, 14 May 2015 10:20:37 -0700 From: Paul Eggert User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.6.0 MIME-Version: 1.0 To: Carlos O'Donell , GNU C Library Subject: Re: [PATCH] Remove obsolete aliases that broke 'locale -a' References: <55544398.2030007@cs.ucla.edu> <5554CAAC.20508@redhat.com> In-Reply-To: <5554CAAC.20508@redhat.com> On 05/14/2015 09:17 AM, Carlos O'Donell wrote: > This looks like your ChangeLog entry is not in UTF-8? No, it's a problem in transmission. The actual patch is a mixture of UTF-8 and Latin-1 encoding. I sent it via Thunderbird, which doesn't know its encoding and says only "Content-Type: text/x-patch" without specifying a character set for the attachment. The body of the attachment, generated by Git, says "Content-Type: text/plain; charset=UTF-8" which is not correct, as the attachment body uses some UTF-8 and some Latin-1. If I save the attachment into a file FOO, and edit FOO with Emacs, Emacs ignores the charset declaration and infers Latin-1 since FOO can't possibly be UTF-8, which botches display of the UTF-characters (the botch you noticed). If I then type 'C-x RET r U T F - 8 RET y e s RET', Emacs will show the patch as UTF-8 instead of Latin-1, which will fix the botch you noticed but will mess up display of the old contents of locale.aliases. What a mess, huh? > Could you please convert this file to UTF-8 instead of ISO-8859? > Sure, revised patch attached. It'll have the same transmission problems as before, and if you force Emacs to read it in UTF-8 only the old lines will be botched. From 442fe29ef8cb66f8a1c012705a75b6d9a57111ce Mon Sep 17 00:00:00 2001 From: Paul Eggert Date: Thu, 14 May 2015 10:18:11 -0700 Subject: [PATCH] Remove obsolete aliases that broke 'locale -a' MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit [BZ #18412] * intl/locale.alias: Remove obsolete aliases "bokmÃ¥l" and "français" which caused 'locale -a' to output Latin-1 data in UTF-8 locales, breaking some applications that use 'locale -a' output. Change the encoding of this file from Latin-1 to UTF-8 (this affects only the file's comments now). --- ChangeLog | 7 +++++++ intl/locale.alias | 9 +++++++-- 2 files changed, 14 insertions(+), 2 deletions(-) diff --git a/ChangeLog b/ChangeLog index e89c14e..2346fcf 100644 --- a/ChangeLog +++ b/ChangeLog @@ -1,3 +1,10 @@ +2015-05-14 Paul Eggert + + [BZ #18412] + * intl/locale.alias: Remove obsolete aliases "bokmÃ¥l" and "français" + which caused 'locale -a' to output Latin-1 data in UTF-8 locales, + breaking some applications that use 'locale -a' output. + 2015-05-14 Andrew Senkevich * Makeconfig (rpath-dirs, all-subdirs): Added mathvec folder. diff --git a/intl/locale.alias b/intl/locale.alias index ab1cb7a..ca4e777 100644 --- a/intl/locale.alias +++ b/intl/locale.alias @@ -24,8 +24,14 @@ # backward compatibility. Nobody should rely on the names defined here. # Locales should always be specified by their full name. +# Note: This file used to contain the following lines encoded in Latin-1: +# bokmÃ¥l nb_NO.ISO-8859-1 +# français fr_FR.ISO-8859-1 +# These lines were removed because they caused 'locale -a' to output +# text encoded in Latin-1, which broke applications in UTF-8 locales. See: +# https://sourceware.org/bugzilla/show_bug.cgi?id=18412 + bokmal nb_NO.ISO-8859-1 -bokmål nb_NO.ISO-8859-1 catalan ca_ES.ISO-8859-1 croatian hr_HR.ISO-8859-2 czech cs_CZ.ISO-8859-2 @@ -36,7 +42,6 @@ dutch nl_NL.ISO-8859-1 eesti et_EE.ISO-8859-1 estonian et_EE.ISO-8859-1 finnish fi_FI.ISO-8859-1 -français fr_FR.ISO-8859-1 french fr_FR.ISO-8859-1 galego gl_ES.ISO-8859-1 galician gl_ES.ISO-8859-1 -- 2.1.0