From patchwork Sat Jul 20 20:01:47 2019
X-Patchwork-Submitter: Rafal Luzynski
X-Patchwork-Id: 1134479
Date: Sat, 20 Jul 2019 22:01:47 +0200 (CEST)
From: Rafal Luzynski
To: libc-alpha@sourceware.org
Message-ID: <904613472.72769.1563652907282@poczta.nazwa.pl>
Subject: [PATCH COMMITTED] locale/C-translit.h.in: Cyrillic -> ASCII transliteration [BZ #2872]
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 7bit

For the record, this is the patch I have just pushed to master. The
content is exactly the same as Egor's v12 patch; the only minor changes
are a reworded commit message and an added ChangeLog entry.

I am not closing the bug in Bugzilla yet because there may be a few
minor updates (e.g., should we add a NEWS entry? For now I lean toward
saying no.)

--- 8< ---
From: Egor Kobylkin
Date: Wed, 2 Jan 2019 05:50:13 +0100
Subject: [PATCH] locale/C-translit.h.in: Cyrillic -> ASCII transliteration [BZ #2872]

This patch adds a Cyrillic to plain ASCII transliteration table,
following the GOST 7.79-2000 System B standard, to the C locale.

[BZ #2872]
* locale/C-translit.h.in: Add Cyrillic transliteration.
---
 ChangeLog              |   5 ++
 locale/C-translit.h.in | 169 +++++++++++++++++++++++++++++++++++++++++
 2 files changed, 174 insertions(+)

diff --git a/ChangeLog b/ChangeLog
index a606c5fd60..a1fdef9cff 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,3 +1,8 @@
+2019-07-20 Egor Kobylkin
+
+ [BZ #2872]
+ * locale/C-translit.h.in: Add Cyrillic transliteration.
+
 2019-07-19 Florian Weimer
 
  * sysdeps/unix/sysv/linux/syscall-names.list: Add system calls
diff --git a/locale/C-translit.h.in b/locale/C-translit.h.in
index d5f00df0f3..758171c394 100644
--- a/locale/C-translit.h.in
+++ b/locale/C-translit.h.in
@@ -56,6 +56,175 @@
 "\x02cd" "_" # MODIFIER LETTER LOW MACRON
 "\x02d0" ":" # MODIFIER LETTER TRIANGULAR COLON
 "\x02dc" "~" # SMALL TILDE
+"\x0401" "YO" # CYRILLIC CAPITAL LETTER IO
+"\x0402" "DJ" # CYRILLIC CAPITAL LETTER DJE
+"\x0403" "G`" # CYRILLIC CAPITAL LETTER GJE
+"\x0404" "YE" # CYRILLIC CAPITAL LETTER UKRAINIAN IE
+"\x0405" "Z`" # CYRILLIC CAPITAL LETTER DZE
+"\x0406" "I" # CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
+"\x0407" "YI" # CYRILLIC CAPITAL LETTER YI
+"\x0408" "J" # CYRILLIC CAPITAL LETTER JE
+"\x0409" "L`" # CYRILLIC CAPITAL LETTER LJE
+"\x040a" "N`" # CYRILLIC CAPITAL LETTER NJE
+"\x040b" "TSH" # CYRILLIC CAPITAL LETTER TSHE
+"\x040c" "K`" # CYRILLIC CAPITAL LETTER KJE
+"\x040e" "U`" # CYRILLIC CAPITAL LETTER SHORT U
+"\x040f" "DH" # CYRILLIC CAPITAL LETTER DZHE
+"\x0410" "A" # CYRILLIC CAPITAL LETTER A
+"\x0411" "B" # CYRILLIC CAPITAL LETTER BE
+"\x0412" "V" # CYRILLIC CAPITAL LETTER VE
+"\x0413" "G" # CYRILLIC CAPITAL LETTER GHE
+"\x0414" "D" # CYRILLIC CAPITAL LETTER DE
+"\x0415" "E" # CYRILLIC CAPITAL LETTER IE
+"\x0416" "ZH" # CYRILLIC CAPITAL LETTER ZHE
+"\x0417" "Z" # CYRILLIC CAPITAL LETTER ZE
+"\x0418" "I" # CYRILLIC CAPITAL LETTER I
+"\x0419" "J" # CYRILLIC CAPITAL LETTER SHORT I
+"\x041a" "K" # CYRILLIC CAPITAL LETTER KA
+"\x041b" "L" # CYRILLIC CAPITAL LETTER EL
+"\x041c" "M" # CYRILLIC CAPITAL LETTER EM
+"\x041d" "N" # CYRILLIC CAPITAL LETTER EN
+"\x041e" "O" # CYRILLIC CAPITAL LETTER O
+"\x041f" "P" # CYRILLIC CAPITAL LETTER PE
+"\x0420" "R" # CYRILLIC CAPITAL LETTER ER
+"\x0421" "S" # CYRILLIC CAPITAL LETTER ES
+"\x0422" "T" # CYRILLIC CAPITAL LETTER TE
+"\x0423" "U" # CYRILLIC CAPITAL LETTER U
+"\x0424" "F" # CYRILLIC CAPITAL LETTER EF
+"\x0425" "X" # CYRILLIC CAPITAL LETTER HA
+"\x0426" "CZ" # CYRILLIC CAPITAL LETTER TSE
+"\x0427" "CH" # CYRILLIC CAPITAL LETTER CHE
+"\x0428" "SH" # CYRILLIC CAPITAL LETTER SHA
+"\x0429" "SHH" # CYRILLIC CAPITAL LETTER SHCHA
+"\x042a" "A`" # CYRILLIC CAPITAL LETTER HARD SIGN
+"\x042b" "Y`" # CYRILLIC CAPITAL LETTER YERU
+"\x042c" "`" # CYRILLIC CAPITAL LETTER SOFT SIGN
+"\x042d" "E`" # CYRILLIC CAPITAL LETTER E
+"\x042e" "YU" # CYRILLIC CAPITAL LETTER YU
+"\x042f" "YA" # CYRILLIC CAPITAL LETTER YA
+"\x0430" "a" # CYRILLIC SMALL LETTER A
+"\x0431" "b" # CYRILLIC SMALL LETTER BE
+"\x0432" "v" # CYRILLIC SMALL LETTER VE
+"\x0433" "g" # CYRILLIC SMALL LETTER GHE
+"\x0434" "d" # CYRILLIC SMALL LETTER DE
+"\x0435" "e" # CYRILLIC SMALL LETTER IE
+"\x0436" "zh" # CYRILLIC SMALL LETTER ZHE
+"\x0437" "z" # CYRILLIC SMALL LETTER ZE
+"\x0438" "i" # CYRILLIC SMALL LETTER I
+"\x0439" "j" # CYRILLIC SMALL LETTER SHORT I
+"\x043a" "k" # CYRILLIC SMALL LETTER KA
+"\x043b" "l" # CYRILLIC SMALL LETTER EL
+"\x043c" "m" # CYRILLIC SMALL LETTER EM
+"\x043d" "n" # CYRILLIC SMALL LETTER EN
+"\x043e" "o" # CYRILLIC SMALL LETTER O
+"\x043f" "p" # CYRILLIC SMALL LETTER PE
+"\x0440" "r" # CYRILLIC SMALL LETTER ER
+"\x0441" "s" # CYRILLIC SMALL LETTER ES
+"\x0442" "t" # CYRILLIC SMALL LETTER TE
+"\x0443" "u" # CYRILLIC SMALL LETTER U
+"\x0444" "f" # CYRILLIC SMALL LETTER EF
+"\x0445" "x" # CYRILLIC SMALL LETTER HA
+"\x0446" "cz" # CYRILLIC SMALL LETTER TSE
+"\x0447" "ch" # CYRILLIC SMALL LETTER CHE
+"\x0448" "sh" # CYRILLIC SMALL LETTER SHA
+"\x0449" "shh" # CYRILLIC SMALL LETTER SHCHA
+"\x044a" "``" # CYRILLIC SMALL LETTER HARD SIGN
+"\x044b" "y`" # CYRILLIC SMALL LETTER YERU
+"\x044c" "`" # CYRILLIC SMALL LETTER SOFT SIGN
+"\x044d" "e`" # CYRILLIC SMALL LETTER E
+"\x044e" "yu" # CYRILLIC SMALL LETTER YU
+"\x044f" "ya" # CYRILLIC SMALL LETTER YA
+"\x0451" "yo" # CYRILLIC SMALL LETTER IO
+"\x0452" "dj" # CYRILLIC SMALL LETTER DJE
+"\x0453" "g`" # CYRILLIC SMALL LETTER GJE
+"\x0454" "ye" # CYRILLIC SMALL LETTER UKRAINIAN IE
+"\x0455" "z`" # CYRILLIC SMALL LETTER DZE
+"\x0456" "i" # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
+"\x0457" "yi" # CYRILLIC SMALL LETTER YI
+"\x0458" "j" # CYRILLIC SMALL LETTER JE
+"\x0459" "l`" # CYRILLIC SMALL LETTER LJE
+"\x045a" "n`" # CYRILLIC SMALL LETTER NJE
+"\x045b" "tsh" # CYRILLIC SMALL LETTER TSHE
+"\x045c" "k`" # CYRILLIC SMALL LETTER KJE
+"\x045e" "u`" # CYRILLIC SMALL LETTER SHORT U
+"\x045f" "dh" # CYRILLIC SMALL LETTER DZHE
+"\x046a" "O`" # CYRILLIC CAPITAL LETTER BIG YUS
+"\x046b" "o`" # CYRILLIC SMALL LETTER BIG YUS
+"\x0472" "FH" # CYRILLIC CAPITAL LETTER FITA
+"\x0473" "fh" # CYRILLIC SMALL LETTER FITA
+"\x0474" "YH" # CYRILLIC CAPITAL LETTER IZHITSA
+"\x0475" "yh" # CYRILLIC SMALL LETTER IZHITSA
+"\x048c" "E`" # CYRILLIC CAPITAL LETTER SEMISOFT SIGN
+"\x048d" "e`" # CYRILLIC SMALL LETTER SEMISOFT SIGN
+"\x0490" "G`" # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
+"\x0491" "g`" # CYRILLIC SMALL LETTER GHE WITH UPTURN
+"\x0492" "GH" # CYRILLIC CAPITAL LETTER GHE WITH STROKE
+"\x0493" "gh" # CYRILLIC SMALL LETTER GHE WITH STROKE
+"\x0494" "GH" # CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK
+"\x0495" "gh" # CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
+"\x0496" "ZH`" # CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER
+"\x0497" "zh`" # CYRILLIC SMALL LETTER ZHE WITH DESCENDER
+"\x049a" "K`" # CYRILLIC CAPITAL LETTER KA WITH DESCENDER
+"\x049b" "k`" # CYRILLIC SMALL LETTER KA WITH DESCENDER
+"\x049e" "K`" # CYRILLIC CAPITAL LETTER KA WITH STROKE
+"\x049f" "k`" # CYRILLIC SMALL LETTER KA WITH STROKE
+"\x04a2" "N`" # CYRILLIC CAPITAL LETTER EN WITH DESCENDER
+"\x04a3" "n`" # CYRILLIC SMALL LETTER EN WITH DESCENDER
+"\x04a4" "NG" # CYRILLIC CAPITAL LIGATURE EN GHE
+"\x04a5" "ng" # CYRILLIC SMALL LIGATURE EN GHE
+"\x04a6" "P`" # CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK
+"\x04a7" "p`" # CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK
+"\x04a8" "O`" # CYRILLIC CAPITAL LETTER ABKHASIAN HA
+"\x04a9" "o`" # CYRILLIC SMALL LETTER ABKHASIAN HA
+"\x04aa" "C`" # CYRILLIC CAPITAL LETTER ES WITH DESCENDER
+"\x04ab" "C`" # CYRILLIC SMALL LETTER ES WITH DESCENDER
+"\x04ac" "T`" # CYRILLIC CAPITAL LETTER TE WITH DESCENDER
+"\x04ad" "t`" # CYRILLIC SMALL LETTER TE WITH DESCENDER
+"\x04ae" "U" # CYRILLIC CAPITAL LETTER STRAIGHT U
+"\x04af" "u" # CYRILLIC SMALL LETTER STRAIGHT U
+"\x04b2" "H`" # CYRILLIC CAPITAL LETTER HA WITH DESCENDER
+"\x04b3" "h`" # CYRILLIC SMALL LETTER HA WITH DESCENDER
+"\x04b4" "TCZ" # CYRILLIC CAPITAL LIGATURE TE TSE
+"\x04b5" "tcz" # CYRILLIC SMALL LIGATURE TE TSE
+"\x04ba" "SH`" # CYRILLIC CAPITAL LETTER SHHA
+"\x04bb" "sh`" # CYRILLIC SMALL LETTER SHHA
+"\x04bc" "CH`" # CYRILLIC CAPITAL LETTER ABKHASIAN CHE
+"\x04bd" "ch`" # CYRILLIC SMALL LETTER ABKHASIAN CHE
+"\x04be" "CH`" # CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH DESCENDER
+"\x04bf" "ch`" # CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DESCENDER
+"\x04c0" "i" # CYRILLIC LETTER PALOCHKA
+"\x04c1" "ZH`" # CYRILLIC CAPITAL LETTER ZHE WITH BREVE
+"\x04c2" "zh`" # CYRILLIC SMALL LETTER ZHE WITH BREVE
+"\x04cb" "CH`" # CYRILLIC CAPITAL LETTER KHAKASSIAN CHE
+"\x04cc" "ch`" # CYRILLIC SMALL LETTER KHAKASSIAN CHE
+"\x04d0" "A`" # CYRILLIC CAPITAL LETTER A WITH BREVE
+"\x04d1" "a`" # CYRILLIC SMALL LETTER A WITH BREVE
+"\x04d2" "A`" # CYRILLIC CAPITAL LETTER A WITH DIAERESIS
+"\x04d3" "a`" # CYRILLIC SMALL LETTER A WITH DIAERESIS
+"\x04d6" "E`" # CYRILLIC CAPITAL LETTER IE WITH BREVE
+"\x04d7" "e`" # CYRILLIC SMALL LETTER IE WITH BREVE
+"\x04d8" "A`" # CYRILLIC CAPITAL LETTER SCHWA
+"\x04d9" "a`" # CYRILLIC SMALL LETTER SCHWA
+"\x04dc" "ZH`" # CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS
+"\x04dd" "zh`" # CYRILLIC SMALL LETTER ZHE WITH DIAERESIS
+"\x04de" "Z`" # CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
+"\x04df" "z`" # CYRILLIC SMALL LETTER ZE WITH DIAERESIS
+"\x04e0" "Z`" # CYRILLIC CAPITAL LETTER ABKHASIAN DZE
+"\x04e1" "z`" # CYRILLIC SMALL LETTER ABKHASIAN DZE
+"\x04e4" "I`" # CYRILLIC CAPITAL LETTER I WITH DIAERESIS
+"\x04e5" "i`" # CYRILLIC SMALL LETTER I WITH DIAERESIS
+"\x04e6" "O`" # CYRILLIC CAPITAL LETTER O WITH DIAERESIS
+"\x04e7" "o`" # CYRILLIC SMALL LETTER O WITH DIAERESIS
+"\x04e8" "O`" # CYRILLIC CAPITAL LETTER BARRED O
+"\x04e9" "o`" # CYRILLIC SMALL LETTER BARRED O
+"\x04f0" "U`" # CYRILLIC CAPITAL LETTER U WITH DIAERESIS
+"\x04f1" "u`" # CYRILLIC SMALL LETTER U WITH DIAERESIS
+"\x04f2" "U`" # CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE
+"\x04f3" "u`" # CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE
+"\x04f4" "CH`" # CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS
+"\x04f5" "ch`" # CYRILLIC SMALL LETTER CHE WITH DIAERESIS
+"\x04f8" "Y`" # CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS
+"\x04f9" "y`" # CYRILLIC SMALL LETTER YERU WITH DIAERESIS
 "\x2002" " " # EN SPACE
 "\x2003" " " # EM SPACE
 "\x2004" " " # THREE-PER-EM SPACE