www.delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2010/06/29/16:16:25

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=-0.2 required=5.0 tests=AWL,BAYES_50,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SARE_SUB_ENC_UTF8,TW_BJ,T_TO_NO_BRKTS_FREEMAIL
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <AANLkTil80itd3rHEYl5eAJ6Bfra-Mw-aNunC04canB1-@mail.gmail.com>
References: <hvn70o$n5q$1 AT dough DOT gmane DOT org> <AANLkTimrIRAaUp-xP8ZUe9ew7Wob3EBiUL-hkv-VjY2a AT mail DOT gmail DOT com> <i0cb2o$m64$1 AT dough DOT gmane DOT org> <AANLkTil80itd3rHEYl5eAJ6Bfra-Mw-aNunC04canB1- AT mail DOT gmail DOT com>
Date: Tue, 29 Jun 2010 21:15:51 +0100
Message-ID: <AANLkTilvvxiKUfTJKuC3Q0ZqXF23dWsDrlpLj8Y2yKef@mail.gmail.com>
Subject: Re: LANG=ru_RU.cp1251, but binutils use UTF-8.
From: Andy Koppe <andy DOT koppe AT gmail DOT com>
To: cygwin AT cygwin DOT com
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

2010/6/29 Csaba Raduly:
> On Tue, Jun 29, 2010 at 10:32 AM, Oleksandr Gavenko wrote:
>> =C2=A0$ objdump --help | head -n 2 | hexl
>> 00000000: d098 d181 d0bf d0be d0bb d18c d0b7 d0be =C2=A0................
> (snipped)
>
> These bytes sure look like UTF-8. According to
> http://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>
> U+0418 CYRILLIC CAPITAL LETTER I character (&#x0418;)
> U+0441 CYRILLIC SMALL LETTER ES character (&#x0441;)
> U+043F CYRILLIC SMALL LETTER PE character (&#x043F;)
> U+043E CYRILLIC SMALL LETTER O character (&#x043E;)
> U+043B CYRILLIC SMALL LETTER EL character (&#x043B;)
> U+044C CYRILLIC SMALL LETTER SOFT SIGN character (&#x044C;)
> U+0437 CYRILLIC SMALL LETTER ZE character (&#x0437;)
> (snipped)
>
> As raw characters:
>
> =D0=98=D1=81=D0=BF=D0=BE=D0=BB=D1=8C=D0=B7=D0=BE=D0=B2=D0=B0=D0=BD=D0=B8=
=D0=B5: objdump <=D0=BE=D0=BF=D1=86=D0=B8=D0=B8> <=D1=84=D0=B0=D0=B9=D0=BB(=
=D1=8B)>
> =C2=A0=D0=9E=D1=82=D0=BE=D0=B1=D1=80=D0=B0=D0=B6=D0=B0=D0=B5=D1=82 =D0=B8=
=D0=BD=D1=84=D0=BE=D1=80=D0=BC=D0=B0=D1=86=D0=B8=D1=8E =D0=B8=D0=B7 =D0=BE=
=D0=B1=D1=8A=D0=B5=D0=BA=D1=82=D0=B0 <=D1=84=D0=B0=D0=B9=D0=BB(=D1=8B)>.
>
> But I can't get that output out of
> CYGWIN_NT-6.0 1.7.5(0.225/5/3) 2010-04-12 19:07 i686 Cygwin
> and
> GNU objdump (GNU Binutils) 2.20.51.20100410

Same here. No German either. French and Spanish work, but not quite
right: it needlessly decomposes accented characters into ASCII
characters:

$ LANG=3Des_ES.UTF-8 objdump --help
Modo de empleo: objdump <opcion(es)> <fichero(s)>
Muestra la informaci'on de <fichero(s)> objeto.

That should be "informaci=C3=B3n" rather than "informaci'on".

Translations are working properly for example for 'ls' and 'gcc'.

Andy

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019