www.delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
X-SWARE-Spam-Status: | No, hits=-1.9 required=5.0 tests=BAYES_00,SPF_NEUTRAL |
X-Spam-Check-By: | sourceware.org |
Message-ID: | <4ACB6309.9020609@cornell.edu> |
Date: | Tue, 06 Oct 2009 11:32:25 -0400 |
From: | Ken Brown <kbrown AT cornell DOT edu> |
User-Agent: | Thunderbird 2.0.0.22 (Windows/20090605) |
MIME-Version: | 1.0 |
To: | cygwin AT cygwin DOT com |
Subject: | Re: [ANNOUNCEMENT] [1.7] Updated: cygwin-1.7.0-62 |
References: | <announce DOT 20091003135912 DOT GA32467 AT calimero DOT vinschen DOT de> |
In-Reply-To: | <announce.20091003135912.GA32467@calimero.vinschen.de> |
X-IsSubscribed: | yes |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
Note-from-DJ: | This may be spam |
--------------040002080806050804010804 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit On 10/3/2009 9:59 AM, Corinna Vinschen wrote: > Apart from bugfixes, this patch contains a change to the > internationalization efforts in Cygwin which cristalized out of a couple > of longish discussions on the cygwin and cygwin-developer lists. > > Here's how it's supposed to work in future: [...] > - The "C" locale's default charset is UTF-8. Does this mean that non-ASCII characters are supposed to display OOTB, or is some user configuration expected? Here's a test case. I've tried to view the attached file (extracted from the output of fc-list) in various ways, and here's what I've found (running XP in the U.S., with no language-related customization): - Using emacs under X, emacs recognizes the file as UTF-8 and displays the foreign characters correctly. - 'cat temp.txt' in the cygwin console produces lots of question marks. - 'cat temp.txt' in xterm or mintty produces lots of garbage. The garbage changes in mintty if I change the choice of codepage in the options, but I haven't been able to get rid of the garbage. - If I set LANG=C.UTF-8 before starting xterm, I get correct display of the foreign characters as in emacs (under X). But this doesn't seem to work for the cygwin console or mintty (or at least I haven't figured out how to make it work). Ken P.S. This post is related to the discussion started in http://cygwin.com/ml/cygwin-developers/2009-10/msg00062.html. But I'm approaching the question as a user, so I didn't think I should reply there. (I'm not subscribed anyway.) --------------040002080806050804010804 Content-Type: text/plain; name="temp.txt" Content-Transfer-Encoding: 8bit Content-Disposition: inline; filename="temp.txt" obyčejné Κανονικά Normál Обычный Normálne --------------040002080806050804010804 Content-Type: text/plain; charset=us-ascii -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple --------------040002080806050804010804--
webmaster | delorie software privacy |
Copyright 2019 by DJ Delorie | Updated Jul 2019 |