X-Recipient: archive-cygwin AT delorie DOT com X-Spam-Check-By: sourceware.org Date: Tue, 8 Sep 2009 21:34:56 +0200 From: Corinna Vinschen To: cygwin AT cygwin DOT com Subject: Re: The C locale Message-ID: <20090908193456.GC17515@calimero.vinschen.de> Reply-To: cygwin AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com References: <416096c60908300959i1e0084b1xc8f6e65e792b035d AT mail DOT gmail DOT com> <20090831005258 DOT GG2068 AT ednor DOT casa DOT cgf DOT cx> <416096c60909012329l2f25e735yc07145b8d6698cda AT mail DOT gmail DOT com> <3f0ad08d0909020656v7d9fce6ft4afea63ed363b9a9 AT mail DOT gmail DOT com> <416096c60909071308qc5ff057sbe9cb1dbc270554f AT mail DOT gmail DOT com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <416096c60909071308qc5ff057sbe9cb1dbc270554f@mail.gmail.com> User-Agent: Mutt/1.5.19 (2009-02-20) Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com On Sep 7 21:08, Andy Koppe wrote: > Which leaves one apparently good solution for the "C" locale: > >> - Use the default Windows codepage for filenames, console, and > >> multibyte functions. This is what happens already if you specifiy a > >> locale with a language but no charset, e.g. "en". Maximum 1.5 > >> compatibility. UTF-8 has been chosen because it has the advantage that every UTF-16 Windows filename will result in a valid multibyte string. Every choice has its advantage and its trade-offs. Maximum 1.5 compatibility (what for and how long?) vs. maximum default usability in the long run (at least I hope so). > On a closely related note, Debian are introducing a "C.UTF-8" locale > as a language-neutral locale with a UTF-8 character set. This is > useful for choosing UTF-8 without picking up language-specific stuff > like sorting rules. See here: > http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=522776. It's a rather > lengthy thread, but in the end they did decide to go for it. Doesn't just setting LC_CTYPE=fo_ba.UTF-8 has the same result? > Cygwin 1.7, through newlib, already has "C-UTF-8", as well as the > likes of "C-ISO-8859-1" or "C-SJIS". So how about replacing the "C-" > with "C." in those, considering that Cygwin has no backward > compatibility requirement regarding those? No, but newlib has. That was the only reason to keep these specifiers. Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple