X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.8 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <20090928161626.GC8378@calimero.vinschen.de> References: <20090921103758 DOT GE20981 AT calimero DOT vinschen DOT de> <3f0ad08d0909240003j435818e7h6f7cde2e26188f7e AT mail DOT gmail DOT com> <20090924073441 DOT GA30267 AT calimero DOT vinschen DOT de> <3f0ad08d0909240237s518de248jee409b731711404a AT mail DOT gmail DOT com> <20090924095701 DOT GC30851 AT calimero DOT vinschen DOT de> <20090924100006 DOT GD30851 AT calimero DOT vinschen DOT de> <20090926091504 DOT GA7275 AT calimero DOT vinschen DOT de> <3f0ad08d0909262021u5fe79873r65850865166ce40f AT mail DOT gmail DOT com> <3f0ad08d0909280903t5caaf611ie4049a73beb93f06 AT mail DOT gmail DOT com> <20090928161626 DOT GC8378 AT calimero DOT vinschen DOT de> Date: Tue, 29 Sep 2009 05:27:29 +0100 Message-ID: <416096c60909282127h775c674n89fa9b0a6d55d221@mail.gmail.com> Subject: Re: The C locale From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2009/9/28 Corinna Vinschen >> My conclusion is as follows as a result of hearing other Japanese >> people's opinion: >> >> LANG=3Dja -> UTF-8 >> LANG=3Dja_JP -> UTF-8 >> >> Because, we specify "eucJP" explicitly when we need it. > > Hmm. > > That's an interesting point. > > In theory this sounds like a good idea to be used for all locales which > don't specify the charset explicitely, because that results in using the > same charset, "UTF-8", for all such locales. =C2=A0"C", "ja" or "en_US" > would all default to UTF-8. Hmm, there's much to be said for that. > The downside is that a user, who needs to work under the default ANSI > codepage for some reason, has to know the name of the default ANSI > codepage. =C2=A0Right now any user who needs the default ANSI codepage can > simply set LANG to some language code and go ahead, without having to > know the number. =C2=A0With your solution, that wouldn't be possible anym= ore > and the user would have to figure out the default ANSI codepage on the > system before being able to use it. How about an explicit "ANSI" charset that maps to GetACP()? And "OEM" for GetOEMCP()? Those would make easy replacements for the CYGWIN=3Dcodepage:[ansi|oem] option. Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple