www.delorie.com/archives/browse.cgi | search |
2009/9/22 Andy Koppe <andy DOT koppe AT gmail DOT com>: > Let's use the Windows "ANSI" codepage as the character set for the C > locale, for both the conversion functions and filenames. This means > CP1252 on Western systems, CP1251 on Cyrillic ones, CP932 on Japanese > ones, and so on. I oppose the approach (the ANSI codepage is used at C locale) because CP932 (the codepage for Japanese) is hostile to the UNIX-like tools. The reason is that the CP932 format contains a lot of meta characters as follows. single character of CP932: /[\x00-\x7F\xA0-\xDF]|[\x81-\x9F\xE0-\xFC][\x40-\x7E\x80-\xFC]/ This has a ruined influence to the tools that don't see locale. -- IWAMURO Motnori <http://vmi.jp/> -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |