From: Hans-Bernhard Bröker
To: cygwin AT cygwin DOT com
Date: Tue, 15 Feb 2022 02:36:08 +0100
Subject: Re: [:xdigit:] does not work with std::wstring in a Cygwin environment

On 13.02.2022 at 19:25, Achim Gratz wrote:
> Gans, Markus writes:
>> This seems to be
>> an internal Cygwin error:
>>
>> https://www.reddit.com/r/cpp_questions/comments/sp52gq/xdigit_does_not_work_with_stdwstring_in_a_cygwin/
>
>> […]
>> Question: Why does Cygwin not detect the letters a, b, c, d, e, and
>> f as hexadecimal digits in a wide string?

[...]

> There is no OS specific configuration for Cygwin explicitly, instead
> there is one for newlib that actually gets used.

This piqued my curiosity, so I had a look at how libstdc++ is built. I
found that for at least one crucial source file, ctype_members.cc,
Cygwin builds do _not_ use the newlib edition, but rather the "generic"
one. And that may very well be the problem here.

The superficial cause of the problem is that the member function
_M_initialize_ctype() in

    libstdc++-v3/config/locale/generic/ctype_members.cc

fills most of its array _M_wmask[] with zeroes instead of meaningful
character class identifiers. The slightly deeper reason is that the
companion array _M_bit[] is also suspiciously full of zeroes.

But the real problem, IMHO, is that the type ctype::mask is just a plain
char. The loop that fills _M_bit[] shifts a single bit through 16
positions, so every shift beyond what a char can hold overflows to zero,
which in turn leads to nonsense in _M_wmask[].

I didn't manage to find where this ctype::mask is defined, but the way
it's used here cannot work if it's defined as plain char. The newlib
edition of ctype_members.cc loops over just 8 bits instead of 16, which
would allow this to work.

So we either have to pick up a different type definition of ctype::mask,
or a different edition of ctype_members.cc. I guess it should be the
newlib one.