X-Recipient: archive-cygwin AT delorie DOT com DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:date:from:reply-to:message-id:to:subject :in-reply-to:references:mime-version:content-type :content-transfer-encoding; q=dns; s=default; b=exJucWXzUIqBPU9+ scb0bEGRw6wo5+osg1xO6cJxNXXM07As6uY80sVJbYNSjJRv8lHTVKY8tJg+Dd1t /nkEYYedeF0L0JgTOwEFdkRLfJgwtpjoiY8hHiqh/cc3qmLltRuz+gKdl6ub9gfJ fZIIwJHdPPyhfXE/rS3lRoHmh+k= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:date:from:reply-to:message-id:to:subject :in-reply-to:references:mime-version:content-type :content-transfer-encoding; s=default; bh=jKrVxlxIkJaShpqZfpcaaE n1rk0=; b=RLUbf4Wks7wwTnTsa/r8j22luUIUcdZmEnd1yLlc5Uwq6SetDzTquO USe7zOoMgBH/9YKFfvT7o9xXVnBY/5+aXppGR1T9fl2MzW7+9fG82Kub6F/+k4tv jcBcjnyVxyPFQ0AZZcLxu6cEFI5Y9lTDYZdFyNkQe1bFkKpb59O6E= Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: Yes, score=6.0 required=5.0 tests=BAYES_50,BODY_8BITS,FREEMAIL_FROM,GARBLED_BODY,KAM_THEBAT,MIME_BASE64_BLANKS,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=no version=3.3.2 spammy=H*M:yandex, H*RU:192.168.1.10, H*r:ip*192.168.1.10, Hx-spam-relays-external:192.168.1.10 X-HELO: forward105p.mail.yandex.net Authentication-Results: smtp2p.mail.yandex.net; dkim=pass header.i=@yandex.ru Date: Thu, 22 Mar 2018 14:24:46 +0300 From: Andrey Repin Reply-To: cygwin AT cygwin DOT com Message-ID: <1547310513.20180322142446@yandex.ru> To: Dmitry Katsubo , cygwin AT cygwin DOT com Subject: Re: Quotes around command-line argument that has unicode characters are not removed In-Reply-To: <08d9621d-b9a0-c0d7-b58b-581ab957a08c@mail.ru> References: <08d9621d-b9a0-c0d7-b58b-581ab957a08c AT mail DOT ru> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 X-IsSubscribed: yes Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by delorie.com id w2MBbGGN021777 Greetings, Dmitry Katsubo! > Dear Cygwin community, > I observe the following on my Cygwin: This is not cygwin, this is bare Windows. > when I put quotes around file that has > non-ASCII symbols, these quotes are passed to argv of the process literally, > otherwise they are removed. I would expect that there is a consistency. Parameter unquoting done by the shell. CMD does that differently from POSIX shells. > I have written a small C program that displays arguments, and run it three > times: Run it in bash. I'm pretty sure you will see your results more consistent. > #1 For the file with space, taken into quotes ("the file.txt") -- OK > #2 For the file with non-ASCII characters (Château.txt) -- OK > #3 For the file with non-ASCII characters, taken into quotes ("Château.txt") -- WRONG > d:\cli> uname -a > CYGWIN_NT-6.1-WOW PC 2.9.0(0.318/5/3) 2017-09-12 10:41 i686 Cygwin > D:\cli> chcp > Active code page: 866 > D:\cli> dir > ...cut... > 2018-03-22 00:43 0 Château.txt > 2018-03-22 00:01 393 test.c > 2018-03-22 00:01 150,230 test.exe > 2018-03-21 00:15 186 test.pl > 2018-03-22 00:43 0 the file.txt > 2018-03-22 00:40 16 текст плюс.txt > 6 File(s) 150,825 bytes > 2 Dir(s) 41,972,293,632 bytes free > D:\cli> test "the file.txt" > param 0 = test > param 1 = the file.txt > File 'the file.txt' was opened > D:\cli> test Château.txt > param 0 = test > param 1 = Château.txt > File 'Château.txt' was opened > D:\cli> test "Château.txt" > param 0 = test > param 1 = "Château.txt" > Failed to open '"Château.txt"': No such file or directory > As one can see, the last run fails. I am a bit puzzled: how can I pass the name > of the file with space and Unicode symbols? I need to do it in uniform way, as I > am calling a Cygwin program from native Windows program, as in [1]. > D:\cli> test "текст плюс.txt" > param 0 = test > param 1 = "текст плюс.txt" > Failed to open '"текст плюс.txt"': No such file or directory > I have search a bit, but I couldn't find a direct answer. From post [1] and [2] > I see that compiler inserts the code to do some argument pre-processing like > @pathnames [3], but what are exactly the rules? Is quote pre-processing done in > dcrt0.cc:177 [4]? > Any feedback is appreciated. > [1] https://sourceware.org/ml/cygwin/2016-05/msg00082.html > [2] http://daviddeley.com/autohotkey/parameters/parameters.htm > [3] https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-at > [4] https://github.com/openunix/cygwin/blob/master/winsup/cygwin/dcrt0.cc#L177 > === test.c === > #include > #include > #include > int main(int argc, char* argv[]) > { > for (int i = 0; i < argc; i++) > { > printf("param %d = %s\n", i, argv[i]); > } > FILE* f = fopen(argv[1], "r"); > if (f != NULL) > { > printf("File '%s' was opened\n", argv[1]); > fclose(f); > } else { > printf("Failed to open '%s': %s\n", argv[1], strerror(errno)); > } > return 0; > } -- With best regards, Andrey Repin Thursday, March 22, 2018 14:21:25 Sorry for my terrible english... -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple