Date: Mon, 30 Mar 2009 22:01:39 +0100
From: Dave Korn
To: cygwin AT cygwin DOT com
Subject: Re: sed converts 8-bit input text to 16-bit (Unicode-16?) characters - how to suppress that?

Michael Moser wrote:
>
>> -----Original Message-----

  BTW please trim the redundant headers... it's really considerate not to post
people's email addresses in the body of your post because if you do so they get
harvested by spammers.

>> I tried with input files
>> containing german umlauts and sed does not convert to wide
>> char and it does not produce a BOM marker at the start of the file.
>
> Maybe that conversion comes from me redirecting the output to a file
> using
> 'sed {options} > filename.ext' ?!? I'll have to verify that!

  What terminal/console are you using?  Unicode RXVT by any chance?  How does
it behave in other consoles?  You can edit a file in-place by using the -i
option to sed.

    cheers,
      DaveK
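
For what it's worth, the -i suggestion above could be tried out roughly like this. It is only a sketch, assuming GNU sed (the one Cygwin ships); the scratch file, the sample text and the substitution are made up for illustration. It writes a short ISO-8859-1 line containing an umlaut, edits it in place, and then compares the byte count and dumps the first bytes so a stray BOM or a widening to 16-bit characters would be visible.

```shell
#!/bin/sh
# Sketch only: assumes GNU sed; file contents and pattern are hypothetical.
set -eu

tmp=$(mktemp)                        # scratch file for the experiment

# ISO-8859-1 sample: \374 (octal) is u-umlaut, a single byte in that charset.
printf 'gr\374n\n' > "$tmp"
before=$(wc -c < "$tmp")             # size in bytes before editing

# Edit in place instead of redirecting stdout through the console.
# LC_ALL=C makes sed treat the input as plain bytes.
LC_ALL=C sed -i 's/gr/Gr/' "$tmp"

after=$(wc -c < "$tmp")              # size in bytes after editing
# Hex dump of the first three bytes, with spaces and newlines removed.
first_bytes=$(od -An -tx1 -N3 "$tmp" | tr -d ' \n')

echo "bytes before: $before, after: $after"
echo "first bytes:  $first_bytes"

rm -f "$tmp"
```

If the two byte counts match and the dump does not start with fffe (UTF-16LE BOM) or efbbbf (UTF-8 BOM), sed itself left the 8-bit text alone, which would point at the console or the redirection instead.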