www.delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
X-SWARE-Spam-Status: | No, hits=-0.7 required=5.0 tests=AWL,BAYES_50,SPF_PASS |
X-Spam-Check-By: | sourceware.org |
Message-ID: | <BLU113-W51FC38A48F454394262F2CBEA70@phx.gbl> |
From: | Mike Marchywka <marchywka AT hotmail DOT com> |
To: | <cygwin AT cygwin DOT com> |
Subject: | RE: pdftk and apropos - general questions |
Date: | Wed, 4 Mar 2009 10:35:51 -0500 |
In-Reply-To: | <49AE9494.1000804@veritech.com> |
References: | <BLU113-W74226535EC192149C5AEABEA60 AT phx DOT gbl> <49AE9494 DOT 1000804 AT veritech DOT com> |
MIME-Version: | 1.0 |
X-IsSubscribed: | yes |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Unsubscribe: | <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
Note-from-DJ: | This may be spam |
---------------------------------------- > Date: Wed, 4 Mar 2009 09:47:48 -0500 > From: > To: cygwin AT cygwin DOT com > Subject: Re: pdftk and apropos - general questions > > Mike Marchywka wrote: >> I've had a persistent problem getting apropos to work >> as it never finds anything appropriate. Is there >> something I need to do to make this work? >> > After each setup session, you need to run, /usr/sbin/makewhatis -u. Thanks but I did get that far after earlier hints and you list below is about what I ended up with too. One problem I ran into was trying to extract sensical text from the=20 IRS instructions. I used the pdftotext utility IIRC from=20 http://www.foolabs.com/xpdf/download.html and it didn't seem to be able to separate multi-column text automatically ( with sed and awk I got what I needed but what a mess). Is there a toolkit source or compiled program I could use to diagnose or fix this? I'd also like to be able to fill out forms programmatically- I would love to print out a filled-in 1040 form but I'm not going to buy software to do this or type it into a GUI. I'm going on a bit of a cusade about proprietary format or limited-supoort formats for public documents.=20 You'd be amazed how many public filings that should contain information are in a format like a scanned pdf from which little usable information can be extracted. The FCC even seems to accept locked PDF submissions... [ at this point, people concerned about top-posting should=20 be exploding over gh-osting or posting about text which is gone. LOL] > > _________________________________________________________________ Express your personality in color! Preview and select themes for Hotmail=AE= .=20 http://www.windowslive-hotmail.com/LearnMore/personalize.aspx?ocid=3DTXT_MS= GTX_WL_HM_express_032009#colortheme -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |