www.delorie.com/gnu/docs/wget/wget_12.html   search  
 
Buy GNU books!


GNU Wget Manual

[ < ] [ > ]   [ << ] [ Up ] [ >> ]         [Top] [Contents] [Index] [ ? ]

2.10 Recursive Accept/Reject Options

`-A acclist --accept acclist'
`-R rejlist --reject rejlist'
Specify comma-separated lists of file name suffixes or patterns to accept or reject (see section 4.2 Types of Files for more details).

`-D domain-list'
`--domains=domain-list'
Set domains to be followed. domain-list is a comma-separated list of domains. Note that it does not turn on `-H'.

`--exclude-domains domain-list'
Specify the domains that are not to be followed. (see section 4.1 Spanning Hosts).

`--follow-ftp'
Follow FTP links from HTML documents. Without this option, Wget will ignore all the FTP links.

`--follow-tags=list'
Wget has an internal table of HTML tag / attribute pairs that it considers when looking for linked documents during a recursive retrieval. If a user wants only a subset of those tags to be considered, however, he or she should be specify such tags in a comma-separated list with this option.

`-G list'
`--ignore-tags=list'
This is the opposite of the `--follow-tags' option. To skip certain HTML tags when recursively looking for documents to download, specify them in a comma-separated list.

In the past, the `-G' option was the best bet for downloading a single page and its requisites, using a commandline like:

 
wget -Ga,area -H -k -K -r http://site/document

However, the author of this option came across a page with tags like <LINK REL="home" HREF="/"> and came to the realization that `-G' was not enough. One can't just tell Wget to ignore <LINK>, because then stylesheets will not be downloaded. Now the best bet for downloading a single page and its requisites is the dedicated `--page-requisites' option.

`-H'
`--span-hosts'
Enable spanning across hosts when doing recursive retrieving (see section 4.1 Spanning Hosts).

`-L'
`--relative'
Follow relative links only. Useful for retrieving a specific home page without any distractions, not even those from the same hosts (see section 4.4 Relative Links).

`-I list'
`--include-directories=list'
Specify a comma-separated list of directories you wish to follow when downloading (see section 4.3 Directory-Based Limits for more details.) Elements of list may contain wildcards.

`-X list'
`--exclude-directories=list'
Specify a comma-separated list of directories you wish to exclude from download (see section 4.3 Directory-Based Limits for more details.) Elements of list may contain wildcards.

`-np'
`--no-parent'
Do not ever ascend to the parent directory when retrieving recursively. This is a useful option, since it guarantees that only the files below a certain hierarchy will be downloaded. See section 4.3 Directory-Based Limits, for more details.


[ < ] [ > ]   [ << ] [ Up ] [ >> ]         [Top] [Contents] [Index] [ ? ]

  webmaster   donations   bookstore     delorie software   privacy  
  Copyright © 2003   by The Free Software Foundation     Updated Jun 2003  

Please take a moment to fill out this visitor survey
You can help support this site by visiting the advertisers that sponsor it! (only once each, though)