HTTrack
The Web Mirror Utility

Option list:

usage:
httrack -aABcC[number]dDefFgIKlLm[number]M[number]nNOpP[number]qQr[number]RStTUvwWzZ] <URLs> [+<URLs>] [-<URLs>]

with options listed below: (* is the default value)

w *mirror with automatic wizard

W semi-automatic mirror with help-wizard

g just get files (saved in the current directory)

rN recurse get with limited link depth of N - does not use the wizard

a *stay on the same adress

d stay on the same principal domain

l stay on the same location (.com, etc.)

e go everywhere on the web

n get non-html files 'near' an html file (ex: an image located outside)

t test all URLs (even forbidden ones)

S stay on the same directory

D *can only go down into subdirs

U can only go to upper directories

B can both go up&down into the directory structure

NN name conversion type (0 *original structure 1,2,3 html/data in one directory)

LN long names (L1 *long names / L0 8-3 conversion)

K keep original links (e.g. http://www.adr/link)

R *relative links (e.g ../link)

pN priority mode: (* p3)

0 just scan, don't save anything (for checking links)

1 save only html files

2 save only non html files

*3 save all files

7 get html files before, then treat other files

cN number of multiple connections (* c8)

O path for mirror/logfiles+cache (-O path_mirror[,path_cache_and_logfiles])

P proxy use (-P url:port)

F user-agent field (-F "user-agent name")

mN maximum file length for a non-html file

mN,N' for non html (N) and html (N')

MN maximum overall size that can be uploaded/scanned

TN timeout, number of seconds after a non-responding link is shutdown

I *make an index (I0 don't make)

C *create/use a cache for updates and retries (C0 no cache)

q quiet mode (no questions)

Q log quiet mode (no log)

v verbose screen mode

f *log file mode

z extra infos log

Z debug log

special shortcuts:

--mirror <URLs> *make a mirror of site(s)

--get <URLs> get the files indicated, do not seek other URLs

--spider <URLs> spider site(s), to test links (reports Errors & Warnings)

--update <URLs> update a mirror, without confirmation

--skeleton <URLs> make a mirror, but gets only html files

example: E:\HTTRACK.EXE www.test.com/bob/

means: mirror site www.test.com/bob/ and only this site

example: E:\HTTRACK.EXE www.test.com/bob/ www.anothertest.com/mike/

means: mirror the two sites together (with shared links)

example: E:\HTTRACK.EXE www.test.com/bob/bobby.html -er6

means get all files starting from bobby.html, with 6 link-depth, and possibility of going everywhere on the web

example: E:\HTTRACK.EXE www.test.com/bob/bobby.html --spider -P proxy.myweb.abc:8080

runs the spider on www.test.com/bob/bobby.html using a proxy

HTTrack/0.996-beta-2 [May 28 1998] (HyperTextSeeker lib/0.991-beta-2)

(C)'98 Xavier Roche & Yann Philippot from Ensi Caen, France (www.ismra.fr)