usage: httrack [-aABcC[number]dDefFgIKlLm[number]M[number]nNOpP[number]qQr[number]RStTUvwWzZ] <URLs> [+<URLs>] [-<URLs>]
with options listed below: (* is the default value)

w *mirror with automatic wizard
W semi-automatic mirror with help-wizard
g just get files (saved in the current directory)
rN recurse get with limited link depth of N - does not use the wizard
a *stay on the same address
d stay on the same principal domain
l stay on the same location (.com, etc.)
e go everywhere on the web
n get non-html files 'near' an html file (ex: an image located outside)
t test all URLs (even forbidden ones)
S stay on the same directory
D *can only go down into subdirs
U can only go to upper directories
B can both go up&down into the directory structure
NN name conversion type (0 *original structure 1,2,3 html/data in one directory)
LN long names (L1 *long names / L0 8-3 conversion)
K keep original links (e.g. http://www.adr/link)
R *relative links (e.g. ../link)
pN priority mode: (* p3)
   0 just scan, don't save anything (for checking links)
   1 save only html files
   2 save only non html files
  *3 save all files
   7 get html files first, then treat other files
cN number of multiple connections (* c8)
O path for mirror/logfiles+cache (-O path_mirror[,path_cache_and_logfiles])
P proxy use (-P url:port)
F user-agent field (-F "user-agent name")
mN maximum file length for a non-html file
mN,N' for non html (N) and html (N')
MN maximum overall size that can be uploaded/scanned
TN timeout, number of seconds after which a non-responding link is shut down
I *make an index (I0 don't make)
C *create/use a cache for updates and retries (C0 no cache)
q quiet mode (no questions)
Q log quiet mode (no log)
v verbose screen mode
f *log file mode
z extra infos log
Z debug log
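
The pN and -O entries above take structured values, so a small helper can make their format concrete. The following is only a sketch in Python: the function names `priority_flag` and `output_args` are hypothetical, and the assumption that a numbered option is written as a dash, the letter, and the number (as in the `-er6` example of this help text) is inferred from the listing rather than confirmed by it.

```python
# Priority modes as listed under "pN" in the help text (3 is the default).
PRIORITY_MODES = {
    0: "just scan, don't save anything (for checking links)",
    1: "save only html files",
    2: "save only non html files",
    3: "save all files",
    7: "get html files first, then treat other files",
}


def priority_flag(mode=3):
    """Return the pN option for a listed priority mode (hypothetical helper)."""
    if mode not in PRIORITY_MODES:
        raise ValueError(f"unknown priority mode: {mode}")
    # Assumption: numbered options are written as -<letter><number>.
    return f"-p{mode}"


def output_args(mirror_path, cache_and_log_path=None):
    """Build the -O argument: path_mirror[,path_cache_and_logfiles]."""
    value = mirror_path
    if cache_and_log_path is not None:
        # The optional second path is appended after a comma.
        value = f"{mirror_path},{cache_and_log_path}"
    return ["-O", value]
```

Rejecting modes that are not in the listing (4, 5 and 6, for example) keeps the helper from emitting values the help text does not describe.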
special shortcuts:
--mirror <URLs> *make a mirror of site(s)
--get <URLs> get the files indicated, do not seek other URLs
--spider <URLs> spider site(s), to test links (reports Errors & Warnings)
--update <URLs> update a mirror, without confirmation
--skeleton <URLs> make a mirror, but gets only html files
example: E:\HTTRACK.EXE www.test.com/bob/ means: mirror site www.test.com/bob/ and only this site
example: E:\HTTRACK.EXE www.test.com/bob/ www.anothertest.com/mike/ means: mirror the two sites together (with shared links)
example: E:\HTTRACK.EXE www.test.com/bob/bobby.html -er6 means: get all files starting from bobby.html, with a link depth of 6 and the possibility of going everywhere on the web
example: E:\HTTRACK.EXE www.test.com/bob/bobby.html --spider -P proxy.myweb.abc:8080 means: run the spider on www.test.com/bob/bobby.html using a proxy
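
The examples above can also be assembled programmatically, which helps when the command is launched from a script. This is a minimal sketch in Python, not part of HTTrack: the function `build_httrack_args` and its parameters are hypothetical, the executable name `httrack` is taken from the usage line, and only options shown in this help text (e, rN, --spider, -P) are used.

```python
def build_httrack_args(urls, depth=None, everywhere=False,
                       spider=False, proxy=None, exe="httrack"):
    """Build an argument list matching the examples in the help text."""
    args = [exe, *urls]

    # Single-letter options are grouped after one dash, as in "-er6".
    letters = ""
    if everywhere:
        letters += "e"           # e: go everywhere on the web
    if depth is not None:
        letters += f"r{depth}"   # rN: limited link depth of N
    if letters:
        args.append("-" + letters)

    if spider:
        args.append("--spider")  # shortcut: test links
    if proxy:
        args += ["-P", proxy]    # -P url:port
    return args


print(build_httrack_args(["www.test.com/bob/bobby.html"],
                         depth=6, everywhere=True))
# prints ['httrack', 'www.test.com/bob/bobby.html', '-er6']
```

Returning a list instead of a single string lets the result be passed directly to a process launcher such as Python's `subprocess.run` without any shell quoting.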
HTTrack/0.996-beta-2 [May 28 1998] (HyperTextSeeker lib/0.991-beta-2) (C)'98 Xavier Roche & Yann Philippot from Ensi Caen, France (www.ismra.fr)