Re: whitehouse.gov/robots.txt

11 Dec
2003
11 Dec
'03
1:02 p.m.
I'd suggest "wget" for spidering sites. It can be told to ignore .robots files. It is good for mirroring sites which you suspect may be taken down. Win/Unix versions available.
7796
Age (days ago)
7796
Last active (days ago)
0 comments
1 participants
participants (1)
-
Major Variola (ret)