Re: whitehouse.gov/robots.txt
11 Dec
2003
1:02 p.m.
I'd suggest "wget" for spidering sites. It can be told to ignore robots.txt files, and it is good for mirroring sites that you suspect may be taken down. Windows and Unix versions are available.
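For anyone who wants to script this, here is a rough sketch of how the wget call might be assembled from Python. The helper names (`build_wget_command`, `mirror`), the `mirror` output directory, the one-second wait, and the example URL are all my own assumptions for illustration; the wget options themselves (`--mirror`, `--convert-links`, `--page-requisites`, `--no-parent`, `--wait`, `--directory-prefix`, and `-e robots=off`) are standard ones, but check them against your installed version.

```python
import shutil
import subprocess


def build_wget_command(url, dest_dir="mirror", ignore_robots=True, wait_seconds=1):
    """Return the argv list for a recursive wget mirror of `url`.

    Hypothetical helper: dest_dir and wait_seconds are illustrative defaults.
    """
    cmd = [
        "wget",
        "--mirror",                        # recursive download with timestamping
        "--convert-links",                 # rewrite links so the copy browses locally
        "--page-requisites",               # also fetch images, CSS, etc.
        "--no-parent",                     # do not climb above the starting directory
        f"--wait={wait_seconds}",          # pause between requests
        f"--directory-prefix={dest_dir}",  # where the mirror is written
    ]
    if ignore_robots:
        # Same effect as putting "robots = off" in .wgetrc
        cmd += ["-e", "robots=off"]
    cmd.append(url)
    return cmd


def mirror(url, **kwargs):
    """Run wget on `url`; raises if wget is missing or exits non-zero."""
    if shutil.which("wget") is None:
        raise RuntimeError("wget not found on PATH")
    return subprocess.run(build_wget_command(url, **kwargs), check=True)
```

Calling `mirror("https://example.org/")` would start the download; `build_wget_command` on its own just returns the argument list, so you can print it and inspect it before running anything.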
Major Variola (ret)