Excellent idea as the wayback is it really doesn't seem to capture anywhere near as wide a range of pages as google, and it also seems to be straining under the weight of requests. the number of internal server errors it generates is absurd, especially considering the size of some of it's sponsors. perhaps a google mirror that was archived periodically would be a better bet? tolan -----Original Message----- From: Bill Stewart [mailto:bill.stewart@pobox.com] Sent: 27 October 2001 04:37 To: Tolan Blundell Cc: cypherpunks@einstein.ssz.com Subject: Re: web.archive.org Internet archive to open ---google + archeology It's not as outrageous as you'd think. 100GB drives are around $200, which means that a terabyte will cost you about $3K if you throw in a PC and some networking gear to connect it, so you could replicate that in your basement next to your DES-cracker for about the same price - the more expensive problem is getting the fiber optic connection from the Presidio to your basement to keep it updated. More to the point, recent news articles say the Feds have been getting Google to delete things for them. http://www.inet-one.com/cypherpunks/current/msg00505.html Anybody know what's been deleted, and whether it's still in Wayback, and whether we can get copies out into the public before anyone pressures Brewster Kahle? At 06:10 PM 10/25/2001 +0100, Tolan Blundell wrote:
Thats fine, I've got a 100TB server in my attic you can use if you want? ;)
jbdigriz: Way cool. It needs to be mirrored, though. Single point of failure/distribution invites history being rewritten the way it always has been until now.
http://www.latimes.com/news/nationworld/nation/la-102501archive.story By JOSEPH MENN, Times Staff Writer
SAN FRANCISCO -- An Internet archive containing more text than any library in history will open its digital doors today, giving researchers and the public access to just about everything posted on the World Wide Web over the last five years.