web.archive.org Internet archive to open ---google + archeology

Tolan Blundell tolan at citipages.net
Sun Oct 28 07:44:30 PST 2001


Excellent idea as the wayback is it really doesn't seem to capture anywhere
near as wide a range of pages as google, and it also seems to be straining
under the weight of requests. the number of internal server errors it
generates is absurd, especially considering the size of some of it's
sponsors.

perhaps a google mirror that was archived periodically would be a better
bet?

tolan

-----Original Message-----
From: Bill Stewart [mailto:bill.stewart at pobox.com]
Sent: 27 October 2001 04:37
To: Tolan Blundell
Cc: cypherpunks at einstein.ssz.com
Subject: Re: web.archive.org Internet archive to open ---google +
archeology


It's not as outrageous as you'd think.  100GB drives are around $200,
which means that a terabyte will cost you about $3K if you throw in a PC
and some networking gear to connect it, so you could replicate that
in your basement next to your DES-cracker for about the same price -
the more expensive problem is getting the fiber optic connection
from the Presidio to your basement to keep it updated.

More to the point, recent news articles say the Feds have been
getting Google to delete things for them.
         http://www.inet-one.com/cypherpunks/current/msg00505.html
Anybody know what's been deleted, and whether it's still in Wayback,
and whether we can get copies out into the public before anyone pressures
Brewster Kahle?

At 06:10 PM 10/25/2001 +0100, Tolan Blundell wrote:
>Thats fine, I've got a 100TB server in my attic you can use if you want? ;)

>jbdigriz:
>Way cool. It needs to be mirrored, though. Single point of
>failure/distribution invites history being rewritten the way it always
>has been until now.


> > http://www.latimes.com/news/nationworld/nation/la-102501archive.story
> > By JOSEPH MENN, Times Staff Writer
> >
> > SAN FRANCISCO -- An Internet archive containing more text than any
> > library in history will open its digital doors today, giving researchers
> > and the public access to just about everything posted on the World Wide
> > Web over the last five years.







More information about the cypherpunks-legacy mailing list