USG pulls 'sensitive' info off net

Steve Schear schear at lvcm.com
Wed Oct 3 14:49:35 PDT 2001


At 12:29 PM 10/3/2001 -0700, Karsten M. Self wrote:
>on Wed, Oct 03, 2001 at 11:00:04AM -0400, Declan McCullagh (declan at well.com)
>wrote:
> >
> > On Wed, Oct 03, 2001 at 06:38:05AM -0700, Khoder bin Hakkin wrote:
> > > Must've never heard of caching..
> > >
> > > http://www.latimes.com/news/nationworld/nation/la-100301safe.story
>
> > Inevitable next step: Enterprising cypherpunk registers
> > censoredfedinfo.org, hunts through google's cache, posts everything
> > there, etc.
>
>Note that there are a relatively small number of Googles on the Net.

The trouble with Google and most other spiders is that they cannot access
the databases behind the sites.  Various industry estimates place the amount
of data not accessible to crawlers at up to 500x the HTML content.  What's
needed are open-access data-mining sites using more sophisticated crawlers
like http://telegraph.cs.berkeley.edu/

steve





More information about the cypherpunks-legacy mailing list