4 Oct 2001
6:47 a.m.
At 02:49 PM 10/3/01 -0700, Steve Schear wrote:
The trouble with Google and most other spiders is that they cannot access the DBs behind the sites. Various industry estimates place the amount of data not accessible to crawlers at up to 500x the HTML content. What's needed are open access data mining sites using more sophisticated crawlers like http://telegraph.cs.berkeley.edu/

Must've never heard of caching..
On Wed, Oct 03, 2001 at 06:38:05AM -0700, Khoder bin Hakkin wrote:
Or for readers to take "Google" as representative of wget-crazed ephemeral rogue libertarian librarians.