Google Watch is hysterical in both senses.
I can't believe this conspiracy theorist hasn't even figured out who GoogleGuy is.
I don't buy this:
Quote:
12:04 pm on June 7, 2003
Google has reached its data indexing capacity of 4,294,967,296 (2^32) URLs. Now non-image URLs have an ID stored in 4 bytes, so Google is now running out of IDs for stored pages. When there will be no URLs returned "not found" and deleted from the index, total number of non-image files indexed will soon reach 4,294,967,296 including 3,083,324,652 html pages. After that Google will stop adding new URLs from indexed pages as well as new URLs added for indexing.
They are now considering reconstruction of the data tables which involves expanding ID fields to 5 bytes. This will result in additional 2 bytes per every word indexed throwing the total index size to be multiplied by 1.17. This procedure will require 1000 new page index servers and additional storage for temporary tables. They are hoping to make this change gradually server by server. The completion of the process will take up to one year after that the main URL index will be switched to use 5 bytes ID.
|
Why would a little extra data storage require 1000 new index servers and a year of time? Why couldn't Google double their number of index servers in one week? Reindexing a table is not difficult. Mirroring everything offline is totally within their ability and budget.
Lastly, this idea contradicts the fundamental concept of Google's algoriithm, their branding, and their ability to plan. It looks like nothing but a cheap shot, worthy of Microsoft post thug disruption tactics.