Google organizes 0.02% of world's information

Posted on Monday, November 12 2007 @ 5:10 CET by Thomas De Maesschalck
KnowledgeBid estimates Google is now organizing about 0.02% of all the information in the world:
Berkeley’s School of Information Management Systems calculated how much unique information was produced in 1999 and again in 2002. The SIMS studies are regarded as the benchmarks in this arena, measuring the volume of unique data created in the world each year saved to film, disk, optical, and paper. The 1999 study estimated that between 2,132,238 terabytes ("TB") and 3,212,731 TB of unique information were produced that year. The later study estimated that between 3,416,281 TB and 5,609,121 TB were produced in 2002, so there was a ~19% annual growth rate between ‘99 and ‘02. Assuming the same growth rate to present, somewhere between 6,869,341 TB and 11,278,629 TB of new information were produced in 2006.

In November of 2006, Google released a paper called "Bigtable: A Distributed Storage System for Structured Data" which discussed Google’s proprietary distributed storage system designed to scale to petabytes of data across thousands of commodity servers. The paper also mentioned that Google’s web index at the time was 800 TB and Bigtable as a whole was ~1085 TB. If you take the SIMS numbers as valid, in 2006 Google had organized the equivalent of 0.02% of the unique information produced in that year alone.
Interesting numbers.


About the Author

Thomas De Maesschalck

Thomas has been messing with computer since early childhood and firmly believes the Internet is the best thing since sliced bread. Enjoys playing with new tech, is fascinated by science, and passionate about financial markets. When not behind a computer, he can be found with running shoes on or lifting heavy weights in the weight room.



Loading Comments