How Google Does It
If you are as fascinated by Google as I am, you will enjoy this c|net article about the inner workings of Google's data centers. Without giving too much away (naturally), Google Fellow Jeff Dean talks about Google servers, parallelism, and the three core elements of "The Secret Sauce": the Google File System (GFS), BigTable, and MapReduce.
From the article:
“To provide some structure to all that data, Google uses BigTable. Commercial databases from companies such as Oracle and IBM don't cut the mustard here. For one thing, they don't operate at the scale Google demands, and if they did, they'd be too expensive, Dean said.
BigTable, which Google began designing in 2004, is used in more than 70 Google projects, including Google Maps, Google Earth, Blogger, Google Print, Orkut, and the core search index. The largest BigTable instance manages about 6 petabytes of data spread across thousands of machines, Dean said.”
Keep an eye out for the video/slides from Jeff's talk. It was an amazing presentation.