[IP] Interesting speculation on the tech behind gmail

To: ip@xxxxxxxxxxxxxx
Subject: [IP] Interesting speculation on the tech behind gmail
From: Dave Farber <dave@xxxxxxxxxx>
Date: Tue, 06 Apr 2004 04:31:35 -0400
List-help: <http://v2.listbox.com/doc/help_sub?list_name=ip@v2.listbox.com>
List-id: <ip@xxxxxxxxxxxxxx>
List-software: listbox.com v2.0
List-subscribe: <mailto:subscribe-ip@v2.listbox.com>, <http://v2.listbox.com/subscribe/?listname=ip@v2.listbox.com>
List-unsubscribe: <mailto:unsubscribe-ip@v2.listbox.com>, <http://v2.listbox.com/member/unsubscribe/?listname=ip@v2.listbox.com>
Reply-to: dave@xxxxxxxxxx
Sender: owner-ip@xxxxxxxxxxxxxx


Delivered-To: dfarber+@xxxxxxxxxxxxxxxxxx
Date: Tue, 06 Apr 2004 13:56:09 +0530
From: Suresh Ramasubramanian <suresh@xxxxxxxxxx>
Subject: Interesting speculation on the tech behind gmail
To: Dave Farber <dave@xxxxxxxxxx>

http://blog.topix.net/archives/000016.html

April 04, 2004
The Secret Source of Google's Power

Much is being written about Gmail, Google's new free webmail system.There's something deeper to learn about Google from this product than theinitial reaction to the product features, however. Ignore for a moment theobservations about Google leapfrogging their competitors with more uservalue and a new feature or two. Or Google diversifying away from searchinto other applications; they've been doing that for a while. Or theprivacy red herring.

No, the story is about seemingly incremental features that are actuallymassively expensive for others to match, and the platform that Google isbuilding which makes it cheaper and easier for them to develop and runweb-scale applications than anyone else.

I've written before about Google's snippet service, which required thatthey store the entire web in RAM. All so they could generate a slightlybetter page excerpt than other search engines.

Google has taken the last 10 years of systems software research out ofuniversity labs, and built their own proprietary, production qualitysystem. What is this platform that Google is building? It's a distributedcomputing platform that can manage web-scale datasets on 100,000 nodeserver clusters. It includes a petabyte, distributed, fault tolerantfilesystem, distributed RPC code, probably network shared memory andprocess migration. And a datacenter management system which lets a handfulof ops engineers effectively run 100,000 servers. Any of these projectscould be the sole focus of a startup.


Speculation: Gmail's Architecture and Economics

Let's make some guesses about how one might build a Gmail.

Hotmail has 60 million users. Gmail's design should be comparable, andshould scale to 100 million users. It will only have to support a couple ofmillion in the first year though.

The most obvious challenge is the storage. You can't lose people's email,and you don't want to ever be down, so data has to be replicated. RAID isno good; when a disk fails, a human needs to replace the bad disk, or thereis risk of data loss if more disks fail. One imagines the old ENIACtechnician running up and down the isles of Google's data center with ashopping cart full of spare disk drives instead of vacuum tubes. RAID alsorequires more expensive hardware -- at least the hot swap drive trays. AndRAID doesn't handle high availability at the server level anyway.

No. Google has 100,000 servers. [nytimes] If a server/disk dies, they leaveit dead in the rack, to be reclaimed/replaced later. Hardware failures needto be instantly routed around by software.

Google has built their own distributed, fault-tolerant, petabytefilesystem, the Google Filesystem. This is ideal for the job. Say GFSreplicates user email in three places; if a disk or a server dies, GFS canautomatically make a new copy from one of the remaining two. Compress theemail for a 3:1 storage win, then store user's email in three locations,and their raw storage need is approximately equivalent to the user's mail size.

The Gmail servers wouldn't be top-heavy with lots of disk. They need theCPU for indexing and page view serving anyway. No fancy RAID card orhot-swap trays, just 1-2 disks per 1U server.

It's straightforward to spreadsheet out the economics of the service,taking into account average storage per user, cost of the servers, andmonetization per user per year. Google apparently puts the operational costof storage at $2 per gigabyte. My napkin math comes up with numbers in thesame ballpark. I would assume the yearly monetized value of a webmail userto be in the $1-10 range.


Cheap Hardware

Here's an anecdote to illustrate how far Google's cultural approach tohardware cost is different from the norm, and what it means as a componentof their competitive advantage.

In a previous job I specified 40 moderately-priced servers to run a newinternet search site we were developing. The ops team overrode me; theywanted 6 more expensive servers, since they said it would be easier tomanage 6 machines than 40.

What this does is raise the cost of a CPU second. We had engineers thatcould imagine algorithms that would give marginally better search results,but if the algorithm was 10 times slower than the current code, ops wouldhave to add 10X the number of machines to the datacenter. If you've alreadygot $20 million invested in a modest collection of Suns, going 10X to runsome fancier code is not an option.


Google has 100,000 servers.

Any sane ops person would rather go with a fancy $5000 server than a bare$500 motherboard plus disks sitting exposed on a tray. But that's a 10Xdifference to the cost of a CPU cycle. And this frees up the algorithmdesigners to invent better stuff.

Without cheap CPU cycles, the coders won't even consider algorithms thatthe Google guys are deploying. They're just too expensive to run.

Google doesn't deploy bare motherboards on exposed trays anymore; they'reon at least the fourth iteration of their cheap hardware platform. Googlenow has an institutional competence building and maintaining servers thatcost a lot less than the servers everyone else is using. And they do itwith fewer people.

Think of the little internal factory they must have to deploy servers, andthe level of automation needed to run that many boxes. Either network bootor a production line to pre-install disk images. Servers thatself-configure on boot to determine their network config and load thelatest rev of the software they'll be running. Normal datacenter opspractices don't scale to what Google has.

What are all those OS Researchers doing at Google?

Rob Pike has gone to Google. Yes, that Rob Pike -- the OS researcher, themember of the original Unix team from Bell Labs. This guy isn't just somelabs hood ornament; he writes code, lots of it. Big chunks of whole newoperating systems like Plan 9.

Look at the depth of the research background of the Google employees in OS,networking, and distributed systems. Compiler Optimization. Threadmigration. Distributed shared memory.

I'm a sucker for cool OS research. Browsing papers from Google employeesabout distributed systems, thread migration, network shared memory, GFS,makes me feel like a kid in Tomorrowland wondering when we're going toMars. Wouldn't it be great, as an engineer, to have production versions ofall this great research.


Google engineers do!

Competitive Advantage

Google is a company that has built a single very large, custom computer.It's running their own cluster operating system. They make their bigcomputer even bigger and faster each month, while lowering the cost of CPUcycles. It's looking more like a general purpose platform than a clusteroptimized for a single application.

While competitors are targeting the individual applications Google hasdeployed, Google is building a massive, general purpose computing platformfor web-scale programming.

This computer is running the world's top search engine, a social networkingservice, a shopping price comparison engine, a new email service, and alocal search/yellow pages engine. What will they do next with the world'sbiggest computer and most advanced operating system?


Posted by skrenta at April 4, 2004 02:11 PM | TrackBack

-------------------------------------
You are subscribed as roessler@xxxxxxxxxxxxxxxxxx
To manage your subscription, go to
 http://v2.listbox.com/member/?listname=ip

Archives at: http://www.interesting-people.org/archives/interesting-people/

Prev by Date: [IP] Broadband legal limbo lingers
Next by Date: [IP] "A Bad Case of Gas"
Previous by thread: [IP] Broadband legal limbo lingers
Next by thread: [IP] "A Bad Case of Gas"
Index(es):
- Date
- Thread