Back of the envelope: how many servers does AWS use?
So I want to try to figure out the size of the server network backing AWS.
I’m going to come at this three ways: by revenue, by number of users, and by comparison to another major site. By the way if someone has found a more direct way (e.g. Amazon says somewhere how many servers back AWS) please let me know
Revenue: We don’t know exactly how much Amazon grosses from AWS, but it seems like its on the order of $10-$100 million – let’s call it $50 million. Lets say also they have 50% gross margins against their infrastructure costs (conveniently, this guess is both inherently reasonable and guaranteed not to be off by more than a factor of 2). $25 million dollars a year is about $2 million a month, which would get you about 15,000 2GB servers and corresponding bandwidth from slicehost at retail. Assuming slicehost has similar gross margin rates, we end up with about 30,000 mid-range servers backing AWS.
Now we can also say that $50 million buys 500 million small instance-hours on EC2, which works out to about 60,000 instance-years. If a small instance is physically backed by the equivalent of a low-to-mid-range server, the price-to-server ratio for EC2 is comparable to other AWS services, and Amazon gets about half of its revenue from data transfer as opposed to computing/storage, then we again end up with about 30,000 physical backing servers.
Customers: Eric reported in the TC article linked above that an Amazon executive said that they have 60,000 AWS customers. Its difficult to put an upper limit on how many servers that would correspond to – as some customers could easily use tens or hundreds of machines each – but it does suggest that Amazon almost certainly has 10s of thousands of servers backing AWS.
Comparison: Facebook has about half as many unique visitors as Amazon sites and runs on about 12,500 servers, which would suggest that Amazon proper might need about 25,000 servers. AWS recently passed Amazon proper in terms of bandwidth usage, suggesting that AWS’s infrasturucture is within a small constant factor and probably slightly larger than Amazon’s. This again suggests that AWS is backed by servers numbering in the low-to-mid tens-of-thousands.
Final triangulated guess: ~30,000 commodity servers back AWS.