<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Cloudscaling &#187; amazon</title>
	<atom:link href="http://www.cloudscaling.com/blog/tag/amazon/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cloudscaling.com</link>
	<description>Open Cloud Solutions</description>
	<lastBuildDate>Wed, 09 May 2012 16:43:27 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Cloud Computing Came to a Head in 2011</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/cloud-computing-came-to-a-head-in-2011/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/cloud-computing-came-to-a-head-in-2011/#comments</comments>
		<pubDate>Sat, 31 Dec 2011 00:42:39 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[Asymco]]></category>
		<category><![CDATA[Automation]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud futures series]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[commoditization]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[iaas]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[OpenStack]]></category>
		<category><![CDATA[predictions]]></category>
		<category><![CDATA[scaling]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[web-scale]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=2637</guid>
		<description><![CDATA[Happy New Year! I hope you are all having a fantastic holiday. This is a year end posting that I think you will find particularly compelling. Rather than predicting the future I thought I would take a look back at &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/cloud-computing-came-to-a-head-in-2011/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Happy New Year!  I hope you are all having a fantastic holiday.  This is a year end posting that I think you will find particularly compelling.  Rather than predicting the future I thought I would take a look back at five long years of ‘cloud computing’.</p>
<p>The Cloudscaling blog has a loyal following as can be seen from the website and RSS feed stats.  As many of you long time readers know, I’ve been ‘in the game’ working on cloud computing technology or blogging about as long as anyone except perhaps those at AWS.  In all of that time, my thinking and assessment of what’s happening and how it’s evolving has changed continuously.  What was interesting for me this year is that this continuously changing perspective slowed to a crawl or perhaps even stopped.  2011 is the year that much of my thinking and perspective on cloud computing, particularly infrastructure clouds (aka “IaaS”) hardened.</p>
<p>That sounds tough.  “Hardened.”  I don’t mean hardened in the sense of rigid, but rather in the notion of wet cement drying.  Many things that have seemed up in the air now seem settled and my doubts about the future of infrastructure clouds are gone.  They are not only here to stay, but the shape and direction of them seem very clear.  I’m not certain everyone else is clear, but I am.  Perhaps I will be wrong, but I doubt it.</p>
<p>Let’s take a look back at the arc of my thinking and how things did NOT change in 2011.  That will tell us what 2012 is likely to look like.</p>
<p><strong>The Evolution of My Cloud Thinking</strong><br />
My thinking evolved through three clear phases:</p>
<blockquote><p>Automation -&gt; VMs &amp; Virtual Datacenters -&gt; New IT Paradigm</p></blockquote>
<p><em>Phase 1: Automation</em><br />
About this time of year, in late 2006, a short time after Amazon EC2 launched, myself and others prototyped a cloud application management framework similar to RightScale.  At that time RightScale was named something else and had not been funded or publicly launched.  These were early days.</p>
<p>As someone with a deep passion for automation, I remember thinking then that a lot of my lifetime interests (networking, storage, security, and systems management) were all converging and being managed by automation.  For me, what was happening was all about automation … and lots of it.</p>
<p><em>Phase 2: Virtual Machines &amp; Virtual Datacenters</em><br />
Roughly summer of 2008, the first “<a href="http://en.wikipedia.org/wiki/CloudCamp">CloudCamp</a>” was thrown where a number of the cloud bloggers and thought leaders came together for the first time.  Unknowingly we all centered about using the term ‘cloud computing’ to explain what this new emerging phenomena was.  It was right after this event and over the summer of 2008 that the term “<a href="http://www.google.com/trends?q=%22cloud+computing%22&amp;ctab=0&amp;geo=all&amp;date=2008&amp;sort=0">cloud computing</a>” really took hold.  This also led to the formation of the “<a href="http://twitter.com/clouderati">clouderati</a>”  and I simultaneously <a href="http://cloudscaling.com/blog/cloud-computing/what-i-did-in-2008">joined GoGrid</a> as the VP Technology Strategy.</p>
<p>Perhaps GoGrid biased my thinking, but I started to move from a perspective that was cloud application centric back into my sweet spot of physical infrastructure and a focus on virtual datacenters or what I called at the time, “<a href="http://blog.gogrid.com/2009/01/08/cloudcenters-are-datacenters-in-the-sky/">cloud centers</a>”.  In this view, virtual machines were king and inevitably, the question was: “how will we model existing datacenter environments?”</p>
<p><em>Phase 3: Cloud Computing is a new kind of IT</em><br />
After leaving GoGrid in the summer of 2009 I had the opportunity to step back and take a fresh look at how things had evolved.  I wanted to build my own cloud business again, but I wanted to skate to where the puck would be, not where it was today.  I also could see that most everyone involved in the cloud computing space was spending time trying to retrofit the notion of ‘cloud computing’ to their existing business models and technology.  Simultaneously, I still didn’t see any serious competitors to AWS.</p>
<p><strong>What</strong> were they doing that was so different??</p>
<p>It’s not well known, but in the beginning of Cloudscaling’s (re)formation in fall of 2009 into mid 2010, I did a number of strategic and due diligence engagements on various IaaS vendors for VC firms, Platform-as-a-Service startups, enterprises, and enterprise vendors.  During that time I was involved in deep technical dives on the technology and business models for these IaaS vendors.  They ranged from GoGrid competitors to more of an enterprise cloud model.  By late 2010 Cloudscaling, collectively, had deep architectural and business model understanding of roughly 10 different Infrastructure-as-a-Service (IaaS) players, not including the IaaS clouds that we helped build [1].  I am not sure anyone else had or has that understanding today.  What we saw, was telling.</p>
<p>My primary takeaway was that even when it came to startups and direct AWS competitors, absolutely none of the infrastructure cloud players were developing their clouds like AWS.  For the most part, they were simply integrating common-off-the-shelf (COTS) components to mimic an AWS-like environment. None of them had <a href="http://cloudscaling.com/blog/cloud-computing/amazon-web-services-rapid-release-cycle">AWS velocity</a> [2], nor were they paying attention to what made AWS special [3].  All too often, they identified ‘flaws’ in AWS that were instead unrecognized strengths. Examples of this include:</p>
<ul>
<li><a href="http://cloudscaling.com/blog/cloud-computing/what-is-amazons-secret-for-success-and-why-is-ec2-a-runaway-train">Constrained feature set</a></li>
<li>Standardized instance sizes</li>
<li>Lack of VLANs [4]</li>
<li>Ephemeral storage</li>
<li>Generic load balancing service without fancy vendor lock-in features [5]</li>
</ul>
<p>That’s when I began to understand that ‘cloud computing’ had less to do with automation or virtual machines/datacenters on demand and more to do with *how* AWS was building their infrastructure cloud.</p>
<p>Ask yourself these questions:</p>
<ul>
<li>Why isn’t VMware more successful in the public cloud space if it’s just VMs and VDCs?</li>
<li>Why isn’t there a VMware-based competitor at similar scale to AWS?  Or even close?</li>
<li>There are now 100+ “VMs on demand” competitors, but almost none have the same growth rate as AWS … why not?</li>
<li>What do the largest Internet giants (Amazon, Google, Facebook, SFDC) all have in common from an architectural standpoint and how is that different from a typical enterprise datacenter?</li>
</ul>
<p><strong>Cloud Computing vs. Enterprise Computing</strong><br />
I gave the <a href="http://vimeo.com/21372341">third opening keynote at Cloud Connect 2011</a>, behind Werner Vogels of Amazon and Lew Tucker of Cisco.  That keynote drove much of the discussion during the first day around ‘enterprise clouds’ and their viability.  In that talk was also the initial crystallization of my infrastructure cloud thinking:</p>
<p><em>We didn’t have one way to build infrastructure clouds … we had two.</em></p>
<p>One was rooted in the old modalities and thinking around existing datacenters and enterprise applications.  The other was rooted in a new way of thinking about Information Technology (IT) that uprooted every approach that had gone before.</p>
<p>Enterprise Computing applied to ‘infrastructure cloud’ [6]:</p>
<ul>
<li>How do I virtualize and manage my existing datacenter apps?</li>
<li>How do I achieve bottom line cost savings and extend server consolidation?</li>
<li>How can my existing vendors help me create a ‘private cloud’?</li>
<li>How can I be compatible with everything I own today?</li>
</ul>
<p>Cloud Computing applied to ‘infrastructure cloud’:</p>
<ul>
<li>What can we do to allow application developers to experience ‘infinite scalability’?</li>
<li>How can we simplify the allocation of traditional IT resources of networking, storage, and compute?</li>
<li>What will it take to help next generation web applications ‘scale’ by simply adding more of these IT resources?</li>
<li>How do we make it continually less expensive such that application developers can consume as much as they need?</li>
<li>How can I, the service provider, make my cost of capital equipment and operational management as low as possible so I can pass those savings on to app developers? [7]</li>
</ul>
<p>These are two very different schools of thought.  One is about saving money for existing datacenters and applications.  The other is about enabling new revenue streams via new applications and unlocking the potential for developers to add value to the business.  The starkest example of this I can think of can be found in my blog <a href="http://cloudscaling.com/blog/cloud-computing/cloud-innovators-netflix-strategy-reflects-google-philosophy">interview with Adrian Cockcroft</a>, the chief architect at Netflix on their adoption of Amazon Web Services [8].</p>
<p>A brief aside: This is why I think the <a href="http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf">NIST definition of cloud computing</a> is such a huge FAIL.  It’s focus is on the superficial aspects of ‘clouds’ without looking at the true underlying patterns of how large Internet businesses had to rethink the IT stack.  They essentially fall into the error of staying at my &#8216;Phase 2: VMs and VDCs&#8217; (above).  No mention of <a href="http://en.wikipedia.org/wiki/CAP_theorem">CAP theorem</a>, understanding the <a href="http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing">fallacies of distributed computing</a> that lead to successful scale out architectures and strategies, the core socio-economics that are crucial to meeting certain capital and operational cost points, or really any acknowledgement of this very clear divide between clouds built using existing &#8216;enterprise computing&#8217; techniques and those using emergent &#8216;cloud computing&#8217; technologies and thinking. [9]</p>
<p><strong>How 2011 Unfolded &#8230;</strong><br />
Ever since that keynote at Cloud Connect, it’s become more and more clear that not only is cloud computing a new disruptive displacement of the existing IT model (see blog link just above) in the same way that enterprise computing (aka ‘client-server’) displaced mainframe computing, but that it’s directly intersecting with other major trends in technology.</p>
<p>Infrastructure cloud computing directly intersects and either enables or works with:</p>
<ul>
<li><em>Big data</em>, the explosion of data and data processing needs</li>
<li><a href="http://www.asymco.com/2011/07/06/the-post-pc-era-will-be-a-multi-platform-era/"><em>The post-PC era</em></a>, or the notion of the rise of appliances and mobile platforms as the long term predominant platform, and the shift to ‘apps’ from ‘desktops’ [10]</li>
<li><a href="http://us.trendmicro.com/imperia/md/content/us/pdf/trendwatch/consumerization/wp2_consumerization_110510us_pdf.pdf"><em>Consumerization of IT</em></a> (TrendMicro whitepaper in PDF), or the notion that knowledge workers prefer more adaptive and flexible environments to get their work done such as they experience in their private lives with the large web application providers (Google, Amazon, Facebook, etc.)</li>
</ul>
<p>I’m probably overlooking other related trends here, but what is blindingly obvious is that all of these trends are new opportunities, not old.  Nor are they a re-hash of old opportunities.  Every single one of them are driving infrastructure cloud computing growth.  From the hidden, such as Apple’s iCloud, to the obvious, such as becoming the de facto platform for building big data or mobile app backend services.</p>
<p>As 2011 draws to a close this weekend, I’m beginning to see the upcoming ‘<a href="http://en.wikipedia.org/wiki/Hype_cycle">trough of disillusionment</a>’ or ‘<a href="http://en.wikipedia.org/wiki/Crossing_the_Chasm_(book)">chasm</a>’ as Geoffrey Moore called it.</p>
<p><strong>Writing -&gt; Wall</strong><br />
We are five years in and no one has emerged as a legitimate challenger to AWS’s market dominance.  And, frankly, none are on the horizon.  The enterprise infrastructure cloud providers I’m aware of have terminally poor growth rates (&lt;10% <a href="http://www.investopedia.com/terms/c/cagr.asp">CAGR</a> in many cases) and most of them won’t see a return on investment before they hit their five-year hardware refresh cycle.  Translation: <strong><em>these enterprise clouds are essentially net losses when evaluating them on a 5-year TCO basis</em></strong>.  The hardware itself won’t even be paid for during that time, much less the cost of operations.</p>
<p>Meanwhile, <a href="http://cloudscaling.com/blog/cloud-computing/amzn-other-revenue-in-2011">AWS will reach $1 billion in revenue this year</a> and those few that are following roughly the same trajectory as AWS have at least similar growth rates, if not scale (<a href="http://www.slideshare.net/randybias/enterprise-cloud-myths">see slide 11</a>).</p>
<p>While VCE touts $1 billion in vBlock sales [11], the onslaught of so-called ‘<a href="http://en.wikipedia.org/wiki/Shadow_IT">shadow IT</a>’ hasn’t ceased or slowed down if AWS growth is any indication.  Most of these ‘private cloud’ deployments have failed to deliver on the promise of cloud computing, hence app developers still adopt AWS in droves.  Frankly, it’s stunning how many of the Fortune 1000 are running production apps, mostly next gen web apps or re-architected versions of last gen web apps, on AWS, but won’t talk about it.</p>
<p><strong>The Road Ahead: 2012</strong><br />
In 2012, we’re going to see the gap between ‘enterprise clouds’ and ‘web-scale clouds’ widen as we enter the chasm.  At Cloudscaling we are already seeing just about everyone with an ‘enterprise cloud’ out researching ‘low cost’ alternatives.  Unfortunately, this is still missing the forest for the trees, as business agility and top-line revenue growth is a far more compelling value proposition for web-scale clouds.</p>
<p>I believe that 2012 will be a time of experimentation, learning, and quite possibly even larger ‘cloud failing’ than has gone before.  Before it can get brighter, it’s got to get darker.</p>
<p>I don’t know the ultimate solution, but one thing is for certain, we’re all going to learn a lot making it through the chasm to the other side.  The only other thing I can tell you for certain is that mimicking existing enterprise datacenters is a ‘looking back’ rather than ‘leaning forward’ strategy.</p>
<p>In this coming year I plan to spend a lot more time on this blog and in speaking engagements exploring all of  these ideas, thoughts, and revelations in more depth.</p>
<p>&#8211;Randy Bias<br />
Co-Founder &amp; CTO, Cloudscaling</p>
<hr />[1] KT’s <a href="http://cloudscaling.com/www/news-events/press-releases/kt-and-cloudscaling-launch-korea’s-first-major-private-cloud">private</a> and public compute clouds, their OpenStack storage cloud, Internap’s <a href="http://gigaom.com/cloud/first-openstack-cloud-now-open-for-business/">OpenStack storage cloud</a>, and another I can’t currently discuss.<br />
[2] By my current estimation AWS is closing out at 71 significant feature releases this year, up 5 from my estimate of 66 for 2011.  I will provide a more detailed update soon.<br />
[3] The one possible exception here is the Rackspace team who I give full props to for understanding the nature of the change and doing their best to adapt.<br />
[4] I plan to explore VLANs and the confusion there and explain why VPC is meaingful, but mostly for legacy apps in a future posting; the biggest AWS users, like Zynga and Netflix don’t use VPC or VLANs at all.<br />
[5] Surfacing vendor specific ‘features’ to differentiate your load balancing service simply provides a layer of lock-in that end-users don’t want while making your infrastructure cloud less compatible with others.<br />
[6] I strongly recommend reading Simon Wardley’s piece on <a href="http://blog.gardeviance.org/2011/02/private-vs-enterprise-clouds.html">enterprise clouds</a>.<br />
[7] If you haven’t you *really* should watch this great <a href="http://vimeo.com/32994957">video interview</a> I did with Lew Tucker, CTO of Cisco Cloud Computing on operational and capital costs for building infrastructure clouds.<br />
[8] Also be sure to watch this <a href="http://vimeo.com/32951599">video interview</a> I did with Adrian Cockcroft at CloudBeat 2011.<br />
[9] I think my posting from late 2010 on why ‘<a href="http://cloudscaling.com/blog/cloud-computing/elasticity-is-not-cloud-computing-just-ask-google">Elasticity is NOT Cloud Computing</a>’ still holds up well in this context.<br />
[10] You really should listen to this great podcast (<a href="http://5by5.tv/criticalpath/14">audio</a>, <a href="http://cloudscaling.com/blog/cloud-computing/two-disruptions-for-the-price-of-one">text summary</a>)I did with Horace Dediu of Asymco where we cover a lot of crowd in the relationship between the post-PC era and cloud computing.<br />
[11] Unfortunately, I don’t have a reference for this.  I’ve heard it ‘off the record’ from a number of sources at Cisco and VCE, but I can’t find a public reference on it.  If anyone has such a reference I would appreciate a link in the comments below.  Full credit will be provided.<br />
[Freebie] Quora question: <a href="http://www.quora.com/In-what-ways-is-AWS-better-than-most-of-its-competitors">In what ways is AWS better than it’s competitors?</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/cloud-computing-came-to-a-head-in-2011/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>AWS Dedicated Instances, Hypervisor Security, and Multi-tenancy</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/aws-dedicated-instances-hypervisor-security-and-multi-tenancy/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/aws-dedicated-instances-hypervisor-security-and-multi-tenancy/#comments</comments>
		<pubDate>Wed, 30 Mar 2011 22:55:02 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[audits]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[commoditization]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[predictions]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1831</guid>
		<description><![CDATA[Most everyone in the blog ecosystem has missed both the point and some of the economics of AWS Dedicated Instances that were recently announced.  Folks like The Register focus on how a single virtual instance can cost $109,324 for a year &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/aws-dedicated-instances-hypervisor-security-and-multi-tenancy/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Most everyone in the blog ecosystem has missed both the point and some of the economics of <a href="http://aws.amazon.com/dedicated-instances/">AWS Dedicated Instances</a> that were recently announced.  Folks like The Register focus on how a single virtual instance can cost <a href="http://www.theregister.co.uk/2011/03/29/amazon_dedicated_ec2_instances/">$109,324 for a year</a> without really understanding the positioning and value proposition of this AWS offering.  Another blog posting claims dedicated instances are &#8220;<a href="http://www.readwriteweb.com/cloud/2011/03/amazon-web-services-adds-an-un.php">Un-cloudy</a>&#8220;. Let&#8217;s be honest folks, we might be able to claim Amazon is a lot of things, but foolish or &#8216;un-cloudy&#8217; is not one of them.  Frankly, I think since AWS is pretty much driving the definition of IaaS/&#8221;infrastructure cloud&#8221; right now, calling them &#8216;Un-cloudy&#8217; is unreasonable.</p>
<p>Let&#8217;s put this all to bed right now.  We&#8217;re going to look at the issues around multi-tenancy, security, pricing, and positioning.</p>
<p><span id="more-1831"></span></p>
<p><strong>Market Positioning</strong><br />
I&#8217;ll go into depth on this in the near future as it&#8217;s tightly related to my recent <a href="http://cloudscaling.com/blog/cloud-computing/cloud-connect-2011-wrap-up">postings</a> and <a href="http://www.slideshare.net/randybias/enterprise-cloud-myths">presentations</a> on &#8216;enterprise clouds&#8217; (cloud-washed enterprise computing &amp; virtualization systems).  Right now though, the key thing to understand is that AWS is *already* in the business of servicing enterprise customers regardless of security concerns.</p>
<p>Enterprises, like most other businesses, have two key adoption types: greenfield applications and legacy applications.  Greenfield enterprise applications have been adopting AWS and other commodity clouds for some time now.  During that same time, AWS has been very busy adding enterprise friendly features to increase the ability for legacy enterprise applications to adopt EC2.</p>
<p>A great example of this is <a href="http://aws.amazon.com/vpc/">Virtual Private Cloud (VPC)</a>, which originally provided simple layer-2 VLAN/Ethernet emulation combined with a VPN termination point.  Now, as of their latest release it also allows creating complex networking topologies, just like in a traditional enterprise datacenter.</p>
<p>Dedicated Instances are yet another arrow in the AWS quiver that reduces friction for enterprise adoption of existing legacy applications.  This is an enterprise focused feature.  It reduces concerns around security of the hypervisor and &#8216;sharing&#8217;.  Reduces, not eliminates.</p>
<p>It&#8217;s also worth noting that while billed as &#8216;Dedicated Instances&#8217;, Amazon has already been effectively selling dedicated VMs/instances in their HPC offering. [1]</p>
<p><strong>Hypervisor Security</strong><br />
Whether you or I believe hypervisor security issues are relevant doesn&#8217;t matter.  Some people clearly do and not sharing the hypervisor may be a requirement in some regulatory and audit situations.  Providing customers a dedicated physical server and reducing sharing to only the network and (maybe) storage[2] is seen as a win by some security and compliance people.</p>
<p>For large enterprises, getting over that security and compliance hump is important. Frankly, my recent observation is that when a massive disruption is happening the incumbents will focus on creating Fear, Uncertainty, and Doubt (FUD) in key areas.  One is security.  Many of the threatened enterprise IT vendors specifically throw this up as a reason to avoid adopting public commodity clouds or using their same approaches to build your own cloud.  Dedicated Instances remove this obstacle.</p>
<p><strong>Multi-Tenancy</strong><br />
Perhaps the most pernicious idea out there is that this is somehow &#8216;Uncloudy&#8217; because the hypervisor is not shared.  I&#8217;m not sure how this kind of thing gets started, but at it&#8217;s roots it assumes that multi-tenancy is a core property of infrastructure clouds and that it is only achieved via the hypervisor.  Taking aside the definition of &#8216;multi-tenancy&#8217; and whether it&#8217;s a core property, it should be noted that clouds &#8216;share&#8217; many resources, of which the CPU/server is only one.  They also can share storage, networking, billing systems, etc.</p>
<p>Don&#8217;t misunderstand me.  I *do* think some kind of multi-tenancy is important, but there is a spectrum of multi-tenancy from &#8216;a little&#8217; to &#8216;a lot&#8217;.  Also, what you call a &#8216;tenant&#8217; is critical.  Finally, tenancy happens differently in SaaS from PaaS and IaaS.  The tenancy models are very very different.</p>
<p>So, let&#8217;s dig into this notion of hypervisor tenancy.  I have a couple of diagrams to show my point.  Assume we have 6 customers with 4 instances each on a cloud with 6 compute nodes.  Randomly distributed we see something roughly like this:</p>
<p style="text-align: center;"><a href="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt1.png"><img class="aligncenter size-full wp-image-1833" title="hypervisor-shuffle-pt1" src="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt1.png" alt="" width="442" height="246" /></a></p>
<p style="text-align: left;">Voila!  Multi-tenancy.  Everyone is happy.  We have a cloud, people.</p>
<p style="text-align: left;">However, if we re-shuffle these instances and &#8216;bin pack&#8217; them onto dedicated servers, we suddenly turn off the multi-tenancy:</p>
<p style="text-align: center;"><a href="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt2.png"><img class="aligncenter size-full wp-image-1834" title="hypervisor-shuffle-pt2" src="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt2.png" alt="" width="442" height="246" /></a></p>
<p style="text-align: left;">What&#8217;s different here?  Have we truly lost multi-tenancy?  Customers are no longer sharing hypervisors and nothing has changed but that we&#8217;ve reshuffled the instances.  But perhaps we haven&#8217;t lost multi-tenancy.  Networking, storage, and other resources are still shared.</p>
<p style="text-align: left;">Let&#8217;s look at a more real world example, though, since most clouds don&#8217;t run at 100% capacity[3]:</p>
<p style="text-align: center;"><a href="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt3.png"><img class="aligncenter size-full wp-image-1835" title="hypervisor-shuffle-pt3" src="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt3.png" alt="" width="442" height="246" /></a></p>
<p>Here we have a cloud running at about 75% utilization rate with instances randomly distributed.  This is in pretty good shape, but of course, all of these open &#8216;slots&#8217; aren&#8217;t generating revenue anyway.  Of course, that&#8217;s part of the business model, so no harm, no foul.</p>
<p>Time to reshuffle!</p>
<p style="text-align: center;"><a href="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt4.png"><img class="aligncenter size-full wp-image-1836" title="hypervisor-shuffle-pt4" src="http://dev.cloudscaling.com/wp-content/uploads/2011/03/hypervisor-shuffle-pt4.png" alt="" width="442" height="246" /></a></p>
<p>Right, so now we&#8217;re still running at 75% for the entire cloud, but some customers are 25% utilization for their dedicated servers, some 50%, and some 100%.  Our cloud wide efficiency hasn&#8217;t been reduced significantly, but per customer it has.  This also means that customers are going to control the efficiency rate quite a bit more we would like, holding certain physical servers to themselves if this is the same as the AWS Dedicated Instances model.</p>
<p>This is where AWS rather clever pricing comes in.  They simply charge a sort of &#8216;tax&#8217; across a single region of $10/hr if you choose to use this capability.  This tax effectively makes up for any inefficiency created by allowing customers to hold open a few more instance slots than normal.</p>
<p><strong>AWS Dedicated Instances Pricing</strong><br />
Again, the confusion on whether this feature is &#8216;cost effective&#8217; mostly comes from the Register&#8217;s <a href="http://www.theregister.co.uk/2011/03/29/amazon_dedicated_ec2_instances/">biased assessment</a> of the costs.  This feature is not targeted at individual consumers, but large enterprises looking to adopt en masse.  For such customers I&#8217;ve heard of monthly run rates between 100K-1M in usage charges.  $12M/year for a large enterprise is a drop in the bucket.  $1.2M/year doesn&#8217;t even touch the radar.</p>
<p>As an example, if a large enterprise was spending $1M/month and wants to get slightly better security, much better compliance, they would have to spend a whopping $10/hr per AWS region, roughly $7200/month[4].  That&#8217;s $86,400/year.  Let&#8217;s see, that&#8217;s an addition of .7% to their total annual spend on AWS.  Is slightly better security and ability to meet compliance standards worth &gt;1% in additional cost?</p>
<p>If that same enterprise was only spending $100K/month, then we are looking at a 7% addition to total annual spend.  I don&#8217;t know what the value is of the AWS Dedicated Instances feature is to such a large customer, but I&#8217;m certain it&#8217;s more than 1-7% addition in additional spending.  Probably much more.</p>
<p>This pricing makes AWS Dedicated Instances extremely good value for money for a large business.  Combined with the <a href="http://aws.typepad.com/aws/2011/03/new-approach-amazon-ec2-networking.html">new VPC features</a> and being able to ride <a href="http://cloudscaling.com/blog/cloud-computing/amazon-web-services-rapid-release-cycle">Amazon&#8217;s innovation curve</a>, constant <a href="http://cloudscaling.com/blog/cloud-computing/aws-price-reduction">cost reduction cycle</a>, and the other benefits of a large commodity public cloud provider, it&#8217;s hard not to find the whole offering rather compelling.</p>
<p><strong>Conclusion</strong><br />
Better security, better compliance, less impedance mismatch with legacy applications, ability to onboard enterprise customers, and still cloudy.  This is a net win for everyone involved: AWS, enterprise customers, and the cloud community as a whole.</p>
<p>BTW, there are whispers that AWS has significant amounts of other related features that will further reduce impedance mismatch with enterprise clouds.  I expect that anyone sitting on an enterprise cloud (public or private) that doesn&#8217;t have an innovation cycle matching Amazon&#8217;s is going to get run over in the next year or two.  More from us on remaining competitive soon, though.</p>
<p><strong>UPDATE</strong>: Cleaned up some language and fixed some typos.</p>
<hr />[1] I know this because the <a href="http://aws.amazon.com/ec2/hpc-applications/">AWS HPC GPU offering</a> provides 2xNvidia GPUs for advanced HPC use cases; you can only have 2 GPUs in a single box because server boards only have 2 PCI-E x16 slots; in addition, each HPC system gets 8 full Nehalem cores and AWS is known not to oversubscribe cores.<br />
[2] Ephemeral storage on instances is not shared.  Only Elastic Block Storage (EBS) is shared.  So it&#8217;s really your call on whether or not you share disk or not.<br />
[3] As I said before, most target about 80% utilization rates.  Anything under 70% sinks the business model.<br />
[4] I&#8217;m glossed over the additional per instance fee here for brevity&#8217;s sake.  It doesn&#8217;t change the numbers significantly.  It&#8217;s still a nominal increase in costs for a significant increase in value no matter how you slice it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/aws-dedicated-instances-hypervisor-security-and-multi-tenancy/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Cloud Innovators: Netflix Strategy Reflects Google Philosophy</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/cloud-innovators-netflix-strategy-reflects-google-philosophy/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/cloud-innovators-netflix-strategy-reflects-google-philosophy/#comments</comments>
		<pubDate>Tue, 23 Nov 2010 16:00:22 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[netflix]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1539</guid>
		<description><![CDATA[Welcome to a new interview series we’re trying on at Cloudscaling. This series is meant to highlight not just cloud adoption stories, but stories about how businesses, particularly enterprise businesses, are changing the way they provide Information Technology (IT). As &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/cloud-innovators-netflix-strategy-reflects-google-philosophy/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Welcome to a new interview series we’re trying on at Cloudscaling.  This series is meant to highlight not just cloud adoption stories, but stories about how businesses, particularly enterprise businesses, are changing the way they provide Information Technology (IT).  As long time followers of the Cloudscaling blog know, our view is that “cloud computing” is neither <a href="http://cloudscaling.com/blog/cloud-computing/the-cloud-is-not-outsourcing">outsourcing</a> (2009) or <a href="http://cloudscaling.com/blog/technology/virtualization-is-not-the-answer-for-clouds">virtualization-on-demand</a> (2008), but something far more subtle and far-reaching:  a <a href="http://cloudscaling.com/blog/cloud-computing/elasticity-is-not-cloud-computing-just-ask-google">20-year shift in the way that all IT is provided</a>, much like the transition from mainframe computing to enterprise computing (aka ‘client-server computing’).</p>
<p>Just like when we transitioned from mainframes, a *lot* of change is going to happen.  In order to highlight this change, we wanted to start a series that really shows what is possible if you fully embrace cloud computing.</p>
<p>Our first interview is with <a href="http://perfcap.blogspot.com">Adrian Cockcroft</a>, Cloud Architect at Netflix.  Recently Netflix announced something incredible: their 24x7x365 video-on-demand streaming service, the largest of its kind in the world, had moved from their own datacenters onto Amazon Web Services EC2.  See this <a href="http://www.slideshare.net/adrianco/netflix-on-cloud-combined-slides-for-dev-and-ops">QCon presentation</a> from Adrian on the technical details.</p>
<p>This interview, however, takes a different tack and looks at why Netflix chose to take this route, examining business drivers, and asking “what do you tell others who wish to follow in your footsteps?”</p>
<p><strong>The Interview</strong></p>
<p><strong>RLB:</strong> <em>Could you describe at a high level what Netflix is doing on AWS?</em></p>
<p><strong>AC:</strong> Encoding movies for streaming, log analysis, production web site and API, most everything that scales with customers and streaming usage. Easier to say what we don’t have there: most internal IT that scales with employee count, legacy stuff, DVD shipping systems, account sign-up and billing systems. We use Akamai, Limelight and Level3 CDN’s for streaming the movies, which is a cloud based service. There is an AWS CDN service, but they aren’t a big enough player in this space at this point.</p>
<p><strong>RLB:</strong> <em>What was the key business driver that led to deciding to use AWS rather than your own datacenters?</em></p>
<p><strong>AC:</strong> Business agility and the inability to predict how much capacity we would need when and where for a business whose growth is accelerating. Year on year customer growth is 52% (up from 42% last quarter), year on year customers using streaming is up 145% (from ~4M to ~11M). We need to be a small-ish fish in a big AWS pond to leverage AWS investment in datacenter build-out, rather than building our own pond.</p>
<p><strong>RLB:</strong> <em>Many folks claim that they can deliver a private cloud at a similar price point to AWS. I assume you ran the numbers yourself.  In whatever detail you can share, what does the ROI look like for Netflix?</em></p>
<p><strong>AC:</strong> Our datacenter runs Oracle on IBM hardware, we could have switched to commodity hardware in a datacenter, but skipped that step by going to AWS. There are three points on cost, one is that Oracle on IBM is very expensive, so AWS looks cheap in comparison, and we have flat-lined our datacenter capacity.</p>
<p>The second point is that AWS costs are fully burdened, and we could not have hired enough SAs and DBAs to build out our own datacenter this fast. We have added 4-5x as many systems in the cloud as the total we have in our datacenter over the last year.</p>
<p>The third point is that the costs are elastic, you start paying for a resource just before it goes live, and if you stop using a resource you stop paying for it. If you own a resource it sits around a long time waiting to be delivered and installed, and if you no longer want to use that type of resource you are still paying for it for three years. When Amazon cuts prices, your installed capacity gets cheaper. When they install new instance types you can be running on them in hours, technology refresh in real time.</p>
<p>I don’t believe private clouds can compete with public on price, however if you have a bunch of empty datacenter space or want to re-organize your internal systems to be automated and API driven then there are real cost savings to building your own private cloud. I think VMware and Microsoft are going to own the private cloud space, but Amazon is going to continue to disrupt both of them at a lower price point for public cloud.</p>
<p><strong>RLB:</strong> <em>You have a great presentation on SlideShare that describes your development and operations approach.  You were willing to rethink your entire approach to the Netflix system.  Why was your business leadership willing to take on this kind of risk/investment?</em></p>
<p><strong>AC:</strong> That’s the way they roll&#8230; The impetus to do this came from the top down. The leadership saw that the biggest technology risk we had was lack of agility, and wanted to invest in our ability to move faster than our competitors without being held back by scale (e.g. running out of datacenter capacity) or technical debt issues. The leadership had to drag the engineers along to some extent, wean them off SQL and established habits and processes.</p>
<p><strong>RLB:</strong> <em>Our belief is that &#8216;cloud computing&#8217; is a new way of providing IT that displaces &#8216;enterprise computing&#8217; as enterprise computing displaced &#8216;mainframe computing&#8217;.  Your work seems to epitomize this. What would you say to folks who are willing to re-architect/re-engineer their applications for this new world?  What are the top 3 challenges they need to consider?</em></p>
<p><strong>AC:</strong> The key challenge is to get into the same mind-set as the Google’s of this world, the availability and robustness of your apps and services has to be designed into your software architecture, you have to assume that the hardware and underlying services are ephemeral, unreliable and may be broken or unavailable at any point, and that the other tenants in the multi-tenant public cloud will add random congestion and variance. In reality you always had this problem at scale, even with the most reliable hardware, so cloud ready architecture is about taking the patterns you have to use at large scale, and using them at a smaller scale to leverage the lowest cost infrastructure.</p>
<p>The second challenge is mostly political, an enterprise CIO and IT/ops department would much rather build datacenters and private clouds than become irrelevant to their customers. However while they are struggling to get their private clouds to work, their end users will be using AWS and setting a fully burdened price point expectation that IT cannot get near to. I have heard of internal chargeback for resources being an order of magnitude higher than using AWS for the same kind of resources [1].</p>
<p>The third challenge is the obvious one that the organizations that run compliance audits are working to a rule-sheet that has no concept of cloud, and it will take a while for some applications and industries (like finance) to get permissions and processes in place. That’s a when not an if.</p>
<p><strong>RLB:</strong> <em>I&#8217;ve heard from folks who think they are in the know that enterprises aren&#8217;t adopting AWS, yet Netflix, a mid-tier enterprise now has &#8216;thousands&#8217; of servers running its most mission critical service on AWS.  What do you say to these folks?</em></p>
<p><strong>AC:</strong> Netflix is pathfinding, others will follow, and tiny startups in the cloud can become very big very quickly like  Zynga, who are now one of the biggest games companies, from nowhere. Enterprises that don’t leverage the agility, scale and cost benefits of public cloud will lose out to those that do. For me, if its not multi-tenant, it’s not cloud, and only the biggest organizations should be building datacenters to host clouds, and they should be offering them publicly. If you are doing internal cloud and you have a dominant internal customer then you are doing it wrong, because you have to choose between baking in a lot of unused extra capacity or the risk that at some point that customer will blow up your cloud.</p>
<p><strong>Cloudscaling Perspective</strong><br />
For us, the incredible lesson with Netflix has been how much value can be had in public clouds if one fully embraces the model.  We see several kinds of failures to embrace cloud computing that parallel Adrian’s notes above:</p>
<ol>
<li>Failure of understanding (“it’s just [mainframes|service bureaus|outsourcing|virtualization]”)</li>
<li>Failure to re-engineer/re-architect applications</li>
<li>Focus on building simplistic internal ‘private clouds’ (aka “your mess for less; only virtualized!”)</li>
</ol>
<p>With understanding, the problem is the same as with the transition from the mainframe.  Mainframe operators and the vendors of that time did not want to believe things would change as dramatically as they did.  Many of them also looked at enterprise computing as a sort of ‘fad’ instead of seeing the underlying drivers at the time related to flexibility, agility, and expense (sound familiar?).</p>
<p>As far as re-engineering applications goes, we still see clients today asking us to build ‘five 9’ infrastructure clouds to support legacy enterprise applications they don’t want to re-architect.  Few of them understand what that means or how expensive a 99.999% uptime cloud infrastructure would be (hint: 10-100x the cost of AWS most likely).  I have seen so-called “private cloud” deployments where the 5-year TCO of a single virtual server ran close to USD $20,000.  Why virtualize in the first place if the resulting costs are similar to those of running physical servers?</p>
<p>Which brings me to that last point: private clouds designed to accommodate all of the “needs” of enterprise customers aren’t true clouds at all, but simply “your mess for less.”  Google can run 10,000 servers for a single headcount because their architecture is horizontal, homogeneous, and commoditized.  The worst enterprise clients we have seen manage as few as *5* servers per headcount with the average at *100*.  This is largely because of enterprise applications and legacy network architectures that require that the infrastructure is mapped to the applications instead of the other way around.  This process combined with heavily silo’ed teams and application architectures results in the operational costs overhead being enormous.</p>
<p><strong>Summary</strong><br />
Adrian and other Cloud Innovators have proven that designing for cloud scale, means designing for failure and the same ‘always-on’ software architecture of an Amazon.com or Google, *not* something as simplistic as virtualizing your existing legacy enterprise applications and moving them over to someone else’s cloud.  It’s going to take a while for the market to wake up to this principle and we’ll see lots of failures or as I like to say “blood on the floor” over the next few years.</p>
<p>Those who embrace the disruption and use it to their advantage will win as the landscape shifts.  Those who do not will be on the receiving end.</p>
<p style="text-align: center;"><em>Disrupt.  Don’t be Disrupted.</em></p>
<p style="text-align: center;">
<hr />[1] This is a number that the Cloudscaling team has confirmed with a number of large enterprise customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/cloud-innovators-netflix-strategy-reflects-google-philosophy/feed/</wfw:commentRss>
		<slash:comments>51</slash:comments>
		</item>
		<item>
		<title>Grid, Cloud, HPC &#8230; What&#8217;s the Diff?</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/grid-cloud-hpc-whats-the-diff/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/grid-cloud-hpc-whats-the-diff/#comments</comments>
		<pubDate>Thu, 18 Nov 2010 19:22:42 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[commoditization]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[grid]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hpc]]></category>
		<category><![CDATA[hsc]]></category>
		<category><![CDATA[iaas]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[scaling]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1517</guid>
		<description><![CDATA[It&#8217;s always nice when another piece of the puzzle comes into focus.  In this case, my time speaking at the first ever International Super Computer (ISC) Cloud Conference the week before last was well spent.  The conference was heavily attended &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/grid-cloud-hpc-whats-the-diff/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s always nice when another piece of the puzzle comes into focus.  In this case, my time speaking at the first ever International Super Computer (ISC) Cloud Conference the week before last was well spent.  The conference was heavily attended by those out of the <a href="http://en.wikipedia.org/wiki/Grid_computing">grid computing</a> space and I learned a lot about both cloud and grid.  In particular, I think I finally understand what causes some to view grid as a pre-cursor to cloud while others view it as a different beast only tangentially related.</p>
<p>This really comes down to a particular TLA in use to describe grid: <a href="http://en.wikipedia.org/wiki/High-performance_computing">High Performance Computing</a> or HPC.  HPC and grid are commonly used interchangeably.  Cloud is not HPC, although now it can certainly support some HPC workloads, née <a href="http://aws.amazon.com/hpc-applications/">Amazon&#8217;s EC2 HPC offering</a>.  No, cloud is something a little bit different:  High Scalability Computing or simply HSC here.</p>
<p>Let me explain in some depth &#8230;</p>
<p><span id="more-1517"></span></p>
<p><strong>Scalability vs. Performance</strong><br />
First it&#8217;s critical for readers to understand the fundamental difference between <a href="http://en.wikipedia.org/wiki/Scalability">scalability</a> and <a href="http://en.wikipedia.org/wiki/Computer_performance">performance</a>.  While the two are frequently conflated, they are quite different.  Performance is the capability of particular component to provide a certain amount of capacity, throughput, or &#8216;yield&#8217;.  Scalability, in contrast, is about the ability of a system to expand to meet demand.  This is quite frequently measured by looking at the aggregate performance of the individual components of a particular system and how they function over time.</p>
<p>Put more simply, performance measures the capability of a single part of a large system while scalability measures the ability of a large system to grow to meet growing demand.<br />
Scalable systems may have individual parts that are relatively low performing.  I have heard that the Amazon.com retail website&#8217;s web servers went from 300 transactions per second (TPS) to a mere 3 TPS each after moving to a more scalable architecture.  The upside is that while every web server might have lower individual performance, the overall system became significantly more scalable and new web servers could be added ad infinitum.</p>
<p>High performing systems on the other hand focus on eking out every ounce of resource from a particular component, rather than focusing on the big picture.  One might have high performance systems in a very scalable system or not.</p>
<p>For most purposes, scalability and performance are orthogonal, but many either equate them or believe that one breeds the other.</p>
<p><strong>Grid &amp; High Performance Computing</strong><br />
The origins of HPC/Grid exist within the academic community where needs arose to crunch large data sets very early on.  Think satellite data, genomics, nuclear physics, etc.  Grid, effectively, has been around since the beginning of the enterprise computing era, when it became easier for academic research institutions to move away from large mainframe-style supercomputers (e.g. Cray, Sequent) towards a more scale-out model using lots of relatively inexpensive x86 hardware in large clusters.  The emphasis here on *relatively*.</p>
<p>Most x86 clusters today are built out for <a href="http://www.top500.org/">very high performance *and* scalability</a>, but with a particular focus on performance of individual components (servers) and the interconnect network for reasons that I will explain below.  The price/performance of the overall system is not as important as aggregate throughput of the entire system.  Most academic institutions build out a grid to the full budget they have attempting to eke out every ounce of performance in each component.</p>
<p>This is not the way that cloud pioneers such as Amazon.com and Google built their infrastructures.</p>
<p><strong>Cloud &amp; High Scalability Computing</strong><br />
Cloud, or HSC, by contrast, focuses on hitting the price/performance sweet spot, using truly commodity components and buying *lots* more of them.  This means building very large and scalable systems.</p>
<p>I was surprised at the ISC Cloud Conference when I heard one participant bragging about their cluster with 320,000 &#8216;cores&#8217;.  Amazon EC2 (sans the new HPC offering) is at roughly 500,000 cores, quite possibly more.  And Google is probably in the order of 10 million+ cores.  Clouds built around High Scalability Computing are an order of magnitude larger than most grid clusters and designed to handle generic workloads, requiring hitting the price/performance sweet spot when building them.</p>
<p>Grid workloads can be very, very different.</p>
<p><strong>Some Grid Workloads Drive the Grid Community</strong><br />
In talking to the grid community I learned that there are effectively two key types of problem that are solved on large scale computing clusters: MPI (<a href="http://en.wikipedia.org/wiki/Message_Passing_Interface">Message Passing Interface</a>) and &#8216;embarrassingly parallel&#8217; problems.  I&#8217;m using terms I heard at the conference, but will use MPI and EPP (embarrassingly parallel problem) so that I can shorthand throughout the rest of this article.</p>
<p>MPI is essentially a programming paradigm that allows for taking extremely large sets of data and crunching the information in parallel WHILE sharing the data between compute nodes. Some times this is also referred to as &#8216;clustering&#8217;, although that term is frequently overloaded today.  Certain kinds of problems necessitate this sharing as the computed results on one node may effect the computed results on another node in the grid.  MPI-based grids, the de facto standard for most academic institutions, are built to maximum throughput and performance per system, including the lowest latency possible.  Most of them use Infiniband technology for example to effectively turn the entire grid into a single &#8216;<a href="http://en.wikipedia.org/wiki/Supercomputer">supercomputer</a>&#8216;.  In fact, most of these MPI-based grids are ranked into the Supercomputer <a href="http://en.wikipedia.org/wiki/TOP500">Top500</a>.</p>
<p>An MPI grid/cluster, in many ways, looks more like an old school mainframe and technology such as Infiniband essentially turns the network into a high-speed bus, just like a PCI bus inside a typical x86 server.</p>
<p>EPP workloads, by contrast, have no data sharing requirements.  A very large dataset is chopped into pieces, distributed to a large pool of workers, and then the data is brought back and reassembled.  Does this sound familiar?  It should, it&#8217;s very similar to Google&#8217;s <a href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> functionality and the open source tool, Hadoop.  EPP workloads are very commonly run on top of MPI clusters, although some academic institutions build out separate or smaller grids to run them instead.</p>
<p>The majority of grid workloads are of the EPP type.  The diagram below shows this.</p>
<p><a href="http://cloudscaling.com/wp-content/uploads/2010/11/hpc-vs-hsc-pyramid.png"><img class="aligncenter size-full wp-image-1518" title="hpc-vs-hsc-pyramid" src="http://cloudscaling.com/wp-content/uploads/2010/11/hpc-vs-hsc-pyramid.png" alt="" width="313" height="315" /></a></p>
<p>I had one person confide in me that &#8220;<em>MPI power users drive grid requirements for vendors and assume that if their problems are solved, then the problems of [EPP] users are solved.</em>&#8221;<br />
This is interesting since these two types of workloads have different needs.</p>
<p><strong>HPC vs. HSC</strong><br />
The reality is that High Scalability Computing is ideal for the majority of EPP grid workloads.  In fact, large amounts of this kind of work, in the form of MapReduce jobs have been running on Amazon EC2 since its beginning and have driven much of its growth.</p>
<p>HPC is a different beast altogether as many of the MPI workloads require very low latency and servers with individually high performance.  It turns out however, that all MPI workloads are not the same.  The lower bottom of the top part of that pyramid is filled with MPI workloads that require a great network, but not an Infiniband network:</p>
<p><a href="http://cloudscaling.com/wp-content/uploads/2010/11/hpc-vs-hsc-pyramid-mpi-high-latency.png"><img class="aligncenter size-full wp-image-1519" title="hpc-vs-hsc-pyramid-mpi-high-latency" src="http://cloudscaling.com/wp-content/uploads/2010/11/hpc-vs-hsc-pyramid-mpi-high-latency.png" alt="" width="500" height="284" /></a></p>
<p>In keeping with Amazon Web Service&#8217;s tendency to build out using commodity (cloud) techniques, their new HPC offering does not use Infiniband, but instead opts for 10Gig Ethernet.  This makes the network great, but not awesome and allows them to create a cloud service tailored for many HPC jobs.  In fact, this <a href="http://blog.cyclecomputing.com/2010/11/a-couple-more-nails-in-the-coffin-of-the-private-compute-cluster-gpu-on-cloud.html">recent benchmark posting</a> by CycleComputing shows that AWS&#8217; Cloud HPC system has impressive performance particularly for many MPI workloads.</p>
<p>HSC designed to accommodate HPC!</p>
<p>Which brings us back.</p>
<p><strong>The Moral of the Story</strong><br />
So, what we have learned is that scalable computing is different from computing optimized for performance.  That cloud can accommodate grid *and* HPC workloads, but is not itself necessarily a grid in the traditional sense.  More importantly, an extremely overlooked segment of grid (EPP) has pressing needs that can be accommodated by run-of-the-mill clouds such as EC2.  In addition to supporting EPP workloads that run on the &#8216;regular&#8217; cloud some clouds may also build out an area designed specifically for &#8216;HPC&#8217; workloads.</p>
<p>In other words, grid is not cloud, but there are some relationships and there is obviously a huge opportunity for cloud providers to accommodate this market segment.  At least, Amazon is spending 10s of Millions of dollars to do so, so why not you?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/grid-cloud-hpc-whats-the-diff/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>@Cloudscaling CEO @randybias on #VMworld &amp; #Interop 2010</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/cloudscalings-ceo-randybias-on-vmworld-interop-2010/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/cloudscalings-ceo-randybias-on-vmworld-interop-2010/#comments</comments>
		<pubDate>Mon, 25 Oct 2010 18:42:24 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[Automation]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[commoditization]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[scaling]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1478</guid>
		<description><![CDATA[During my most recent trip I was speaking at both VMworld Europe 2010 and Interop NYC 2010 &#8211; Enterprise Cloud Summit. This update attempts to provide a candid look at some of the trends, thoughts, and insights that occurred to &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/cloudscalings-ceo-randybias-on-vmworld-interop-2010/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>During my most recent trip I was speaking at both <a href="http://www.vmworld.com/community/conferences/europe2010/">VMworld Europe 2010</a> and <a href="http://www.interop.com/newyork/conference/overview.php">Interop NYC 2010 &#8211; Enterprise Cloud Summit</a>. This update attempts to provide a candid look at some of the trends, thoughts, and insights that occurred to me while engaging with customers, vendors, and the greater cloud community at these two events.</p>
<p>Here I will briefly cover the following points:</p>
<ul>
<li>Disconnects in Telco/SP Cloud Strategy</li>
<li>‘Hybrid Cloud’ Still Causing Confusion</li>
<li>Public Cloud Hits a Tipping Point?</li>
<li>Enterprise IT Governance</li>
</ul>
<p><strong>Telco/SP Enterprise Cloud Strategy Doomed to Failure?</strong><br />
How many of the large telecommunications and service providers have ‘enterprise’ cloud strategies? Their basic strategy boils down to:</p>
<ol>
<li>Deploy an ‘enterprise cloud’, usually with VMware</li>
<li>Farm current customer base for cloud customers</li>
<li>Create a suite of services around the cloud offering that make it compelling</li>
</ol>
<p>In talking to folks at VMware and various telcos and service providers, I heard a tremendous amount of focus on #3.  Many times I heard that &#8220;<em>infrastructure is just a commodity</em>” and “w<em>e’ll compete by providing value-added services.</em>”  This is clearly a sound strategy, except that most of these providers are <strong>not</strong> building commodity infrastructure.  Most enterprise clouds have a very expensive cost-basis for their build-outs.  The most stark difference can be seen with <a href="http://www.emc.com/campaign/global/vce/index.htm">VCE Vblocks</a> clouds, which are almost 10x the cost of a commodity cloud[1].</p>
<p>Taking aside a number of the other issues with these enterprise clouds[2], how can you sell enough value-added services on top of a 10x more expensive infrastructure solution to make up the difference?  The answer is that you can’t and most telcos and SPs with this strategy will eventually have to face the math.</p>
<p>Related to this, #2 above seems to have telcos and SPs focusing on protecting existing customers.  How much more failed can a cloud strategy be than if it’s defensive?  The only good strategy here in response to the Amazon juggernaught is a frontal assault using those assets and capabilities you have and Amazon does not.</p>
<p>It’s interesting that even VMware, <a href="http://cloudscaling.com/blog/cloud-computing/vmware-vs-amazon-round-one-fight">probably Amazon’s key competitor</a>, is preaching along with EMC, a ‘Journey to the Private Cloud’ and <a href="http://www.redmonk.com/jgovernor/2010/10/22/vmware-ceo-django-rails-open-frameworks-apps-as-commodity-and-the-new-kingmakers">pounding the pulpit</a> for enabling developers.  This quote from Paul Maritz, CEO of VMware is choice:</p>
<hr />
<blockquote><p><em>In the final analysis they [purchasers] are not the people making strategic decisions in the business. Developers have always been at the leading edge, because that’s where business value is generated. Things that don’t differentiate you at a higher level will be SaaS apps – which will also be purchased at a higher level. The differentiated stuff you have to do yourself, and that means software development”.</em></p></blockquote>
<hr />Most of the current VMware-based enterprise clouds are simply trying to sell a pure virtualization outsourcing play to IT admins of their existing customer base.  If a developer friendly strategy is working for Amazon and VMware is pushing a similar vision, then it’s incumbent on those who want to be successful in the emerging cloud computing space to think about where developer enablement fits in their strategy.</p>
<p>I very much hope that telcos and SPs will start to develop some strategies and cloud solutions that are ultimately competitive.  The worst thing that can happen here is to have a GOOG/AMZN duopoly. (Please see my <a href="http://cloudscaling.com/blog/cloud-computing/rumor-mill-google-ec2-competitor-coming-in-2010">earlier post on the rumor of Google launching their own EC2-like service</a>.)</p>
<p><strong>‘Hybrid Cloud’ Confusion</strong><br />
In a panel at Interop on adoption of public clouds by enterprise customers there was a heated debate about the meaning of ‘hybrid cloud’.  This debate, mostly between myself and Paddy Srinivasan of <a href="http://www.cumulux.com">Cumulux</a>, was helpful for attendees, although as in most conversations of this type there is danger of devolving into an argument on semantics.  I made some pretty strong assertions about the general lack of usefulness in <strong>any</strong> context of the term ‘hybrid cloud’[3].  Essentially I simply reprised my posting from February this year: <a href="http://cloudscaling.com/blog/cloud-computing/hybrid-clouds-are-half-baked">Hybrid Clouds are Half-baked</a>.</p>
<p>Why stick on this?  For me, this is a question of straight talk.  We all live with the confusion of ‘cloud’ every day in our work, but when vendors use the term as something new to denote simple <a href="http://en.wikipedia.org/wiki/Service-oriented_architecture">Service-Oriented Architecture</a> (SOA) or the joining of two clouds (aka ‘cloud bridging’), that muddies the conversation further. The arbitrary creation of fuzzy marketing terms and pretending as if they have meaning does a disservice to all those who are trying to understand how to move forward in this new world.  Even the Wikipedia entry for <a href="http://en.wikipedia.org/wiki/Cloud_computing#Hybrid_cloud">Hybrid Cloud</a> is a mess.</p>
<p>Another key reason to push on avoiding this term is that it ‘over promises’ on cloud.  Most companies don’t need a hybrid solution at the moment.  Certainly, some services, like identity management and authentication need to ‘bridge’ the firewall, but a single app doesn’t need to exist in both places nor does it need to move back and forth.  Neither do virtual machines (VM).  In fact, if you are following best practices using tools like <a href="http://incubator.apache.org/libcloud/">libcloud</a>, <a href="http://github.com/geemus/fog">fog</a>, or <a href="http://www.jclouds.org/">jclouds</a> to manage instances and <a href="http://www.opscode.com/chef">Chef</a> and <a href="http://www.puppetlabs.com/">Puppet</a> to package your app deployments, then you can deploy to any cloud on demand.  This approach makes far more sense than trying to move large VMs back and forth across wide area networks.</p>
<p>It’s just like buying two Internet connections from two separate ISPs, which certainly isn’t a ‘hybrid network’.  Using multiple clouds is a best practice enabled by proper tooling that increases portability &amp; interoperability while reducing risk, not some kind of &#8216;hybrid&#8217;.</p>
<p><strong>Public Cloud Hitting the Big Time?</strong><br />
Just before the enterprise public cloud adoption panel <a href="http://www.linkedin.com/pub/brian-butte/1/767/462">Brian Butte</a>, the moderator, informed me that a poll of the audience had found that 95% of the enterprise attendees were using public clouds or planning on it.  A stark turn around from the beginning of the year when most were focused on private cloud development.</p>
<p>This parallels our experience that most enterprises will ‘fail forward’ trying to build private clouds.  By fail forward, we mean here that IT departments will attempt to deploy highly automated virtualization systems thinking they are private clouds, but not hitting the mark.  As Nick Carr <a href="http://hbswk.hbs.edu/archive/4137.html">pointed out</a> earlier in <a href="http://books.google.com/books?id=wrROE6SLJFEC&amp;pg=PA111&amp;lpg=PA111&amp;dq=9%25+of+large+IT+projects+fail+does+IT+matter&amp;source=bl&amp;ots=hv-k0_2b3i&amp;sig=TM879MDB4wtVSebnsGjn2ZtFFdg&amp;hl=en&amp;ei=vMzFTIKFI4K0lQfF0ZwD&amp;sa=X&amp;oi=book_result&amp;ct=result&amp;resnum=4&amp;ved=0CC4Q6AEwAw#v=onepage&amp;q&amp;f=false">Does IT Matter</a>:</p>
<hr />
<blockquote><p><em>Of the more than eight thousand systems projects Standish examined, only 16 percent were considered successes—completed on time and on budget and fulfilling the original specifications. Nearly a third were canceled outright, and the remainder all went over budget, off schedule, and out-of-spec. <strong>Large companies—those with more than $500 million in annual sales—did even worse than the average: Only 9 percent of their IT projects succeeded.</strong> [ed. emphasis mine]</em></p></blockquote>
<hr />As I said in my Interop keynote presentation on Monday for the private cloud track of the Enterprise Cloud Summit, it’s not a real private cloud unless it’s built like a public cloud.  Most enterprise IT folks have little idea how to move forward, either culturally or technologically in building a true private cloud.  How much worse is the 9% success rate likely to be in this case?</p>
<p>So what we’re probably seeing now is that enough early attempts at private cloud have failed or have been so slow that business unit owners are pushing for public cloud solutions and demanding immediate success.</p>
<p><strong>Enterprise IT as Governor, not Control-Freak</strong><br />
I hit on this point repeatedly, but I also heard it from a number of other folks.  We’re clearly moving into a world of mixed IT capacity.  Some will be onsite and much will eventually be offsite and run by a multiplicity of cloud vendors; infrastructure, platforms, <strong>and</strong> applications.  In this new world, it’s more important for enterprise IT to provide governance rather than direct control.  This is very similar to how modern manufacturing or facilities management works.</p>
<p>Apple, Inc, for example, does not manufacture it’s hardware.  Instead, this key capability is outsourced, yet Apple has become an expert in managing a large extended supply chain of vendors and ultimate responsibility for delivering high quality hardware goods.</p>
<p>Similar to the process of managing global manufacturing relationships, I predict enterprise IT will shift to spend more time governing a large supply chain, not running each individual solution themselves.</p>
<p><strong>Wrap-Up</strong><br />
All of this just further reinforces my thinking about Cloudscaling’s general approach to the market place.  We want to help telcos and service providers compete with AMZN/GOOG, while helping enterprises to understand how to embrace and manage the transition to cloud computing.  We think this means that telcos and service providers have inherent advantages that AMZN/GOOG can’t compete with.  Unfortunately, when these advantages are coupled with expensive ‘enterprise cloud’ infrastructure, much of the potential competitive opportunity is lost.</p>
<p>Imagine instead, that the inherent advantages of a large telco, such as geographical dominance, access to wireless networks, advanced networking capabilities such as MPLS networks, cheap IP backbones, were coupled with a cost-competitive Amazon EC2-like cloud infrastructure.  This ‘consumer cloud’ infrastructure combined with the telco’s natural advantages makes for a formidable competitive advantage in picking up all of the new cloud apps, particularly those that service mobile device developers.</p>
<p>When looking at how the marketplace appears to be unfolding, it seems clear that enterprises will continue to be confused with hype and promises such as that of ‘hybrid clouds’, miss the boat on delivering in the short term, pushing IT departments into adopting public cloud solutions whether they like it or not.  It also seems clear that servicing these new cloud apps is also critical.</p>
<p>To us, this means it’s important to have *both* an enterprise cloud to capture and retain existing enterprise customers while also deploying a low-cost commodity cloud, built like Amazon and Google, targeted at consumers, developers, and new cloud applications.  Particularly those apps that drive the burgeoning smart phone market and play to any telcos inherent strengths.</p>
<hr />[1] I have a detailed analysis in the pipeline and I will be showing some of these numbers in upcoming presentations at conferences as well.<br />
[2] We’ll cover this in much more depth in the near future.<br />
[3] This will likely cause some negative feedback from legacy enterprise vendors whose marketing folks use this term extravagantly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/cloudscalings-ceo-randybias-on-vmworld-interop-2010/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Does OpenStack Change the Cloud Game?</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/does-openstack-change-the-cloud-game/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/does-openstack-change-the-cloud-game/#comments</comments>
		<pubDate>Tue, 20 Jul 2010 19:01:46 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[Cloud Standards]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[Community]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenStack]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1180</guid>
		<description><![CDATA[This week Rackspace Cloud, in conjunction with the NASA Nebula project, open sourced some of their Infrastructure-as-a-Service (IaaS) cloud software. This initiative, dubbed &#8216;OpenStack&#8217;, should have a dramatic impact on the current dynamics for building cloud computing infrastructure. Previously there &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/does-openstack-change-the-cloud-game/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This week Rackspace Cloud, in conjunction with the NASA Nebula project, open sourced some of their Infrastructure-as-a-Service (IaaS) cloud software.  This initiative, dubbed <a href="http://www.openstack.org/">&#8216;OpenStack&#8217;</a>, should have a dramatic impact on the current dynamics for building cloud computing infrastructure.  Previously there have been two major camps: Amazon API and architecture compatible and VMware&#8217;s vCloud.  Now there is a third alternative that could not only be a viable alternative to these two approaches, but more importantly, a fantastic option for service providers and telecommunications companies that face unique challenges.</p>
<p>Let&#8217;s dive in and I&#8217;ll explain.</p>
<p><strong>Cloud Stack Evolution &amp; &#8216;Camps&#8217;<br />
</strong>Amazon Web Services (AWS) spawned a huge ecosystem of knock-offs, management systems, tools, and vendors.  They include, but aren&#8217;t limited to:</p>
<ul>
<li>AWS API compatible &#8216;cloud stacks&#8217; including Eucalyptus, Open Nebula, and others</li>
<li>Cloud management systems for the AWS APIs and services such as RightScale and enStratus</li>
<li>Cloud services layered on top of AWS services such as Jungle Disk (S3), Heroku (S3, EBS, EC2), and more</li>
</ul>
<p>Prior, I wouldn&#8217;t have called the AWS ecosystem a &#8216;camp&#8217; per se, but if you read our most recent article on Google&#8217;s foray into cloud storage, you know that it seems likely they will provide a 100% compatible version of S3 and EC2 this year.  Imagine the impact of Google Compute &amp; Storage with Amazon Web Services compatible APIs.  Already the Google Storage API is nearly 100% compatible with S3.</p>
<p>Together, as a block, Amazon and Google could create a de facto duopoly for infrastructure clouds, which isn&#8217;t good for anyone.  We need competition and more than two major players.</p>
<p>Up against the Amazon camp is VMware. In my article on Amazon vs. VMware last year I highlighted how these two businesses were on a collision course. Nothing has changed and competition is mounting between them.  The reason is that telcos and service providers are under increasing threat from Amazon and soon Google. They need viable solutions and VMware is attempting to provide a competitive ecosystem.</p>
<p>The VMware cloud initiative, vCloud, is designed to arm enterprises and service providers to be competitive, but has not quite delivered yet. VMware has had a number of problems providing a full cloud stack. The software, now in beta, is codenamed &#8216;Redwood&#8217; has had significant delays in getting to market.  Their strategy for cloud infrastructure does not appear unified outside of delivering compute virtualization.</p>
<p>VMware, as a business, understands they need to make their customers competitive. They have made a number of strategic open source acquisitions such as SpringSource,  RabbitMQ, and Redis. There are also murmurings that they have some special projects inside that are &#8216;up the stack&#8217; from their virtualization offerings.  In total this shows that VMware &#8216;gets it&#8217; in that they want to create a competitive ecosystem.  While each of these is currently a point solution, there is yet to be a coherent story here.  Can VMW build a consistent story and strategy around these disparate pieces?  Only time will tell&#8230;</p>
<p>Besides these two camps, there is a long tail of clouds running various frameworks vying to establish themselves such as Cloud.com&#8217;s CloudStack, 3Tera, Hexagrid, Abiquo, OpenNebula, etc.  John Treadway recently had posted a roundup describing all of the various<a href="http://www.cloudbzz.com/the-red-ocean-of-cloud-infrastructure-stacks/"> cloud stacks out there.</a></p>
<p>OpenStack is stepping into the ring as a viable third camp. In particular, the OpenStack Storage solution is a clear contender to Amazon S3 &amp; Google Storage. Many service providers and telcos have struggled to find a viable solution using commodity hardware that was price competitive. Suddenly, there is a viable proven solution.</p>
<p>Yet this is only storage. How can it create an effective &#8216;third camp&#8217; alternative to Amazon and VMware for an entire cloud?</p>
<p><strong>Lock-in, Architecture, Standards and The Truth about Interoperability<br />
</strong><br />
Interoperability for infrastructure clouds is poorly understood. Most believe that the problem lies in the on-disk image format (e.g. VMDK vs. VHD vs. qcow) or in the &#8216;hypervisor&#8217; (although people don&#8217;t really unders/tand what this means). The truth is that lock-in has little or nothing to do with disk formats or the hypervisor.  Most on-disk image formats are simply representations of block storage (i.e. disk drives).  That means converting between a VMware VMDK and a Citrix XenServer/Hyper-V VHD is relatively trivial.</p>
<p>What about booting the converted disk image up on a new hypervisor? Guess what, since most hypervisors now rely on hardware virtualization (HVM) [1] using Intel-VT/AMD-V, that means that by default most will work with unmodified operating systems out of the box. No changes needed. The only downside of this is that usually the resulting performance is poor. This requires new paravirtualization (PV) drivers in the converted image.  What does that mean?  After converting the image from one format to another, you simply have to install the PV drivers for the correct OS.  A process that requires being methodical, but is in no way technically challenging.</p>
<p>Where is the lock-in then?  If it&#8217;s not the hypervisor, what makes moving from one cloud to another so difficult?  Simply put, it&#8217;s architectural differences.  Every cloud chooses to do storage and networking differently.</p>
<p>For example, if you wanted to move a virtual machine from GoGrid to Amazon, converting the GoGrid image to an AMI is not difficult.  Unfortunately, GoGrid uses two networks, a &#8216;frontend&#8217; and a &#8216;backend&#8217; where your cloud storage system is connected to via the backend network.  Every Amazon virtual server has only a single network interface.  If your application assumes a separate backend network then what happens when it moves to a cloud without one?  Or vice versa? Similar architectural incompatibilities exist between Rackspace Cloud, Savvis, Terremark, Hosting.com, Joyent, and all of the others.</p>
<p>The problem here, to be a bit more succinct, is that we need reference architectures for how infrastructure clouds are built. Amazon is one such reference. VMware&#8217;s vCloud is potentially another. Now there could be a truly open option with the gravity to gather community support.</p>
<p><strong>More on The Third Camp<br />
</strong><br />
OpenStack&#8217;s potential to build a real community and a set of reference architectures drives towards greater standardization and interoperability. Perhaps more important than a cloud storage alternative, is this possibility for a true OpenStack community to form a critical mass such that a similar level of developers contributing to it as Amazon or VMware. Then commercial and alternative offerings, such as Cloud.com, Hexagrid, and OpenNebula can match their APIs and architectures to this set of reference architectures.</p>
<p>Will it happen?  It&#8217;s hard to say, but the opportunity is there.  Rackspace and others are putting serious weight behind this initiative.</p>
<p><strong>What This Means for Telcos and Service Providers<br />
</strong><br />
For Telcos and SPs this means an alternative to VMware&#8217;s vCloud for commodity service offerings.  A way to compete and operate at scale like Amazon and at a similar price point.  Standardization through a similar reference architecture means greater compatibility between service provider clouds, which means greater benefit for customers and less lock-in, making them more desirable than the walled gardens.</p>
<p>You don&#8217;t want to differentiate on the basic compute, storage, and network offering.  You want this to be as standard and interoperable as possible, just like 3G networks, TCP/IP, and similar service provider technologies.  By creating a common open platform that everyone uses there is a better opportunity to facilitate wider adoption, create a competitive infrastructure service marketplace where providers work on differentiating in areas where they have an inherent advantage:</p>
<ul>
<li>Service and support</li>
<li>Network &amp; datacenter tie-ins (e.g. MPLS, hosting/co-lo)</li>
<li>Bundled service offerings</li>
<li>Differentiated value-added cloud services (VACS)</li>
</ul>
<p>This is a game that all telcos and service providers understand. They have been playing it for the past 15+ years.</p>
<p><strong>Conclusion<br />
</strong><br />
OpenStack, with a strong community behind it, should be an important tool for service providers and large telcos to compete at scale with the Amazon and Googles of this world.</p>
<p>We believe OpenStack and the reference architecture(s) associated with it will allow service providers (SP) to get their undifferentiated cloud offerings up and running early. For this reason, Cloudscaling will put real resources into supporting this effort. Getting basic cloud offerings up early then means providers can focus on support, services, bundling, and differentiated services as soon as possible, while embracing as large a customer base as possible. This is just as they compete on top of basic TCP/IP services today.</p>
<p>[1] Clearly, the market leader, Amazon, does not use HVM. They use PVM, a fully paravirtualized mode of Xen. However, even they seem to understand that HVM is the future. Their latest offering, designed for HPC, which is performance sensitive, uses HVM and supports unmodified operating systems. The reality is that the Intel-VT and AMD-V capabilities on the latest round of processors is incredibly fast and will only get faster. The battle is over.  HVM and silicon won in this case.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/does-openstack-change-the-cloud-game/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Rumor Mill: Google EC2 Competitor Coming in 2010?</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/rumor-mill-google-ec2-competitor-coming-in-2010/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/rumor-mill-google-ec2-competitor-coming-in-2010/#comments</comments>
		<pubDate>Mon, 05 Jul 2010 15:00:34 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google storage]]></category>
		<category><![CDATA[rumor mill]]></category>
		<category><![CDATA[S3]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1174</guid>
		<description><![CDATA[I&#8217;ve heard from a somewhat reliable source that Google is working on their Amazon EC2 competitor. Yes, some kind of on-demand virtual servers. I would have been the last person to guess that Google would take this direction[1], but you &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/rumor-mill-google-ec2-competitor-coming-in-2010/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve heard from a somewhat reliable source that Google is working on their Amazon EC2 competitor.  Yes, some kind of on-demand virtual servers.  I would have been the last person to guess that Google would take this direction[1], but you have to admit it makes a certain sense from their perspective.  Consider:</p>
<ul>
<li>Amazon&#8217;s EC2 is <a href="http://cloudscaling.com/blog/cloud-computing/amazons-ec2-generating-220m-annually">clearly generating Real Revenue (TM)</a> and could be at 500-750M in revenue for 2010</li>
<li>Google has a massive global footprint and is north of <a href="http://www.pandia.com/sew/481-gartner.html">one million server</a>s</li>
<li>The support structure for these servers includes a <a href="http://royal.pingdom.com/2008/04/11/map-of-all-google-data-center-locations/">huge investment</a> in datacenters, networking, and related</li>
<li>The Googleplex houses an extremely large number of talented engineers in relevant areas: networking, storage, Linux kernel, server automation, etc.</li>
<li>Google Storage <a href="http://code.google.com/apis/storage/">recently went into BETA</a> and is accepting developer signups</li>
</ul>
<p>This last is perhaps one of the more telling signs.  As you may be aware, Amazon&#8217;s Simple Storage Service (S3) pre-dates Amazon&#8217;s Elastic Compute Cloud (EC2).  When Amazon launched in Europe they first deployed S3 followed by EC2.  The same happened with their Asia/Pac deployment.</p>
<p>Amazon has built AWS in such a way that all of the services are synergistic, but in particular, EC2 is dependent on S3 as a persistent storage <a href="http://en.wikipedia.org/wiki/System_of_record">system of record</a>.  EC2 AMIs originate from and are stored in S3, it&#8217;s the long term backing store for <a href="http://aws.amazon.com/ebs/">Elastic Block Storage</a> (EBS) and EBS snapshots, and it&#8217;s safe to assume that many other kinds of critical data that AWS relies on are stored there.</p>
<p>Would Google take a different approach?  It&#8217;s doubtful.  Amazon&#8217;s S3 is built to be a highly scalable storage platform[2]. Google&#8217;s own GoogleFS and BigTable server similar purposes.  It&#8217;s certain that Google would use related design principles and hence we could see the Google Storage as a prelude to a Google on-demand virtual server service (Google Servers???).</p>
<p>Combined with the rumor I heard from a reasonably informed source I think we can look forward to an EC2 competitor, hopefully this year.</p>
<p>What I want to bring to folks attention here is that if another credible heavyweight enters into this market it will have a tremendous impact in further driving the <a href="http://cloudscaling.com/blog/cloud-computing/debunking-the-no-such-thing-as-a-private-cloud-myth">utilitization</a> of cloud services.  In the medium term it will also threaten hosting providers and &#8216;<a href="http://cloudscaling.com/blog/cloud-computing/bifurcating-clouds">enterprise</a> <a href="http://cloudscaling.com/blog/technology/must-read-on-the-cloud">clouds</a>&#8216;.</p>
<p>Why?  I think what many hosting providers fail to understand is that Amazon and Google, particularly if fueled by direct competition, <strong>must</strong> grow up into the enterprise space.  Just as in the Innovator&#8217;s Dilemma, they will eventually provide most of the features of any &#8216;enterprise&#8217; cloud, which means that if you aren&#8217;t building to be competitive with Amazon and Google, you aren&#8217;t in the public cloud game.</p>
<p>Much more detail on this in a future posting.</p>
<hr />[1] My best would have been that Google put more weight behind PaaS solutions like Google App Engine (GAE) and related, which are more &#8216;google-y&#8217;.<br />
[2] See the <a href="http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf">whitepaper</a> (PDF) on their Dynamo technology behind S3.  Also check out <a href="http://www.basho.com/Riak.html">Riak</a> from <a href="http://www.basho.com/">Basho</a> that is designed around the same techniques.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/rumor-mill-google-ec2-competitor-coming-in-2010/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Understanding Cloud Datacenter Economies of Scale</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/understanding-cloud-datacenter-economies-of-scale/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/understanding-cloud-datacenter-economies-of-scale/#comments</comments>
		<pubDate>Tue, 04 May 2010 14:03:28 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[Automation]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloudscaling]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=1023</guid>
		<description><![CDATA[James Hamilton&#8217;s recent MIX&#8217;10 presentation on economies of scale for large cloud providers was quite impressive. James &#8220;gets it&#8221; like few others in the industry. If you haven&#8217;t watched his hour-long presentation, I suggest you do. I also recommend this &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/understanding-cloud-datacenter-economies-of-scale/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>James Hamilton&#8217;s recent <a href="http://live.visitmix.com/MIX10/Sessions/EX01">MIX&#8217;10 presentation</a> on economies of scale for large cloud providers was quite impressive.  James &#8220;gets it&#8221; like few others in the industry.  If you haven&#8217;t watched his hour-long presentation, I suggest you do.  I also recommend this excellent response from <a href="http://news.cnet.com/8301-19413_3-20003591-240.html?part=rss&amp;tag=feed&amp;subj=TheWisdomofClouds">James Urquhart</a>.  My goal in this posting is to highlight, clarify and expand on a few of James Hamilton&#8217;s points.  I will focus on Infrastructure-as-a-Service (IaaS) clouds, but the concepts are relevant for other kinds of cloud services.</p>
<p>In his presentation, James focuses on power: utilization, distribution, etc., and while an important element, like him, I don&#8217;t think it&#8217;s the most important factor. <span style="color: #ff0000;"> </span></p>
<p>I also want to dispel the myth that only the largest companies can achieve these economies of scale.  Don&#8217;t get me wrong; providing a cloud service is a scale game.  It requires a certain amount of buying power to compete. However,you don&#8217;t need to be MSFT, YHOO, AMZN, or GOOG to compete effectively.  Buying power can be had at levels much lower than you might think.</p>
<p>In this article, I refer regularly to Jame&#8217;s comments in his presentation, so I suggest you watch his video first.  In order to minimize confusion, I&#8217;ve borrowed some pictures from  his slides and inserted them here for your reference.  This is a long entry, but it will be worth the read as I&#8217;ve got numbers for you which I hope you will find interesting.</p>
<p><strong>Background</strong><br />
Like James, the Cloudscaling team has a history of building large scale services.  I&#8217;ve worked in this area for 16+ years as has our COO, Adam Waters, and several of our team members. Understanding of the economies of scale, especially for service providers, cloud or otherwise is fundamental to our DNA.  For example, see my previous piece describing <a href="http://cloudscaling.com/blog/cloud-computing/subscription-modeling-cloud-performance">how oversubscription works</a>.</p>
<p>Enough of that! Let&#8217;s dig in and look at where <strong>you</strong> can achieve economies of scale, identifying areas James Hamilton may have neglected, and clarifying areas where I think there is still confusion.</p>
<p><span id="more-1023"></span></p>
<p><strong>Economies of Scale</strong><br />
There are a number of areas where you can achieve economies.  James touched on a few in his talk.  While this is not a 100% complete list, here are key areas of opportunity that I see:</p>
<ul>
<li>Datacenter and facilities architecture (power &amp; cooling)</li>
<li>Buying power (COGS) for Networking, Compute (Servers), and Storage</li>
<li>Development &amp; Labor Costs</li>
<li>Standardization &amp; Homogenization</li>
<li>Cash Flow</li>
</ul>
<p>In James Hamilton&#8217;s model (see pie chart below) server costs are the dominant cost, but he critically left out development &amp; labor costs.  This can be as much as 10% for a cloud and while it&#8217;s possible for large clouds to drive this down to a marginal cost, in practice there are no <em>Infrastructure-as-a-Service</em> clouds of sufficient size to achieve this yet. While James focuses primarily on power &amp; cooling in his presentation, let&#8217;s take a closer look at some other areas.</p>
<div id="attachment_1026" class="wp-caption alignnone" style="width: 310px"><img class="size-medium wp-image-1026 " title="james-hamilton-pie-chart" src="http://cloudscaling.com/wp-content/uploads/2010/05/james-hamilton-pie-chart-300x188.png" alt="james-hamilton-pie-chart" width="300" height="188" /><p class="wp-caption-text">Jame&#39;s Hamilton&#39;s Distribution of Cloud Datacenter Costs</p></div>
<p style="text-align: center;">
<p><strong>Networking</strong><br />
There are  two key areas of networking where you can achieve economies of scale:</p>
<ul>
<li>Buying IP (network) transit (OpEx)</li>
<li>Capital expense (CapEx)</li>
</ul>
<p>Unfortunately, James provides one example with numbers from 2006 which compare two companies purchasing bandwidth. The larger company purchased bandwidth at $15 per Mbps per month at the <a href="http://en.wikipedia.org/wiki/Burstable_billing">95th percentile</a> ($/Mbps/mo) vs <span style="color: #000000;">the smaller company&#8217;s expense of</span><span style="color: #000000;"> </span>$95/Mbps/Mo. James uses this number to show a 6 or 7 to 1 buying power difference.</p>
<p>The IP landscape has changed dramatically since 2006.  It&#8217;s a dirty little secret in the hosting and cloud world that bandwidth is dirt cheap[1].  In fact, you need very little buying power  to get rock bottom prices.  Street price for high quality <a href="http://en.wikipedia.org/wiki/Tier_1_network">Tier-1</a> IP transit is &lt;$5/Mbps/Mo if you buy 1Gbps commits.  That&#8217;s a mere $5,000/mo, which is well within the spending range of even small companies.  My local coffee shop could probably afford it.  Yes, it&#8217;s quite possible larger buyers are getting even lower rates than $5/Mbps, but there is a bottom and it&#8217;s not much less than $5/Mbps/Mo so the disparity in buying power is closer to 3:1 or 2:1.</p>
<p>To give you some perspective, I know for a fact that some Yahoo! datacenters push upwards of 40Gbps.  A lot, certainly, but at $1-3/Mbps, well within the buying capacity of even medium size businesses.</p>
<p><span style="color: #ff0000;"> </span>The second area is using buying power to purchase network hardware at much reduced rates. When buying bulk quantities of network gear, the theory is that network costs can be significantly reduced, hence larger players have significant cost advantages.  Unfortunately, this only works partially in practice.  Network equipment costs are still increasing significantly, not reducing as with compute and storage equipment costs.  Combined, networking equipment (CapEx) and power usage (OpEx) make up 21% (in the piechart above) of datacenter costs and <strong>both</strong> are steadily increasing.</p>
<p>Large cloud providers are beginning to address this by systematically removing brand name networking vendors like Cisco and Juniper[2].  It is now possible to buy very high quality, exceptionally cheap networking gear direct from Taiwanese manufacturers. Many of these are the original equipment manufacturers for name brands.</p>
<p>Most people don&#8217;t realize how marked up Cisco/Dell/HP/Juniper gear is.  For example, these Taiwanese OEMs have networking gear with price points as low as $100/10GE port.  Yes, $100 per 10Gig Ethernet port.  That&#8217;s about 1/10 the Cisco price point.  At the same time, the quality of the gear is quite high and in some cases the components and chips are a generation ahead of what&#8217;s available from the name brands.</p>
<p>In other words, times are changing.  We&#8217;re going to see a significant drop in the prices for networking gear across the board for the first time in ages and hopefully networking will get in line with the standard Moore&#8217;s Law curve.</p>
<p><strong>Servers</strong><br />
Flogged. Dead Horse.  There isn&#8217;t any significant buying power to be had in the commodity x86 server market.  x86 server vendors, particularly those providing commodity offerings, have thin margins to begin with.  25% is typical.  An Amazon or a Google can push these down somewhat by buying in bulk, but not enough to give them more than a marginal advantage.  Anyone who can buy $1M USD of servers at once can negotiate a pretty steep discount.  Many businesses can afford to buy at that price point.</p>
<p>James Hamilton understands this and points out where the real buying power is: buying customized hardware in bulk that allows for datacenter optimization and cost reductions in power, cooling, and space. <span style="color: #000000;">By purchasing</span> customized<span style="color: #ff0000;"> </span>server offerings from the likes of an <a href="http://www.sgi.com">SGI/Rackable</a> or <a href="http://www.verari.com">Verari</a> that include well spec&#8217;ed components, designing for their particular datacenter environment there are significant savings to be had.  That&#8217;s where the <strong>real</strong> opportunities lie.</p>
<p>Vendors like SGI/Rackable and Verari can afford to build to spec in large quantities and amortize that customization across large orders.  These vendors are learning from the large clouds what works best and productizing it.  You will be able to benefit from this learning and productization too. In fact, we help our clients figure out these kinds of issues every day and know that these opportunities are within the reach of all types of cloud businesses.</p>
<p><strong>Development &amp; Labor Costs</strong><br />
Although touched on only briefly in the presentation, but I think this is the heart of the matter.  Amazon leads the pack in rapid development of cloud services (see my post &#8220;<a href="http://cloudscaling.com/blog/cloud-computing/is-amazon-winning-the-cloud-race">Is Amazon Winning the Cloud Race?</a>&#8220;). Their ability to innovate both automation and technology allows them to drive  significant economies of scale. <em>This implies that d</em><em>evelopment is a much larger cost of a cloud than might be expected.<span style="color: #ff0000;"><span style="font-style: normal;"><br />
</span></span><span style="color: #ff0000;"> </span></em></p>
<p>Amazon&#8217;s EC2 was initially built using a 15 person engineering team.  I estimate that AWS as a whole probably has 50+ software engineers and 20-30+ support and operations staff [3].  Last year, I <a href="http://cloudscaling.com/blog/cloud-computing/amazons-ec2-generating-220m-annually">estimated there were approximately 40,000 servers</a> at a target price of $2,500 each for EC2.  That&#8217;s 100M in CapEx on servers.  A reasonable estimate of engineering and operational labor costs for AWS are probably close to 10-20M over the past 4 years.  A not inconsequential number compared to the CapEx costs.</p>
<p>One client recently asked us to compare the operational expense costs for server administration between large clouds (e.g. Microsoft and Google) and a typical enterprise.  Usually, enterprises can <a href="http://www.cio.com/article/457473/The_Google_ization_of_Bechtel">manage 100-200 servers</a> per admin.  Microsoft&#8217;s stated goals are 1,000-2,000 for their Chicago datacenter (confirmed in the presentation).  Google is managing at the scale of 10,000 servers per admin and trying to get to 100,000.</p>
<p>If you do the math, the basic cost for administration is $75/mo/server for the enterprise, $7.5/mo for Microsoft, and a mere $.75/mo for Google, <strong>a 100x difference!</strong> When calculating the long term TCO, you&#8217;ll find that investing heavily in automation is a &#8220;no-brainer&#8221; for those whose core competency *is* building at scale and operating IT at the lowest cost possible[4].</p>
<p>The lesson here, which James alludes to in his presentation (see his map of AWS releases in 2009 below), is that one major economy of scale is the ability to have significant resources deployed for software development purposes. The outcome of most cloud software development is generally automation or technology that enables the business to scale more efficiently.</p>
<div id="attachment_1034" class="wp-caption aligncenter" style="width: 588px"><img class="size-full wp-image-1034" title="james-hamilton-aws-rapid-innovation-chart" src="http://cloudscaling.com/wp-content/uploads/2010/05/james-hamilton-aws-rapid-innovation-chart.png" alt="james-hamilton-aws-rapid-innovation-chart" width="578" height="302" /><p class="wp-caption-text">James Hamilton&#39;s map of 2009 AWS releases</p></div>
<p style="text-align: center;">
<p><strong>Standardization &amp; Homogenization</strong><br />
Often overlooked is that businesses built at cloud scale *must* run relatively homogeneous environments.  By standardizing, they can achieve reasonable scale.  For example, Google is reputed to run as little as five hardware configurations across its one million+ server base.  In contrast, a typical enterprise  has hundreds of configurations across a much smaller server base, increasing operational overhead and expense dramatically.</p>
<p>Did you know that the primary driver behind Yahoo!&#8217;s cloud computing initiative was to normalize and cleanup their 800+ configurations?  It&#8217;s impossible to operate at massive scale without homogeneity and standardization.  A huge benefit of virtualization in cloud environments is that it allows the standardization of the physical hardware platform while running a plethora of operating systems at the virtualized layer.  This is also why I am occasionally a little sad when I see large enterprise IT shops insisting on purchasing their x86 server hardware vendor of choice.  It just doesn&#8217;t matter any more.  Not if your cloud (internal or external) is designed correctly.</p>
<p><strong>The Cash Flow Problem</strong><br />
A less understood, but just as interesting area of business scalability for clouds is the interplay of growth, speed, and cash reserves. There&#8217;s no question clouds can be very profitable and attractive for operators given their high margins and 100% compound annual growth rates (CAGR). Typical payback periods on installed cloud hardware average a fast 3-6 months. However, new entrants need to plan for an extended period of ever increasing hardware expenses that stay well ahead of free cash flow. Larger clouds will require $10M in liquidity to meet their rolling hardware acquisition needs, and even small clouds need to think about acquiring hardware at $1M per step. One key in building a highly profitable cloud lies in minimizing the lag between hardware acquisition and revenue generation. The difference between days and months is the difference between profitability and disaster.</p>
<p><strong>A Brief Aside on &#8216;Utilization Rates&#8217;</strong><br />
And that brings up an interesting point I want to address, which is not technically an economy of scale, but worth discussion.  I&#8217;m intrigued by the notion that  &#8217;utilization rates&#8217; are almost completely meaningless in the context of public cloud providers.  James Hamilton&#8217;s claims 30% server utilization (presumably CPU??) as a high metric even for cloud service companies. However,  this doesn&#8217;t matter when you sell your capacity like an IaaS cloud does.  Here&#8217;s why: whatever your particular cloud business model is (e.g. selling RAM, CPU, or &#8216;bundles&#8217; like Amazon&#8217;s instances), you *must* sell as much of it as possible at any given time.  Anything else is business suicide.  You can&#8217;t overbuild and only have 30% of your capacity sold.</p>
<p>Most IaaS providers target 75-80% sold capacity. Their cloud is therefor &#8216;utilized&#8217; at 75-80% in that the capacity is sold, even if not at 100% usage itself.  Unlike a business where unused capacity is <em>waste</em>,  in a business model that sells capacity, unused capacity is <em>unsold</em> capacity and hence, not competitive.  Not all usage of the term &#8216;utilization rates&#8217; is the same, especially when discussing an IaaS cloud[5].</p>
<p><strong>Conclusion</strong><br />
It&#8217;s critical to understand the potential economies of scale for cloud providers.  They can achieve these economies through size and focus. While larger players have some advantages,  many businesses can afford to buy servers and network in enough bulk to see significant price savings.  More important than sheer size is the ability to focus on innovation.</p>
<p>Public cloud providers have a core competency that involves delivering IT services at a very cost effective price point. They are the new IT utility companies of the near future.  Their ability to focus and spend development resources to achieve ever newer economies of scale will be something that traditional businesses can&#8217;t compete with. Traditional enterprise IT vendors will likely continually be playing &#8220;catch up&#8221; and be unable to provide competitive solutions in time.</p>
<p>Economies of scale are why other business infrastructure in the past, e.g. railways, telecommunications, and shipping, have consolidated into businesses who focus on delivery of the infrastructure as a core competency.  To think IT is any different is to bet against history.</p>
<p><strong>UPDATE</strong>: Added footnote to clarify that AWS is currently aggressively hiring so my initial estimates on engineering staff size are way off.</p>
<hr />[1] This means most hosting providers and clouds see very large markups on bandwidth.  They buy it cheap, but don&#8217;t typically pass on the cost savings to smaller customers.<br />
[2] If you listen carefully to Jame&#8217;s Hamiltons presentation he alludes to this several times.<br />
[3] It was pointed out in the comments that AWS currently has <a href="https://us-amazon.icims.com/jobs/search?ss=1&amp;in_iframe=1&amp;searchKeyword=%22http://aws.amazon.com%22">100+ technical positions</a> open, so I&#8217;m probably very off on my initial size estimate.  It&#8217;s hard to derive how big the AWS engineering team is from open positions because it depends on their aggressiveness at hiring.  A wild guess, assuming the answer is &#8216;very aggressive&#8217;, would be 200 engineers, putting their current operational costs for headcount around 30M annually.<br />
[4] Unlike James Urquhart, in his response, it&#8217;s seems apparent to the Cloudscaling team that cost-effective and enterprise-class high-quality clouds are not mutually exclusive.<br />
[5] Check out <a href="http://en.wikipedia.org/wiki/Capacity_utilization">this</a> tangentially related Wikipedia article on capacity utilization rates for the production base of nations.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/understanding-cloud-datacenter-economies-of-scale/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Virtual Server vs. Real Server Disk Drive Speed</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/virtual-disks/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/virtual-disks/#comments</comments>
		<pubDate>Sun, 13 Dec 2009 21:09:42 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloudscaling]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[performance]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=837</guid>
		<description><![CDATA[It&#8217;s important to understand the potential differences between virtual server disk drives and physical disk drives, so I wanted to post a very brief blog on the topic.  For this article I&#8217;ve chosen to compare the performance of an iSCSI &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/virtual-disks/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s important to understand the potential differences between virtual server disk drives and physical disk drives, so I wanted to post a very brief blog on the topic.  For this article I&#8217;ve chosen to compare the performance of an iSCSI SAN on Gigabit Ethernet to a single SATA disk drive.  The reason for this is two-fold: first, it more starkly highlights the relative performance differences between purchasing say a single dedicated server in a hosting environment with a single disk or a virtual machine hosted in a cloud environment.  Secondly, when you are looking at internal private clouds or a lot of the newer cloud offerings, they are commonly built using an <a href="http://en.wikipedia.org/wiki/Storage_area_network">iSCSI SAN</a> backend.</p>
<p>To be clear, the top three U.S. clouds do <strong>not</strong> use iSCSI SANs: Amazon&#8217;s <a href="http://aws.amazon.com/ec2">EC2</a>, <a href="http://www.rackspacecloud.com">Rackspace Cloud</a>, and <a href="http://www.gogrid.com">GoGrid</a>, all use local RAID subsystems.  This is common knowledge.  Of the early cloud pioneers, as far as I&#8217;m aware, mostly the U.K.-based clouds such as <a href="http://www.elastichosts.com">ElasticHosts</a> and <a href="http://www.flexiscale.com">FlexiScale</a> use iSCSI SANs.  The latest set of new cloud entrants, such as Savvis, Terremark, and Hosting.com all use either iSCSI or Fiber Channel-based SANs.  This is also commonly known.</p>
<p>Your Mileage May Vary on these performance numbers.  I&#8217;m not trying to highlight any &#8216;right&#8217; way to build a cloud here.  I&#8217;m simply trying to show what the difference in performance is between a single SATA disk and a VM disk drive backed by an iSCSI SAN over a single Gigabit Ethernet.</p>
<p>This is <strong>not</strong> a robust performance and benchmarking analysis.  It&#8217;s a simple &#8220;run the numbers and compare&#8221; blog posting.  These are by no means authoritative performance numbers and that&#8217;s not their purpose either.  Their purpose is to highlight how performance differs between a single spindle and many in a RAID configuration, even when that RAID is available via a SAN over Gigabit Ethernet.</p>
<p>Please avoid overly critiquing the testing technique here.  It&#8217;s not meant to be robust, so nitpicking it serves no purpose.</p>
<p><strong>Setup &amp; Methodology</strong><br />
This is a very simple test in the Cloudscaling hosting &amp; cloud lab environment.  Both servers running the test are on latest Ubuntu Jaunty Jackalope release.  One is a physical server with a single SATA disk and the other is a VMware vSphere VM backed by an iSCSI LUN.  The iSCSI LUN is provided by a ZFS-based SAN product called <a href="http://www.nexenta.com/corp/">NexentaStor</a> from Nexenta Systems.  This is an OpenSolaris derivative and a very cost effective alternative to say a NetApp or EqualLogic system.</p>
<p>The iSCSI SAN hardware is a simple Sun <a href="http://www.sun.com/servers/x64/x2200/">x2200 M2</a> with a Sun <a href="http://www.sun.com/storage/disk_systems/expansion/4200/">J4200 JBOD</a> and 6 15K RPM SAS drives.</p>
<p>The bonnie++ command line was as simple as possible:</p>
<hr />
<blockquote><p>bonnie++ -n 512</p></blockquote>
<hr />Note that the simplicity of the bonnie testing method may have caused some weird skewing of numbers.  See below for more.</p>
<p><strong>Basic Numbers</strong><br />
Here is a basic high-level chart showing the numbers.</p>
<div id="attachment_847" class="wp-caption alignnone" style="width: 563px"><img class="size-large wp-image-847  " title="iscsi-vs-local-disk-pic1" src="http://cloudscaling.com/wp-content/uploads/2009/12/iscsi-vs-local-disk-pic12-1024x646.png" alt="Figure 1. High level of SATA vs. VM disk" width="553" height="349" /><p class="wp-caption-text">Figure 1. High level of SATA vs. VM disk</p></div>
<p>The first thing you will notice, of course, is the two big spikes for sequential and random file reads.  These numbers are artificially inflated as clearly 325,000 IOPS for sequential and 460,000 IOPS for random reads are ridiculous.  This is likely due to caching either in the OS or the controller on the physical box.  bonnie++ is supposed to account for this, but for some reason, in this instance it did not.  So it might be a little easier to evaluate the relative performance on a logarithmic scale:</p>
<div id="attachment_846" class="wp-caption alignnone" style="width: 563px"><img class="size-large wp-image-846   " title="iscsi-vs-local-disk-pic2" src="http://cloudscaling.com/wp-content/uploads/2009/12/iscsi-vs-local-disk-pic2-1024x646.png" alt="Figure 2. Logarithmic Scale for High Level Results" width="553" height="349" /><p class="wp-caption-text">Figure 2. Logarithmic scale for test Results</p></div>
<p>Much better.  What is easier to notice here is that the VM generally performs better on both standard measures of disk speed: raw throughput and disk operations (I/O per second or <a href="http://en.wikipedia.org/wiki/IOPS">IOPS</a>) with the obvious exception of the two aberrant data points.</p>
<p>Removing those two data points will give us an even clearer picture:</p>
<div id="attachment_848" class="wp-caption alignnone" style="width: 563px"><img class="size-large wp-image-848  " title="iscsi-vs-local-disk-pic3" src="http://cloudscaling.com/wp-content/uploads/2009/12/iscsi-vs-local-disk-pic3-1024x646.png" alt="Figure 3. Normalized test results" width="553" height="349" /><p class="wp-caption-text">Figure 3. Normalized test results</p></div>
<p>Great.  Now this is very clear.  As you can see, the first half of the chart shows raw throughput (Kbytes/second).  When reading blocks from the VM disk we&#8217;re nearly saturating the gigabit ethernet link which should top out at 125Mbps theoretical, and we&#8217;re hitting 107MBps on average over 10 runs, so this is quite acceptable.  The SATA disk, in comparison gets just over 60MBps, which is about right, even though the SATA spec and controller are capable of more.  Sustained block reads from SATA disks will typically be 60-80MBps in the real world.</p>
<p>Much more interesting is the number of <a href="http://en.wikipedia.org/wiki/IOPS">IOPS</a>.  Many real world disk workloads, like a database spend the majority of their time doing large amounts of their &#8216;seeking&#8217; from one position of the disk to another, meaning lots of random file access.  They will bottleneck on waiting for the disk &#8216;head&#8217; to move from one position to another on a disk drive and read new data.  It&#8217;s hard to tell the difference above because the SATA disk is so slow it barely registers on the chart.</p>
<p>If we change to a logarithmic scale again the data becomes much easier to read:</p>
<div id="attachment_849" class="wp-caption alignnone" style="width: 563px"><img class="size-large wp-image-849  " title="iscsi-vs-local-disk-pic4" src="http://cloudscaling.com/wp-content/uploads/2009/12/iscsi-vs-local-disk-pic4-1024x646.png" alt="Figure 4. Normalized logarithmic scale test data" width="553" height="349" /><p class="wp-caption-text">Figure 4. Normalized logarithmic scale test data</p></div>
<p>Now you can see that doing random seeks (i.e. moving the head of the disk drive from one location to a new one to read a piece of data) are starkly different.  A single SATA disk gets about 185 IOPS while a set of 6 SAS disks in the SAN is right around 10,000 IOPS.  This is a huge performance difference.  There are several reasons for this.  One, a typical SATA disk has an average latency of 8.5ms and a 15K SAS disk has only 3ms.  Also, with 6 disks in a RAID configuration, I have 6x more disk heads to read with.</p>
<p>It&#8217;s still a bit hard to see with this chart, but for most of the rest of the IOPS tests above, the SAN solution is roughly 3x the performance of the single disk.  For example, Sequential File deletion is 2,573 (SAN) vs. 840 (SATA).</p>
<p>Rather than going through the entire set of results, I recommend you <a href="http://cloudscaling.com/files/iscsi-vs-local-disk-numbers.xlsx">download my simple spreadsheet</a>.</p>
<p>Note that for Amazon, Rackspace, or GoGrid, local VM disk results will likely look very similar to the iSCSI SAN results for IOPS and sequential read/write (first half of chart) will be <strong>much</strong> higher.</p>
<p>Amazon&#8217;s Elastic Block Storage (EBS) would have similar performance characteristics to the iSCSI SAN above and hence you can see why it can be acceptable for running a database.</p>
<p><strong>Summary</strong><br />
My point here is very simple.  I want to highlight the difference between purchasing a dedicated server with a single (or small number of) SATA disks vs. going with a cloud solution that uses a shared iSCSI SAN or local RAID on a single physical node.  Purchasing your  own dedicated server solution with a RAID can be extremely costly compared to a similar cloud solution.</p>
<p>More importantly, for those workloads that require random I/O and file access, like database applications, RAID is clearly a winner.  That&#8217;s why using a shared RAID (via an iSCSI SAN or a local RAID) on a physical node for your cloud VM can be a clear advantage of the cloud today.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/virtual-disks/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>More on Amazon&#8217;s SAS70 Type II</title>
		<link>http://www.cloudscaling.com/blog/cloud-computing/more-on-amazons-sas70-type-ii/</link>
		<comments>http://www.cloudscaling.com/blog/cloud-computing/more-on-amazons-sas70-type-ii/#comments</comments>
		<pubDate>Sat, 21 Nov 2009 02:59:23 +0000</pubDate>
		<dc:creator>Randy Bias</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[audits]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[elastic compute cloud]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[whitepapers]]></category>

		<guid isPermaLink="false">http://cloudscaling.com/blog/?p=829</guid>
		<description><![CDATA[Amazon hasn&#8217;t been forthcoming since my last post on their control and control objectives, which is disappointing, but expected.  I still believe that transparency here is more important than security through obscurity.  Hiding the controls and control objectives doesn&#8217;t provide &#8230; <a href="http://www.cloudscaling.com/blog/cloud-computing/more-on-amazons-sas70-type-ii/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Amazon hasn&#8217;t been forthcoming since my <a href="http://cloudscaling.com/blog/cloud-computing/why-amazons-sas70-is-bogus">last post</a> on their control and control objectives, which is disappointing, but expected.  I still believe that transparency here is more important than <a href="http://en.wikipedia.org/wiki/Security_through_obscurity">security through obscurity</a>.  Hiding the controls and control objectives doesn&#8217;t provide much in the way of particular security benefits, although I&#8217;m certain some will argue that it does.  Consider however, that while the <a href="http://en.wikipedia.com/wiki/SAS70">SAS70</a> controls would tell what is being audited, that doesn&#8217;t necessarily translate to all of the controls in place.</p>
<p>Regardless, a bit more light has been shed on Amazon&#8217;s controls and measures in their recent security webinar.  You can access it <a href="http://awsmedia.s3.amazonaws.com/Webinar_Overview_of_%20AWS_Security_Processes_102209_final.wmv">here</a>.</p>
<p>At a high level, CJ Moses, who presents the webinar talks to the core areas they covered in the control objectives, which are:</p>
<ol>
<li>Security organization</li>
<li>Amazon employee lifecycle</li>
<li>Logical security</li>
<li>Physical security</li>
<li>Environmental safeguards</li>
<li>Change management</li>
<li>Data integrity, availability, and redundancy</li>
<li>Incident handling</li>
</ol>
<p>This looks pretty reasonable at a high level.  Of course, it would be nice to see the actual controls and objectives, but at least they are covering the appropriate areas of security.  I do notice that there isn&#8217;t much around perimeter or related security.  I&#8217;m guessing they are trying to gloss over the AWS distributed firewall.  It would be nice if someone besides Amazon was vetting the way this was built.  They appear to consider it a piece of core intellectual property despite the fact it would be trivial to reproduce.  I&#8217;m not exactly certain why.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cloudscaling.com/blog/cloud-computing/more-on-amazons-sas70-type-ii/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
<enclosure url="http://awsmedia.s3.amazonaws.com/Webinar_Overview_of_%20AWS_Security_Processes_102209_final.wmv" length="64331157" type="video/asf" />
<enclosure url="http://awsmedia.s3.amazonaws.com/Webinar_Overview_of_%20AWS_Security_Processes_102209_final.wmv" length="64331157" type="video/asf" />
		</item>
	</channel>
</rss>

