Cloud = Virtualization + Automation The cloud is IT infrastructure as a service. The IT infrastructure is delivered as virtual machines. Automation moves those VMs around, thereby providing the delivery mechanism for the service. Rachel Chalmers, 451 group, October 20th, 2008
Lower-level, Less management Higher-level, More management EC2 Azure AppEngine Automatic scalability and failover
On the right, Sam Johnston s 6 layer Cloud Computing Stack http://samj.net/2008/09/taxonomy-6-layer-cloud-computing-stack.html
- Provisioning of computing resources: CPU, Memory, Processing - Basically an Operating System on demand - Usually billed on a per-hour usage model Players in this space Amazon EC2, Flexiscale, GoGrid, Joyent, AppNexeus Management providers: RightScale, ElasticServer Operational Differences - Wildly different CPU/Memory specs - Operating System differences - Difficult/Impossible to move whole images between providers
- Provisioning of data storage: Either file/object based or Database like functionality. - Billed on bandwidth and storage consumed Players in the space: Amazon S3, Nirvanix, Mosso, Amazon s SimpleDB, Google s BigTable, Azure Storage Management Providers: Jungle Disk, Elephant Disk, PutPlace.com Issues - Different types of data storage models - Limitations on the size of individual data units - Different billing models makes it hard to do a straight comparison - Access to the data generally uses non-standard query syntax - No common API - Performance issues
Provides a complete software stack - An IDE for the cloud Takes care of: Runtimes, Load balancing, Resource provisioning Players in the space - Google App Engine - Python (initially, now also Java) - Force.com (SalesForce) - Microsoft s Azure -.NET - Heroku.com (RubyOnRails) Issues Different languages -- Most platforms are unary Different operational philosophies - Google App Engine for example doesn t permit files Lots of limitations in terms of deployment Completely reliant on the provider for complete uptime and operation Widely different billing models
- Applications that are completely online - Operate on data that is stored in the cloud or ether - No client software generally required - Billing: Ad. Revenue, Premium Services Players in this space - Google Apps - Gmail / Google Docs - Apple s MobileMe - Microsoft s Live - Hotmail, Live Spaces - SalesForce.com Issues - Near on impossible to move between providers - GMail to Hotmail requires major disruption - End user focused o Consumer side of cloud computing - Completely reliant on the provider for complete uptime and operation
- Provides services to which other applications can utilise - Specific to vertical markets where most Web2.0 standards live - Usually free for non-commercial use Players in the space - Google/Yahoo Maps - PayPal / Google Checkout - Google / Yahoo WebServices - Amazon Merchant Services - Amazon Simple Queue Service Issues/Comments - Some Web2.0 services have attempted a standardization path - Most however are complete vendor lock-in - Mashup applications utilize Cloud Service
- Accessing the cloud - Clients utilize standard access protocols o XML o JSON o REST / SOAP Browsers o FireFox / IE / Chrome Mobile clients o Google Android / Symbian / iphone / J2ME Desktop Apps o Google Gears / Adobe AIR / Microsoft Azure
Clients Services Applications Platform Storage Increasing level of complexity and vendor lock-in Infrastructure
Resource Cost in Medium DC 1000 servers Cost in Very Large DC 50,000 servers Ratio Network $95 / Mbps / month $13 / Mbps / month 7.1x Storage $2.20 / GB / month $0.40 / GB / month 5.7x Administration 140 servers/admin >1000 servers/admin 7.1x Price per KWH Where Possible Reasons Why 3.6 Idaho Hydroelectric power; not sent long distance 10.0 California Electricity transmitted long distance over the grid; limited transmission lines in Bay Area; no coal fired electricity allowed in California. 18.0 Hawaii Must ship fuel to generate electricity
Jim Gray in 2003: economic necessity mandates putting the data near the application, since the cost of wide-area networking has fallen more slowly (and remains relatively higher) than all other IT hardware costs. Not to confuse with TCO, e.g. no admin cost.
Flexibility Eco-efficiency Credit crunch business imperatives - CapEx to OpEx pay-as-you-go - Fixed cost to variable cost Improved time to market
Pay by use instead of provisioning for peak Resources Capacity Demand Resources Capacity Demand Time Static data center Time Data center in the cloud Unused resources
Economics of Cloud Users Heavy penalty for under-provisioning Resources Capacity Demand Resources 1 2 3 Time (days) Lost revenue Capacity Demand 1 2 3 Time (days) Resources Capacity Demand 1 2 3 Time (days) Lost users
There are alternatives Feature Comparison * Extra per-hour charge
Cost for 1 month: Lowest Spec 1 month ~ 4 weeks ( 24 x 7 x 4 = 672 hours )
Cost for 1 month: Highest Spec 1 month ~ 4 weeks ( 24 x 7 x 4 = 672 hours ) CPU Cores differ
Your data in the cloud
Adoption Challenges Availability Data lock-in Challenge Data Confidentiality and Auditability Opportunity Multiple providers & DCs Standardization Encryption, VLANs, Firewalls; Geographical Data Storage
Challenge Data transfer bottlenecks Performance unpredictability Scalable storage Bugs in large distributed systems Scaling quickly Opportunity FedEx-ing disks, Data Backup/Archival Improved VM support, flash memory, scheduling VMs Invent scalable store Invent Debugger that relies on Distributed VMs Invent Auto-Scaler that relies on ML; Snapshots
Policy and Business Challenges Challenge Opportunity Reputation Fate Sharing Offer reputation-guarding services like those for email Software Licensing Pay-for-use licenses; Bulk use sales
Startups opportunities Startups and prototyping One-off tasks Research at scale Cloud Killer Apps Mobile and web applications Extensions of desktop software Matlab, Mathematica Batch processing / MapReduce
BalticGrid Planned activities BGi - BalticGrid Innovation Lab SA1-3, NA1-4, JRA BalticCloud BC Courses SME connectivity
31 Our choice: - Great team! Rich Wolski, UCSB. - Integration with - Integration with This Thursday: Ubuntu 9.04 on Thursday, with Eucalyptus 1.5! BalticCloud: Leveraging Grid Infrastructure at ISGC 2009, 23.04.2009, Taipei BC
SweCloud The Swedish Cloud Computing Infrastructure Proposed pilot in October 2007 (Fredrik, Åke, Lennart) Started January 2009, SNIC financed, lead by Åke Leveraging on the BC results Roadmap Connecting to BalticCloud this month Running pilot application this fall Evaluation September 2009
See more cloud computing being used by startups and in-house quick prototyping See more cloud computing being used in academia See more open source alternatives, and alternative cloud providers also in academia (BalticCloud, SweCloud) Improved solutions addressing the dangers in cloud computing. Low expectations on standard APIs (takes time, not sure about the interest from industry, looks like for grids). I.e. industry will not help here, this is the competition epicenter. Hype, problems, business issues: Successful usage will prove the value. If it is a paradigm shift, it will be clear this year. Plenty to opportunities for everyone!
Clouds and HPC Dan Reed take on this Twins Separated at Birth: Cloud Computing, HPC and How Microsoft Is Trying To Change How We Think About Scale, and others on supercomputer as a service a topic in its own Cloudbursting how companies outsource parts of their business to clouds, keeping the confident parts in-house Open Cloud Manifesto Dedicated to the belief that the cloud should be open OGF cloud API standardization efforts All the industry buzz, all the companies trying to give their customers their cloud offering Still ongoing discussions about definitions The 15 definitions of how to tell it is not a cloud, and why these definitions are semi-wrong. Many, many, cloud related projects also in Europe, eg. OpenNebula.org part of RESERVOIR Some slides on how in practice use AWS and Google AppEngine (see BC course) - how to do loadbalancing, databases in the cloud, mass storage in the clouds, and more about options to AWS, Google and Azure. Some more business reports..or maybe less? Stay tuned!
from http://geekandpoke.typepad.com