Exertion-based billing for cloud storage access
|
|
|
- Reginald Williamson
- 10 years ago
- Views:
Transcription
1 Exertion-based billing for cloud storage access Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger Carnegie Mellon University, VMware CMU-PDL March 2011 Parallel Data Laboratory Carnegie Mellon University Pittsburgh, PA Abstract Charging for cloud storage must account for two costs: the cost of the capacity used and the cost of access to that capacity. For the cost of access, current systems focus on the work requested, such as data transferred or I/O operations completed, rather than the exertion (i.e., effort/resources expended) to complete that work. But, the provider s cost is based on the exertion, and the exertion for a given amount of work can vary dramatically based on characteristics of the workload, making current charging models unfair to tenants, provider, or both. This paper argues for exertion-based metrics, such as disk time, for the access cost component of cloud storage billing. It also discusses challenges in supporting fair and predictable exertion accounting, such as significant inter-workload interference effects for storage access, and a performance insulation approach to addressing them. Acknowledgements: We thank the members and companies of the PDL Consortium (including APC, EMC, Facebook, Google, HP, Hitachi, IBM, Intel, LSI, Microsoft, NEC, NetApp, Oracle, Riverbed, Samsung, Seagate, STEC, Symantec, VMware, and Yahoo!) for their interest, insights, feedback, and support. We also thank Intel and NetApp for hardware donations that enabled this work. This material is based on research sponsored in part by the National Science Foundation, via grants #CNS , #CCF , and #CNS , by the Department of Energy, under award number DE-FC02-06ER25767, by CyLab at Carnegie Mellon, under agreement number DAAD from the Army Research Office and by an NDSEG Fellowship sponsored by the Department of Defense.
2 Keywords: cloud storage, cloud billing, cloud accounting, storage accounting, performance insulation, exertion
3 1 Introduction A core aspect of cloud computing, especially so-called Infrastructure-as-a-Service (IaaS) cloud computing, is tenants sharing provider services and paying for what they use. For this model to work, cloud providers must bill tenants. From the provider s perspective, higher bills are better, and fairness is irrelevant until tenants are able to hold them accountable for it. From the tenant s perspective, however, bills should accurately reflect the demands each given tenant places on provider resources workloads that induce less resource utilization should yield lower bills, and bills should not be inflated by activities of other tenants. Unfortunately, current billing models for cloud storage fall far short of this ideal. Storage capacity billing is reasonable, being based simply on bytes stored, but storage access billing is not. For the latter, billing is usually based on the aggregate count of bytes or I/Os requested, without consideration for the exertion (i.e., resources/effort) required to provide for them. But, it is well-known that storage access efficiency can vary by orders of magnitude, depending on workload characteristics such as locality and transfer size the cost to the provider for a given byte or I/O count would vary accordingly. As a result, tenant bills for storage access may bear little to no relationship to the actual costs. This problem with storage access billing has several consequences. Most importantly, it is unfair to tenants with efficient access patterns. Since the provider must recover all costs, a tenant s bill will likely reflect average efficiency across all tenants, rather than that tenant s efficiency. Moreover, it acts as a disincentive for tenants to invest in efficiency. Having programmers improve access efficiency may not make economic sense, if bills will not reflect a benefit. Storage access billing should be based on some measure of exertion, such as disk time or server time. An exertion-based metric would more accurately reflect the actual cost to the provider of a given tenant s activities. Indeed, for other resources, this approach is already taken for example, CPU resources are often billed in terms of CPU hours. Exertion-based billing for storage access does raise several challenges. For example, storage access can involve a number of resources (e.g., network, server CPU, server cache, disk), and all would ideally be accounted for. More insidiously, access efficiency (and thus exertion for a given workload) can vary widely, in most storage systems, when they are shared by multiple clients (i.e., tenants). Inter-workload interference can reduce efficiency significantly, depending on how the workloads mix. With exertion-based billing, therefore, bills would also vary widely, depending on the activities of other tenants rather than on one s own activities. Such variation is highly undesirable indeed, an ideal billing model might yield the same bill each time a tenant applies the same workload, rather than penalizing a tenant for the provider s inability to manage interference. One option might be to create mechanisms to account for the effects of interference in exertion measurements. Instead, we promote an approach based on storage services that bound the impact of interference on efficiency (and thus exertion). As an example, we describe a request scheduler, called Argon, that provides this property and thereby allows a cloud storage provider to bill fairly for the inherent costs of each tenant s workload. 2 Storage accounting A cloud service provider pays to acquire a set of resources and to make them available at a particular point in time to customers. The number of resources (such as CPUs or disks) and the decision to operate a particular subset of the available resources (for example, to power up a server or spin up a disk) during a given period is driven by client demand. Thus, clients should pay for the load they present to each of the resources. In accounting for storage costs, there are two reasons why a cloud service provider might need to pay to expand its storage resources. First, clients might require more capacity, thus requiring more platters to store their data. Second, clients might require more data accesses, or more performant data accesses. To accommo- 1
4 Data access Provider Capacity Requests Bytes Performance Amazon EC2 1 1 Amazon EBS 2 Amazon EBS S3 3 Amazon S3 3 4 Google App Engine Microsoft Windows Azure vcloud Express 1. VMs are provisioned from a selection of tiers; discrete levels of maximum capacity and performance are available. 2. Requests = IOs. 3. Requests = PUT/GET/etc. 4. Billing for byte transfer only applies when crossing geographical region boundaries, so is ultimately a network charge. Table 1: Popular cloud service providers generally charge for capacity and IOs. date this type of growth, the cloud provider might buy more independent disk heads to service requests in parallel, or more cache, or more bandwidth on the network links to the storage system. Consequently, storage accounting should incorporate both the capacity used by a given workload and the data accesses it makes over time. Using the right metric. The obvious candidates for metering data accesses are based on throughput and bandwidth. Accounting for MBs transferred or IOs performed is straightforward, and such metrics are easily available from the storage system and visible to both the cloud service provider and the user. Table 1 shows the most popular cloud providers generally charge for capacity and IOs, with some exceptions ([1, 2, 3, 4, 5]). One might also propose accounting based on rates of MB/s or IOPS provisioned for a workload. Unfortunately, performance and transfer metrics are poor proxies for storage exertion. Different workloads may require different levels of exertion to receive the same throughput or bandwidth. One common example is the distinction between sequential and random accesses. In Table 2, we show that sequential and random workloads require dramatically different exertion to achieve, for example, 1 MB/s. The sequential workload achieves up to 63.5 MB/s bandwidth given dedicated disk access, whereas the random workload only achieves 1.5 MB/s. Based on full-disk performance, the disk exertion to achieve 1 MB/s for the sequential and random workloads is 1.6% and 67%, respectively. If accounting is based on MB/s, then in this case, for the same 1 MB/s performance, the owner of each workload would pay the same, but running the sequential workload would consume significantly less disk time and thus incur less exertion. Accounting based on the number of MBs transferred in a window of time is not fundamentally different; for instance, transferring 1 MB in a second-long window might take s (1.6% of 1 s) or 0.67 s (67% of 1 s). The metrics of IOs and IOPS suffer from the same problem; for instance, for a workload with fixed request sizes, IOs and MBs, and IOPS and MB/s, simply differ by a scaling factor (request size). Thus, performanceor transfer-based accounting would be unfair because a user running a low difficulty level (for example, sequential) workload would overpay, which deviates from the cloud accounting rule of pay for what you use. The aspect of data access that represents the fraction of the disk resource being used by a workload, and the fraction of the resource not available to other workloads, is disk time. Because there is not a strong correspondence between throughput or bandwidth and disk time, throughput and bandwidth represent poor proxies for storage exertion. Because disk time is a simple measure which captures disk exertion precisely, we propose that storage access billing should be performed using an exertion metric that incorporates disk 2
5 Workload Dedicated disk bandwidth Disk exertion to achieve 1 MB/s Sequential 63.5 MB/s 1.6% Random 1.5 MB/s 67% Table 2: Sequential and random workloads require dramatically different levels of exertion to achieve the same performance or quantity transferred. time (alongside exertion for other storage resources such as the caches 1 ). Because a provider knows what each storage resource, such as a disk, costs to acquire and operate, using exertion metrics such as disk time allows the provider to bill each user fairly and accurately. It s not just about locality. The distinction between sequential and random workloads is only one example of the fact that there is not a 1-to-1 mapping between user-visible performance and exertion. Metadata accesses may have very different costs than data accesses. Workloads with high cacheability may achieve high bandwidth and throughput with even less disk exertion than a very easy sequential workload. Alternative storage technologies such as solid-state disks present very different performance characteristics, different classifications of easy and hard workloads, and different ratios between performance and acquisition costs. All of these factors further compound the limitations of using bandwidth or throughput for accounting and all of these scenarios can be appropriately incorporated into an exertion-based billing model. Does exertion make sense to users? Admittedly, disk time is not a perfect choice. While it can be thought of in different ways, such as having one quarter of a disk or having a virtual disk that runs at one quarter the speed of a real disk, many users may find it to be hard to interpret or predict. This may lead some to propose that the more directly visible metrics of throughput or bandwidth should be used anyway, despite their flaws. While these metrics are indeed more visible, we argue that they are only marginally more meaningful than disk time to a user. Ultimately, the metric that users value is application progress, which for even moderately complex applications will not be directly represented by any metric visible to the underlying storage system. For instance, a particular user might care about credit card transactions per second when thinking about performance, and cents per credit card transaction when thinking about accounting. A credit card transaction might take a variable number of IOs depending upon data layout, caching, and other effects, and therefore metrics such as IOPS are not especially interpretable from the user s perspective either. In this sense, all of the metrics, including IOPS, bandwidth, and disk exertion are only weakly meaningful to application progress. As a result, we believe using storage exertion, based in part on disk and other storage resource time, as the accounting metric is most reasonable because it is at least more natural for providers. 3 Making exertion fair For exertion to be a reasonable accounting metric, it should be fair and predictable. Each workload should operate efficiently, accomplishing its work with as low a level of exertion as is practical. In addition, the exertion required to service a workload should be independent of the activities of other workloads or other artificial, external influences. One consequence of these attributes is that the cost across multiple runs of the same workload should be the same. Exertion-based accounting should always match the intrinsic nature of the workload rather than transient effects outside a user s control. 1 Appropriate metrics for exertion for each of these resources must be chosen, but we leave them outside the scope of this paper. For systems that provide resiliency through techniques such as RAID, access cost would include multiple accesses to independent disks, and capacity cost would capture the extra space used. Today, such features might only be imprecisely accounted for as different tiers or classes of storage with different rates. 3
6 3.1 Problem However, in practice, these goals can be difficult to achieve. Interference between workloads during runtime request scheduling is a common source of poor efficiency and unpredictability. For example, when a sequential workload is scheduled to run concurrently with another workload using the same real disk, their respective access patterns may become interleaved. In this case, the locality of the sequential workload may be lost in the mix. Consequently, its performance may become random-like and it may thus incur excessive exertion as compared to when it runs alone and excessive exertion compared the natural expectation for sequential workloads. Table 3 shows an example of this scenario using experimental results from sequential and random microbenchmarks running together on a real system [9]. It would be unfair for the user to pay more for an intrinsically easy workload whose difficulty was artificially increased by external interference. But the cloud provider would not want to absorb the extra costs caused by an order-of-magnitude or more increase in exertion either. 3.2 Solution Since inter-workload interference may lead to poor predictability and unfairness of exertion-based billing, it must be addressed for the billing mechanism to be acceptable. Rather than suggesting a way to account for or allocate the increase in costs, we believe a more direct and satisfying solution is to avoid the increase in exertion in the first place. Fortunately, effective solutions exist. To address the interference problem, the cloud provider should use a scheduler that limits interference between workloads while retaining efficiency and locality for each of them. Argon [9] is a locality-preserving scheduler that helps address the interference problem and promotes each of the four goals for exertion: predictability, fairness, efficiency, and independence. Argon has a tuning knob called the R-value which represents the minimum efficiency level the system will maintain despite the overhead of sharing the system between workloads. Argon seeks to maintain the property of performance insulation: Each workload receiving a fraction f of disk time should receive performance shared standalone f R. Argon is a quality-of-service system which operates over bandwidth performance metrics. However, Argon s focus on efficiency also has implications for exertion, and its use of disk time fractions maps to exertion. As an example of how Argon manages performance, if the system is configured to maintain R = 0.9, then each workload should receive at least 90% of its dedicated-disk performance within its share of disk time and the performance loss due to sharing should be no more than 10% 2. As a concrete example, a workload that receives 40 MB/s on a dedicated disk and is receiving 25% of a disk s total time would ideally receive 10 MB/s if there were no overhead from sharing the disk, and Argon would ensure at least 9 MB/s if the R-value is set to 0.9. This is in constrast to systems that do not explicitly maintain efficiency or any lower bound on performance due to highly variable and complex sharing overheads. The performance benefits of Argon directly translate into exertion benefits as well. Because Argon strictly bounds the decrease in performance, it also bounds the increase in exertion (disk time fraction f ) necessary to maintain a fixed level of performance shared. Manipulating the equation for performance insulation, f (1/R) shared/standalone. The value shared/standalone represents the exertion level one would ideally expect to yield a workload the performance level shared based on extrapolating from its dedicated-disk performance standalone. Argon s performance insulation limits the inflation of exertion above this ideal fraction to 1/R. For instance, if 90% efficiency is being maintained, then the overhead in exertion for a workload due to sharing should not exceed 11%. One of the key techniques used by Argon to maintain efficiency is disk-head timeslicing. Each workload is assigned a controlled fraction of disk head time, which in this context can be interpreted as the disk 2 The R-value represents a balance between efficiency and maximum latency / variance in latency. 4
7 Exertion for 1 MB/s Workload Ideal (no interference) Interference example Sequential 1.6% 23% Random 67% 74% Table 3: The exertion required to achieve the same level of performance can vary widely if the cloud service provider does not control inter-workload interference. exertion the workload is allowed to consume. These fractions dictate the relative lengths of per-workload timeslices in a round-robin schedule. The absolute lengths are calculated based on the R-value; timeslices are sized to be long enough that the cost of seeking between respective workloads data, which is done at the boundary between timeslices, is amortized across a whole timeslice s worth of useful work to yield the desired efficiency level. Argon also partitions the storage server s cache among workloads and selects appropriate workload cache partition sizes based on their respective access patterns. This allows each workload to maintain hit rates close to their dedicated-server hit rates, except in the case (which Argon identifies up-front) where the workloads are too demanding to coexist on the same server. As a result, Argon protects workloads from efficiency loss and inflation of exertion due to poor cache sharing as well. Earlier, we identified four goals for exertion: predictability, fairness, efficiency, and independence. Argon achieves each. Exertion under Argon is predictable because it is a simple function of the desired performance level and dedicated-disk performance, scaled up by the reciprocal of the R-value. Exertion under Argon is fair because the overhead of sharing is limited and does not fall disproportionately on any one workload. Argon is able to maintain high efficiency levels, such as 90%, without unduly affecting the latency of most requests [9]. And Argon promotes the independence of exertion values: The behavior of other workloads does not appear in the equation f (1/R) shared/standalone that characterizes the exertion needed to achieve a given level of performance under Argon. The influence of other workloads is straightforward and limited to the presence of an R-value less than 1 (representing the overhead of sharing) and to the fact that the fractions assigned to the set of workloads must sum to at most 100% (or else the system will be oversubscribed). The specific behavior of other workloads does not have a complex and unpredictable relationship with a workload s exertion under Argon as it does under systems that do not explicitly bound efficiency loss. 4 Discussion A number of other issues arise when billing for storage exertion. This section surveys a few of them. Are surplus resources free? As described in Section 2, there are two reasons to buy more disks, and thus two reasons to charge for disk use: capacity and performance. We expect providers will assign fees for capacity used and fees for exertion demanded, and bill for the sum of these charges. However, in some scenarios, one of the two might be essentially free. For instance, if demand for capacity dominates (e.g., as it would for archival workloads), then the provider may own a vast number of platters that are nearly full of data, but whose disk heads are virtually idle. In this situation, marginal exertion may cost essentially zero. If demand for performance dominates, then the provider may own a vast number of spindles that are nearly constantly performing transfers, but whose platters have a large amount of free space. If even a modest amount of free time were available, marginal capacity for low-intensity workloads such as archival storage may similarly have virtually no cost. If either of these scenarios are true, it may be most appropriate to only bill for the single metric that is causing system growth, and not for the other that is virtually free. Of course, if full but idle disks are spun down then the simplified view of idle resources being available at no marginal cost ceases to be true. 5
8 Off-peak usage. Another consideration involving marginal cost involves provisioning for the peak. For instance, telephone networks are provisioned for the maximum number of concurrent calls, and many of the dominant costs are associated with building a system that can handle this high watermark. The incremental cost of a single call may be vanishingly small. As a result, some providers offer discounted rates during offpeak hours, and some mobile providers provide unlimited free night and weekend minutes. Usage during peak hours incurs a higher charge because such usage contributes to the high watermark level. Similarly, storage in a cloud (and other resources in the cloud) may be provisioned for peak demand. The incremental cost of keeping a drive spinning during off-peak hours may not be neglibible, but there may still be motivation to charge a reduced rate for usage that is not part of peak load. Competition among providers. A standard billing model based on storage exertion and diverse rates for it over time and across different providers would introduce market forces [6]. Once providers publish their rates, customers have the freedom to compare them and choose the provider that offers the lowest overall cost for their particular workload. A healthy market will require providers to compete for customers. In order to be competitive, they will need to bill for exertion in a predictable, fair, efficient, and independent manner. Thus, there will be an incentive and an imperative to implement efficient request scheduling, effective on-disk data layout policies, and other such approaches that minimize the overhead of virtualization. Common units across resources. Disk exertion can be expressed as the percentage of disk time assigned to a given workload. Percentages are also natural metrics for other cloud resources, such as CPU time, memory footprints, and network bandwidth reservations. While there is no meaningful conversion between percentages for one resource and percentages for another, the use of a common unit enables certain types of analysis. For instance, customers or providers could assess whether their use of resources is balanced or bottlenecked based on percentages. If a workload uses CPU time much more heavily than memory or storage, then perhaps an alternative algorithm that chooses a different spot on the space-time tradeoff (e.g., increased memoization) would be suitable. In addition, by considering exertion along with capacity and being able to do a bottleneck analysis, different storage systems or technologies can better be compared. For instance, if a workload is bandwidth-heavy but capacity-light, billing properly for exertion might better allow the costs of upgrading to solid-state disks, or increasing cache sizes, to be justified. Choosing storage systems. Exertion charging may improve providers ability to choose among different storage systems and capabilities. For instance, higher-end storage systems may provide better performance and/or resiliency at higher prices. With more predictable and meaningful exertion charges, providers may better be able to assess whether higher-end systems are cost-effective. It may allow providers to better justify specialized storage for specific usage types (such as archival storage) or simply to choose a general-purpose system better matched to the range of applications in use. Workload tagging. As storage requests move from a workload down to an underlying storage system, passing through layers of virtualization, file systems, schedulers, drivers, and so on, they must be tracked and accounted back to the originating workload. This is likely to require protocol / API modifications to tag individual requests with workload identifiers. However, it has been done successfully in real, complex systems [8]. Other related work. Various authors, such as Wang [10] and Li [7], have looked at accounting in a cloud environment and examined some of the same issues mentioned in this paper. Wang proposes using metrics such as cost in addition to more traditional metrics such as throughput. Both identify issues of unfairness and unpredictability in pricing. For example, they both cite real examples of cost varying significantly from run to run of the same workload. We focus on one cause of run-to-run variance, and suggest a request scheduler that reduces it. Also, Wang demonstrates that users comparison shop among vendors for the best price, and Li demonstrates that cost and performance for the same task can be significantly different across different cloud vendors. Together, these introduce competitive market forces like the ones we mentioned earlier. 6
9 5 Summary Cloud storage access billing should be exertion-based, charging tenants for the costs actually induced by their I/O activities rather than an inaccurate proxy (e.g., byte or I/O count) for those costs. Although challenges exist in moving to such exertion-based billing, doing so rewards efficiency and can be fair to tenants. One set of challenges, related to inter-workload interference, can be addressed with explicit performance insulation, such as is provided by the Argon storage server. With such mechanisms, exertion-based billing can be fair and repeatable, increasing tenant ability to account for costs and justify efficiency improvements. Acknowledgements We thank Orran Krieger for discussions which inspired this work, and our colleagues in the Parallel Data Lab for helping expand on our initial ideas. References [1] [2] [3] [4] [5] [6] O. Krieger, P. McGachey, and A. Kanevsky. Enabling a marketplace of clouds: Vmware s vcloud director. ACM SIGOPS Operating Systems Review, 44(4): , [7] A. Li, X. Yang, S. Kandula, and M. Zhang. Cloudcmp: Shopping for a cloud made easy. In HotCloud 10: 2nd USENIX Workshop on Hot Topics in Cloud Computing, [8] B. H. Sigelman, L. A. Barroso, M. Burrows, P. Stephenson, M. Plakal, D. Beaver, S. Jaspan, and C. Shanbhag. Dapper, a large-scale distributed systems tracing infrastructure. Technical Report dapper , Google, April [9] M. Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. Argon: performance insulation for shared storage servers. In FAST 07: Proceedings of the 5th USENIX Conference on File and Storage Technologies, [10] H. Wang, Q. Jing, R. Chen, B. He, Z. Qian, and L. Zhou. Distributed systems meet economics: Pricing in the cloud. In HotCloud 10: 2nd USENIX Workshop on Hot Topics in Cloud Computing,
Exertion-based billing for cloud storage access
Exertion-based billing for cloud storage access Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger Carnegie Mellon University, VMware Abstract Charging for cloud storage must account for two
Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1
Performance Study Performance Characteristics of and RDM VMware ESX Server 3.0.1 VMware ESX Server offers three choices for managing disk access in a virtual machine VMware Virtual Machine File System
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies
WHITE PAPER Optimizing Virtual Platform Disk Performance
WHITE PAPER Optimizing Virtual Platform Disk Performance Think Faster. Visit us at Condusiv.com Optimizing Virtual Platform Disk Performance 1 The intensified demand for IT network efficiency and lower
Flash Memory Arrays Enabling the Virtualized Data Center. July 2010
Flash Memory Arrays Enabling the Virtualized Data Center July 2010 2 Flash Memory Arrays Enabling the Virtualized Data Center This White Paper describes a new product category, the flash Memory Array,
Everything you need to know about flash storage performance
Everything you need to know about flash storage performance The unique characteristics of flash make performance validation testing immensely challenging and critically important; follow these best practices
Application-Focused Flash Acceleration
IBM System Storage Application-Focused Flash Acceleration XIV SDD Caching Overview FLASH MEMORY SUMMIT 2012 Anthony Vattathil [email protected] 1 Overview Flash technology is an excellent choice to service
Speeding Up Cloud/Server Applications Using Flash Memory
Speeding Up Cloud/Server Applications Using Flash Memory Sudipta Sengupta Microsoft Research, Redmond, WA, USA Contains work that is joint with B. Debnath (Univ. of Minnesota) and J. Li (Microsoft Research,
Leveraging EMC Fully Automated Storage Tiering (FAST) and FAST Cache for SQL Server Enterprise Deployments
Leveraging EMC Fully Automated Storage Tiering (FAST) and FAST Cache for SQL Server Enterprise Deployments Applied Technology Abstract This white paper introduces EMC s latest groundbreaking technologies,
Oracle Database Scalability in VMware ESX VMware ESX 3.5
Performance Study Oracle Database Scalability in VMware ESX VMware ESX 3.5 Database applications running on individual physical servers represent a large consolidation opportunity. However enterprises
Data Backup and Archiving with Enterprise Storage Systems
Data Backup and Archiving with Enterprise Storage Systems Slavjan Ivanov 1, Igor Mishkovski 1 1 Faculty of Computer Science and Engineering Ss. Cyril and Methodius University Skopje, Macedonia [email protected],
Technical White Paper. Symantec Backup Exec 10d System Sizing. Best Practices For Optimizing Performance of the Continuous Protection Server
Symantec Backup Exec 10d System Sizing Best Practices For Optimizing Performance of the Continuous Protection Server Table of Contents Table of Contents...2 Executive Summary...3 System Sizing and Performance
Comparison of Windows IaaS Environments
Comparison of Windows IaaS Environments Comparison of Amazon Web Services, Expedient, Microsoft, and Rackspace Public Clouds January 5, 215 TABLE OF CONTENTS Executive Summary 2 vcpu Performance Summary
Top 5 Reasons to choose Microsoft Windows Server 2008 R2 SP1 Hyper-V over VMware vsphere 5
Top 5 Reasons to choose Microsoft Windows Server 2008 R2 SP1 Hyper-V over VMware Published: April 2012 2012 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and
The State of Cloud Storage
205 Industry Report A Benchmark Comparison of Speed, Availability and Scalability Executive Summary Both 203 and 204 were record-setting years for adoption of cloud services in the enterprise. More than
Understanding Microsoft Storage Spaces
S T O R A G E Understanding Microsoft Storage Spaces A critical look at its key features and value proposition for storage administrators A Microsoft s Storage Spaces solution offers storage administrators
Architectural Implications of Cloud Computing
Architectural Implications of Cloud Computing Grace Lewis Research, Technology and Systems Solutions (RTSS) Program Lewis is a senior member of the technical staff at the SEI in the Research, Technology,
The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage
The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage sponsored by Dan Sullivan Chapter 1: Advantages of Hybrid Storage... 1 Overview of Flash Deployment in Hybrid Storage Systems...
Exploring RAID Configurations
Exploring RAID Configurations J. Ryan Fishel Florida State University August 6, 2008 Abstract To address the limits of today s slow mechanical disks, we explored a number of data layouts to improve RAID
Xangati Storage Solution Brief. Optimizing Virtual Infrastructure Storage Systems with Xangati
Optimizing Virtual Infrastructure Storage Systems with Xangati Virtualized infrastructures are comprised of servers, switches, storage systems and client devices. Of the four, storage systems are the most
Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array
Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array Evaluation report prepared under contract with Lenovo Executive Summary Even with the price of flash
Performance Management for Cloudbased STC 2012
Performance Management for Cloudbased Applications STC 2012 1 Agenda Context Problem Statement Cloud Architecture Need for Performance in Cloud Performance Challenges in Cloud Generic IaaS / PaaS / SaaS
Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.
Object Storage: A Growing Opportunity for Service Providers Prepared for: White Paper 2012 Neovise, LLC. All Rights Reserved. Introduction For service providers, the rise of cloud computing is both a threat
Accelerating Server Storage Performance on Lenovo ThinkServer
Accelerating Server Storage Performance on Lenovo ThinkServer Lenovo Enterprise Product Group April 214 Copyright Lenovo 214 LENOVO PROVIDES THIS PUBLICATION AS IS WITHOUT WARRANTY OF ANY KIND, EITHER
Software-defined Storage Architecture for Analytics Computing
Software-defined Storage Architecture for Analytics Computing Arati Joshi Performance Engineering Colin Eldridge File System Engineering Carlos Carrero Product Management June 2015 Reference Architecture
Deep Dive: Maximizing EC2 & EBS Performance
Deep Dive: Maximizing EC2 & EBS Performance Tom Maddox, Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved What we ll cover Amazon EBS overview Volumes Snapshots
Storage I/O Control: Proportional Allocation of Shared Storage Resources
Storage I/O Control: Proportional Allocation of Shared Storage Resources Chethan Kumar Sr. Member of Technical Staff, R&D VMware, Inc. Outline The Problem Storage IO Control (SIOC) overview Technical Details
EMC XTREMIO EXECUTIVE OVERVIEW
EMC XTREMIO EXECUTIVE OVERVIEW COMPANY BACKGROUND XtremIO develops enterprise data storage systems based completely on random access media such as flash solid-state drives (SSDs). By leveraging the underlying
Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation
Overview of I/O Performance and RAID in an RDBMS Environment By: Edward Whalen Performance Tuning Corporation Abstract This paper covers the fundamentals of I/O topics and an overview of RAID levels commonly
EMC DATA DOMAIN OPERATING SYSTEM
EMC DATA DOMAIN OPERATING SYSTEM Powering EMC Protection Storage ESSENTIALS High-Speed, Scalable Deduplication Up to 58.7 TB/hr performance Reduces requirements for backup storage by 10 to 30x and archive
Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers
BASEL UNIVERSITY COMPUTER SCIENCE DEPARTMENT Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers Distributed Information Systems (CS341/HS2010) Report based on D.Kassman, T.Kraska,
EMC DATA DOMAIN OPERATING SYSTEM
ESSENTIALS HIGH-SPEED, SCALABLE DEDUPLICATION Up to 58.7 TB/hr performance Reduces protection storage requirements by 10 to 30x CPU-centric scalability DATA INVULNERABILITY ARCHITECTURE Inline write/read
Q & A From Hitachi Data Systems WebTech Presentation:
Q & A From Hitachi Data Systems WebTech Presentation: RAID Concepts 1. Is the chunk size the same for all Hitachi Data Systems storage systems, i.e., Adaptable Modular Systems, Network Storage Controller,
VMware and Xen Hypervisor Performance Comparisons in Thick and Thin Provisioned Environments
VMware and Xen Hypervisor Performance Comparisons in Thick and Thin Provisioned Environments Devanathan Nandhagopal, Nithin Mohan, Saimanojkumaar Ravichandran, Shilp Malpani [email protected],
Cloud Computing: Making the right choices
Cloud Computing: Making the right choices Kalpak Shah Clogeny Technologies Pvt Ltd 1 About Me Kalpak Shah Founder & CEO, Clogeny Technologies Passionate about economics and technology evolving through
Characterizing Task Usage Shapes in Google s Compute Clusters
Characterizing Task Usage Shapes in Google s Compute Clusters Qi Zhang 1, Joseph L. Hellerstein 2, Raouf Boutaba 1 1 University of Waterloo, 2 Google Inc. Introduction Cloud computing is becoming a key
How To Store Data On An Ocora Nosql Database On A Flash Memory Device On A Microsoft Flash Memory 2 (Iomemory)
WHITE PAPER Oracle NoSQL Database and SanDisk Offer Cost-Effective Extreme Performance for Big Data 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com Table of Contents Abstract... 3 What Is Big Data?...
Optimize VDI with Server-Side Storage Acceleration
WHITE PAPER Optimize VDI with Server-Side Storage Acceleration Eliminate Storage Bottlenecks for Fast, Reliable Virtual Desktop Performance 1 Virtual Desktop Infrastructures (VDI) give users easy access
Navisphere Quality of Service Manager (NQM) Applied Technology
Applied Technology Abstract Navisphere Quality of Service Manager provides quality-of-service capabilities for CLARiiON storage systems. This white paper discusses the architecture of NQM and methods for
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a
Masters Project Proposal
Masters Project Proposal Virtual Machine Storage Performance Using SR-IOV by Michael J. Kopps Committee Members and Signatures Approved By Date Advisor: Dr. Jia Rao Committee Member: Dr. Xiabo Zhou Committee
NetApp FAS Hybrid Array Flash Efficiency. Silverton Consulting, Inc. StorInt Briefing
NetApp FAS Hybrid Array Flash Efficiency Silverton Consulting, Inc. StorInt Briefing PAGE 2 OF 7 Introduction Hybrid storage arrays (storage systems with both disk and flash capacity) have become commonplace
Amazon Cloud Storage Options
Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object
Energy aware RAID Configuration for Large Storage Systems
Energy aware RAID Configuration for Large Storage Systems Norifumi Nishikawa [email protected] Miyuki Nakano [email protected] Masaru Kitsuregawa [email protected] Abstract
Comparing major cloud-service providers: virtual processor performance. A Cloud Report by Danny Gee, and Kenny Li
Comparing major cloud-service providers: virtual processor performance A Cloud Report by Danny Gee, and Kenny Li Comparing major cloud-service providers: virtual processor performance 09/03/2014 Table
Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array
Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array Evaluation report prepared under contract with Lenovo Executive Summary Love it or hate it, businesses rely on email. It
Benchmarking Hadoop & HBase on Violin
Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages
PARALLELS CLOUD SERVER
PARALLELS CLOUD SERVER An Introduction to Operating System Virtualization and Parallels Cloud Server 1 Table of Contents Introduction... 3 Hardware Virtualization... 3 Operating System Virtualization...
An Oracle White Paper November 2010. Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager
An Oracle White Paper November 2010 Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager Introduction...2 Oracle Backup and Recovery Solution Overview...3 Oracle Recovery
White Paper. Educational. Measuring Storage Performance
TABLE OF CONTENTS Introduction....... Storage Performance Metrics.... Factors Affecting Storage Performance....... Provisioning IOPS in Hardware-Defined Solutions....... Provisioning IOPS in Software-Defined
Server Virtualization: Avoiding the I/O Trap
Server Virtualization: Avoiding the I/O Trap How flash memory arrays and NFS caching helps balance increasing I/O loads of virtualized servers November 2010 2 Introduction Many companies see dramatic improvements
The Methodology Behind the Dell SQL Server Advisor Tool
The Methodology Behind the Dell SQL Server Advisor Tool Database Solutions Engineering By Phani MV Dell Product Group October 2009 Executive Summary The Dell SQL Server Advisor is intended to perform capacity
89 Fifth Avenue, 7th Floor. New York, NY 10003. www.theedison.com 212.367.7400. White Paper. HP 3PAR Adaptive Flash Cache: A Competitive Comparison
89 Fifth Avenue, 7th Floor New York, NY 10003 www.theedison.com 212.367.7400 White Paper HP 3PAR Adaptive Flash Cache: A Competitive Comparison Printed in the United States of America Copyright 2014 Edison
Directions for VMware Ready Testing for Application Software
Directions for VMware Ready Testing for Application Software Introduction To be awarded the VMware ready logo for your product requires a modest amount of engineering work, assuming that the pre-requisites
Part 1: Price Comparison Among The 10 Top Iaas Providers
Part 1: Price Comparison Among The 10 Top Iaas Providers Table of Contents Executive Summary 3 Estimating Cloud Spending 3 About the Pricing Report 3 Key Findings 3 The IaaS Providers 3 Provider Characteristics
Cloud Server. Parallels. An Introduction to Operating System Virtualization and Parallels Cloud Server. White Paper. www.parallels.
Parallels Cloud Server White Paper An Introduction to Operating System Virtualization and Parallels Cloud Server www.parallels.com Table of Contents Introduction... 3 Hardware Virtualization... 3 Operating
PostgreSQL Performance Characteristics on Joyent and Amazon EC2
OVERVIEW In today's big data world, high performance databases are not only required but are a major part of any critical business function. With the advent of mobile devices, users are consuming data
Full and Para Virtualization
Full and Para Virtualization Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF x86 Hardware Virtualization The x86 architecture offers four levels
Best Practices for Deploying Citrix XenDesktop on NexentaStor Open Storage
Best Practices for Deploying Citrix XenDesktop on NexentaStor Open Storage White Paper July, 2011 Deploying Citrix XenDesktop on NexentaStor Open Storage Table of Contents The Challenges of VDI Storage
Evaluating HDFS I/O Performance on Virtualized Systems
Evaluating HDFS I/O Performance on Virtualized Systems Xin Tang [email protected] University of Wisconsin-Madison Department of Computer Sciences Abstract Hadoop as a Service (HaaS) has received increasing
Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts
Part V Applications Cloud Computing: General concepts Copyright K.Goseva 2010 CS 736 Software Performance Engineering Slide 1 What is cloud computing? SaaS: Software as a Service Cloud: Datacenters hardware
Cloud Computing Trends
UT DALLAS Erik Jonsson School of Engineering & Computer Science Cloud Computing Trends What is cloud computing? Cloud computing refers to the apps and services delivered over the internet. Software delivered
How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
High Performance SQL Server with Storage Center 6.4 All Flash Array
High Performance SQL Server with Storage Center 6.4 All Flash Array Dell Storage November 2013 A Dell Compellent Technical White Paper Revisions Date November 2013 Description Initial release THIS WHITE
Understanding Data Locality in VMware Virtual SAN
Understanding Data Locality in VMware Virtual SAN July 2014 Edition T E C H N I C A L M A R K E T I N G D O C U M E N T A T I O N Table of Contents Introduction... 2 Virtual SAN Design Goals... 3 Data
An Oracle White Paper November 2010. Oracle Real Application Clusters One Node: The Always On Single-Instance Database
An Oracle White Paper November 2010 Oracle Real Application Clusters One Node: The Always On Single-Instance Database Executive Summary... 1 Oracle Real Application Clusters One Node Overview... 1 Always
White paper: Unlocking the potential of load testing to maximise ROI and reduce risk.
White paper: Unlocking the potential of load testing to maximise ROI and reduce risk. Executive Summary Load testing can be used in a range of business scenarios to deliver numerous benefits. At its core,
How AWS Pricing Works
How AWS Pricing Works (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 15 Table of Contents Table of Contents... 2 Abstract... 3 Introduction... 3 Fundamental
Rackspace Cloud Databases and Container-based Virtualization
Rackspace Cloud Databases and Container-based Virtualization August 2012 J.R. Arredondo @jrarredondo Page 1 of 6 INTRODUCTION When Rackspace set out to build the Cloud Databases product, we asked many
Windows Server Performance Monitoring
Spot server problems before they are noticed The system s really slow today! How often have you heard that? Finding the solution isn t so easy. The obvious questions to ask are why is it running slowly
Technology Insight Series
Evaluating Storage Technologies for Virtual Server Environments Russ Fellows June, 2010 Technology Insight Series Evaluator Group Copyright 2010 Evaluator Group, Inc. All rights reserved Executive Summary
Cloud Based Application Architectures using Smart Computing
Cloud Based Application Architectures using Smart Computing How to Use this Guide Joyent Smart Technology represents a sophisticated evolution in cloud computing infrastructure. Most cloud computing products
MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products
MaxDeploy Ready Hyper- Converged Virtualization Solution With SanDisk Fusion iomemory products MaxDeploy Ready products are configured and tested for support with Maxta software- defined storage and with
TPC-W * : Benchmarking An Ecommerce Solution By Wayne D. Smith, Intel Corporation Revision 1.2
TPC-W * : Benchmarking An Ecommerce Solution By Wayne D. Smith, Intel Corporation Revision 1.2 1 INTRODUCTION How does one determine server performance and price/performance for an Internet commerce, Ecommerce,
Total Cost of Solid State Storage Ownership
Total Cost of Solid State Storage Ownership An In-Depth Analysis of Many Important TCO Factors SSSI Members and Authors: Esther Spanjer, SMART Modular Technologies Dan Le, SMART Modular Technologies David
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
White Paper. Cloud Native Advantage: Multi-Tenant, Shared Container PaaS. http://wso2.com Version 1.1 (June 19, 2012)
Cloud Native Advantage: Multi-Tenant, Shared Container PaaS Version 1.1 (June 19, 2012) Table of Contents PaaS Container Partitioning Strategies... 03 Container Tenancy... 04 Multi-tenant Shared Container...
Reallocation and Allocation of Virtual Machines in Cloud Computing Manan D. Shah a, *, Harshad B. Prajapati b
Proceedings of International Conference on Emerging Research in Computing, Information, Communication and Applications (ERCICA-14) Reallocation and Allocation of Virtual Machines in Cloud Computing Manan
Relocating Windows Server 2003 Workloads
Relocating Windows Server 2003 Workloads An Opportunity to Optimize From Complex Change to an Opportunity to Optimize There is much you need to know before you upgrade to a new server platform, and time
A Close Look at PCI Express SSDs. Shirish Jamthe Director of System Engineering Virident Systems, Inc. August 2011
A Close Look at PCI Express SSDs Shirish Jamthe Director of System Engineering Virident Systems, Inc. August 2011 Macro Datacenter Trends Key driver: Information Processing Data Footprint (PB) CAGR: 100%
Performance Analysis: Benchmarking Public Clouds
Performance Analysis: Benchmarking Public Clouds Performance comparison of web server and database VMs on Internap AgileCLOUD and Amazon Web Services By Cloud Spectator March 215 PERFORMANCE REPORT WEB
The Total Cost of (Non) Ownership of a NoSQL Database Cloud Service
The Total Cost of (Non) Ownership of a NoSQL Database Cloud Service Jinesh Varia and Jose Papo March 2012 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1
Using Synology SSD Technology to Enhance System Performance Synology Inc.
Using Synology SSD Technology to Enhance System Performance Synology Inc. Synology_SSD_Cache_WP_ 20140512 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges...
Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration
Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration Part I of an ebook series of cloud infrastructure and platform fundamentals not to be avoided when preparing
Using Synology SSD Technology to Enhance System Performance Synology Inc.
Using Synology SSD Technology to Enhance System Performance Synology Inc. Synology_WP_ 20121112 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges... 3 SSD
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006
OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006 EXECUTIVE SUMMARY Microsoft Exchange Server is a disk-intensive application that requires high speed storage to deliver
DELL s Oracle Database Advisor
DELL s Oracle Database Advisor Underlying Methodology A Dell Technical White Paper Database Solutions Engineering By Roger Lopez Phani MV Dell Product Group January 2010 THIS WHITE PAPER IS FOR INFORMATIONAL
Energy Constrained Resource Scheduling for Cloud Environment
Energy Constrained Resource Scheduling for Cloud Environment 1 R.Selvi, 2 S.Russia, 3 V.K.Anitha 1 2 nd Year M.E.(Software Engineering), 2 Assistant Professor Department of IT KSR Institute for Engineering
Accelerate the Performance of Virtualized Databases Using PernixData FVP Software
WHITE PAPER Accelerate the Performance of Virtualized Databases Using PernixData FVP Software Increase SQL Transactions and Minimize Latency with a Flash Hypervisor 1 Virtualization saves substantial time
Amazon EC2 Product Details Page 1 of 5
Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of
Nimble Storage for VMware View VDI
BEST PRACTICES GUIDE Nimble Storage for VMware View VDI N I M B L E B E S T P R A C T I C E S G U I D E : N I M B L E S T O R A G E F O R V M W A R E V I E W V D I 1 Overview Virtualization is an important
FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency
FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency Dimitris Krekoukias Systems Engineer NetApp Data ONTAP 8 software operating in Cluster-Mode is the industry's only unified, scale-out
