Understanding SAN and Storage Constraints In Private Clouds



Similar documents
ioscale: The Holy Grail for Hyperscale

Storage Systems Performance Testing

Brocade and EMC Solution for Microsoft Hyper-V and SharePoint Clusters

Microsoft Private Cloud Fast Track

Scaling Cloud Storage. Julian Chesterfield Storage & Virtualization Architect

White Paper. Educational. Measuring Storage Performance

Flash Use Cases Traditional Infrastructure vs Hyperscale

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Building a cost-effective and high-performing public cloud. Sander Cruiming, founder Cloud Provider

Memory Channel Storage ( M C S ) Demystified. Jerome McFarland

Answering the Requirements of Flash-Based SSDs in the Virtualized Data Center

Maximizing SQL Server Virtualization Performance

Traditional v/s CONVRGD

Scaling Database Performance in Azure

Deploying Affordable, High Performance Hybrid Flash Storage for Clustered SQL Server

THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

Deploying F5 BIG-IP Virtual Editions in a Hyper-Converged Infrastructure

The Production Cloud

Flash Storage Gets Priority with Emulex ExpressLane

SOLUTION BRIEF. Resolving the VDI Storage Challenge

MS Exchange Server Acceleration

Virtualizing Microsoft SQL Server on Dell XC Series Web-scale Converged Appliances Powered by Nutanix Software. Dell XC Series Tech Note

OPTIMIZING SERVER VIRTUALIZATION

How To Connect Virtual Fibre Channel To A Virtual Box On A Hyperv Virtual Machine

High Performance Oracle RAC Clusters A study of SSD SAN storage A Datapipe White Paper

All-Flash Storage Solution for SAP HANA:

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Application-Focused Flash Acceleration

Microsoft Exchange Server 2003 Deployment Considerations

Transforming Data Center Economics and Performance via Flash and Server Virtualization

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Eliminating Backup System Bottlenecks: Taking Your Existing Backup System to the Next Level. Jacob Farmer, CTO, Cambridge Computer

Cloud Optimize Your IT

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

Virtualization of the MS Exchange Server Environment

NEXENTA S VDI SOLUTIONS BRAD STONE GENERAL MANAGER NEXENTA GREATERCHINA

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

WHITEPAPER. One Cloud For All Your Critical Business Applications.

DVS Enterprise. Reference Architecture. VMware Horizon View Reference

Expert Reference Series of White Papers. Visions of My Datacenter Virtualized

An Introduction - ZNetLive's Hybrid Dedicated Servers

White Paper. Fueling Successful SMB Virtualization with Smart Storage Decisions. 89 Fifth Avenue, 7th Floor. New York, NY

NET ACCESS VOICE PRIVATE CLOUD

The next step in Software-Defined Storage with Virtual SAN

CA Cloud Overview Benefits of the Hyper-V Cloud

ZADARA STORAGE. Managed, hybrid storage EXECUTIVE SUMMARY. Research Brief

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

How To Get A Storage And Data Protection Solution For Virtualization

Deep Dive: Maximizing EC2 & EBS Performance

Cloud Optimize Your IT

Is Cloud Right For Your Company? Prepared for MTUG 2014 L. Mark Stone

Enterprise Cloud Services HOSTED PRIVATE CLOUD

my forecasted needs. The constraint of asymmetrical processing was offset two ways. The first was by configuring the SAN and all hosts to utilize

New Features in SANsymphony -V10 PSP1 Software-defined Storage Platform

Dell Converged Infrastructure

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

Technology Insight Series

NexentaStor Enterprise Backend for CLOUD. Marek Lubinski Marek Lubinski Sr VMware/Storage Engineer, LeaseWeb B.V.

DELL s Oracle Database Advisor

All-Flash Arrays Weren t Built for Dynamic Environments. Here s Why... This whitepaper is based on content originally posted at

SLIDE 1 Previous Next Exit

Automated Data-Aware Tiering

Nutanix Tech Note. Configuration Best Practices for Nutanix Storage with VMware vsphere

Software Defined Microsoft. PRESENTATION TITLE GOES HERE Siddhartha Roy Cloud + Enterprise Division Microsoft Corporation

Navisphere Quality of Service Manager (NQM) Applied Technology

Parallels Cloud Storage

Business white paper Invest in the right flash storage solution

FLASH STORAGE SOLUTION

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

How To Store Data On A Server Or Hard Drive (For A Cloud)

Introducing the Dynamic and Scalable Data Center

RFP-MM Enterprise Storage Addendum 1

Zadara Storage Cloud A

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

Amazon Cloud Storage Options

Benchmarking Hadoop & HBase on Violin

Microsoft Cloud Platform System. powered by Dell

Windows 8 SMB 2.2 File Sharing Performance

Server and Storage Virtualization with IP Storage. David Dale, NetApp

Building Storage Service in a Private Cloud

Drupal Performance Tips and Tricks. Khalid Baheyeldin. Drupal Camp Toronto 2014

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

How SSDs Fit in Different Data Center Applications

ERP performance and security considerations for cloud and on-premises deployments. White Paper

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Big data Devices Apps

Windows Server 2008 R2 Hyper-V Live Migration

Selenium Grid Build vs. Buy

Copyright 2015 Splunk Inc. Go Big or Go Home. Sean Delaney Specialist SE Mustafa Ahamed Director, Product Management

Scaling from Datacenter to Client

Accelerating Database Applications on Linux Servers

The Data Placement Challenge

Enhancements of ETERNUS DX / SF

A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

Analysis of VDI Storage Performance During Bootstorm

7 Real Benefits of a Virtual Infrastructure

Transcription:

Understanding SAN and Storage Constraints In Private Clouds L. Mark Stone, Founder and CIO (207) 772-5678 mark.stone@reliablenetworks.com

Agenda Non-technical Executive Summary Choke Points (there are many) Technical Level Setting Key Challenges Robust I/O = Fast, Wide and Consistent Use Cases: MSSQL Transaction Logs, Tuned MySQL server, and LAMP-stack IVR System Evaluating Cloud Providers: Due Diligence Checklists and SLAs Q&A

Introduction L. Mark Stone Founder and CIO of Reliable Networks CIO, NYC-based corporate trading company 19 offices in 16 countries Directed $1.2B in procurement spend Responsible for global infrastructure and $5.0 million software development Technology and Media M&A, 13 years M.Sc. Econ., London School of Economics B.S. Finance, State U. of NY @ Albany DJ and Chief Engineer 91FM WCDB

Executive Summary Most Public Cloud workloads to date do not require Robust I/O. Backups and Disaster Recovery Web Apps File Sharing Development and Testing

Executive Summary Most (public portions of) Hybrid Clouds to date do not require Robust I/O either: Production database servers stay on premises Scalable web front-end expands to Cloud Replica/Reporting database servers moving to Cloud

Executive Summary Many traditional Private Cloud workloads do require Robust I/O : Electronic Health Record systems Busy email systems Loaded LAMP stack systems Heavy database usage Heavy disk usage (e.g. map rendering)

Executive Summary Most Cloud providers to date have: Offered lower-cost storage solutions to meet the needs of typical web workloads Developed infrastructure architectures where storage I/O was not expected to be a bottleneck Favored investments in high-performance CPU cycles and RAM over storage

Consequently: Executive Summary High I/O applications work occasionally-tointermittently well in public clouds Cloud providers starting to invest in Robust I/O hardware Hybrid Clouds poised to take off, but: Storage QoS solutions primitive at best, so careful technical due diligence and performance-oriented SLAs suggested Know your own application s attributes (IOPs, read:write ratio and average I/O size most critical)

Key Challenges ACID/near-ACID-compliant Legacy (database-driven) applications generally require Robust I/O Atomicity, Consistency, Isolation, Durability Apps coded to BASE standards (i.e. most Mobile apps) generally don t require Robust I/O Basically Available, Soft state, Eventual consistency First-Gen Public Clouds unsuitable for many busy Legacy systems:

Key Challenges Choke Points Poor Random Read:Write Performance Storage backends used low cost SATA-backed SANs or midline DASD Too many VMs per compute node Linux top shows high %wa and %st Windows queue depth high Too many compute nodes per storage frame Only so many IOPs available Only so much compute node-to-storage frame bandwidth available

Key Challenges Choke Points Piggish compute node neighbors Your application gets starved (throttled) for CPU, I/O and storage bandwidth resources Distributed file systems/databases create additional choke points Results: Inconsistent performance Benchmark results non-repeatable Consequently, Legacy apps stay in-house

Wake Up Call Your own Private Cloud will face the same challenges as First-Gen Public Cloud providers Your Hybrid Cloud will perform only as well as the weakest link in the chain Consistent high performance from Legacy apps requires Robust I/O

Robust I/O Defined What is Robust I/O? Includes all I/O-related infrastructure upon exiting the compute nodes: Cabling, switches, routers, spinning disks, SSDs, disk controllers, SANs, network storage frames, etc. Must be as good or better than premisesbased DASD, which means

Robust I/O Defined Speed. Sustained and random I/O at the disk subsystem level must be fast enough to support the intended workloads. Bandwidth. Traffic channels between and within the compute and storage nodes must be wide enough to support the intended data traffic flows. Repeatability. Performance must be consistent whether the workloads are conducted at 2:00am or 2:00pm.

IOPs How Best To Measure? Input-Output Instructions per Second Domain Controller = ~50 SMB Email Server = ~150 Busy Email MTA Server = ~500 Map Rendering Server = ~1,000

IOPs Quantified How many IOPs per disk? 15K SAS Enterprise DASD = ~180 IOPs 7.2K SATA Midline DASD = ~80 IOPs SAN-Grade SSD = ~3,000 to ~6,000 IOPs Does not include RAID Penalties Q: How many Read IOPs from a 24 x 15K SAS RAID 10 array? (Ignore write penalty for the moment) A: (24 / 2) *180 = 4,320 IOPs (maybe)

Why Maybe? Compute Node-to-Storage Frame Bandwidth can get in the way of consuming all of your SAN s IOPs: Need to know your application s average I/O size to see if IOPs and/or bandwidth will be the choke point(s)

Example One: Do The Math MSSQL Server transaction log LUN Writes typically 4K each 1,000 IOPs = 0.31 Gbps of compute nodeto-storage frame bandwidth required (Note 2x RAID 10 write penalty) No problem to host this on mid-tier storage frames.

Example Two: Do The Math MySQL database averaging 32K I/O 500 IOPs consumed; presume 100% reads 1.25 Gbps of compute node-to-storage frame bandwidth required for just one server in the farm Key Takeway: One-half the IOPs consumes four times the bandwidth

Example Three: Do The Math Integrated LAMP IVR system: About 50:50 read:write Inbound/outbound Email Third-party IVR hosting Peak Load = ~5,000 calls per hour ~1,000 IOPs and ~1.0 Gbps compute node-to-storage bandwidth consumed

Technical Level Setting IOPs at the application level is always less than IOPs needed at the storage level unless the system is 100% read Storage IOPs needed = (App IOPs x % read) + ((App IOPs x %write) x RAID penalty) RAID Penalties (for writes only) = RAID1 = 2; RAID6 = 6

Technical Level Setting Typical I/O Sizes It Depends SANs we access are reporting averages of 10K to 50K Bottom Line 1: Storage IOPs consumed are typically a multiple of IOPs produced at the application layer Bottom Line 2: No set rule of thumb correlating IOPs and bandwidth

What To Do? Step One: Know Your Application Benchmark IOPs, read:write percentage and average I/O size Step Two: Provider Due Diligence Eliminate those providers who have too Robust I/O and those who have not Robust-enough I/O As soon as you start talking tech, your provider s salesperson will refer you to a knowledgeable engineer

What To Do? Provider Due Diligence Continued Ask about the choke points Ask about the architecture Redundancy and resiliency One provider uses DASD storage for performance; if a compute node fails, your VM will need to be restored from a backup Note: If your app is coded to BASE standards, redundancy/resiliency within a data center is effectively irrelevant (true Cloud deployment)

What To Do? Step Three: Negotiate Performance- Related SLAs over Uptime-Related SLAs No mature storage QoS solutions yet, so need to rely on close proxies at the application layer, e.g. Linux: %st as reported by top Windows: Disk queue depth

What To Do? Step Four: Avoid Square Peg-Round Hole Syndrome: There is nothing wrong with dedicated, non-virtualized servers using DASD if the workloads require it Recoding Legacy apps to BASE standards is at best a non-trivial task

Summary Conclusions Storage systems and the connectivity to them have many choke points which can make the performance of your application erratic at best Obtaining Robust I/O in a Private Cloud for Legacy applications is a challenge; know your application so you can conduct informed provider due diligence

Summary Conclusions In the absence of storage QoS, application performance proxies should be monitored Negotiate for SLAs based on performance primarily, in addition to uptime There s no shame (today) in nonvirtualized, dedicated server hosting if the workloads require it

Q&A and Contact Info L. Mark Stone, Founder and CIO http://www.reliablenetworks.com/blog/ mark.stone@reliablenetworks.com @LMStone http://www.linkedin.com/in/lmarkstone Reliable Networks 477 Congress Street, Suite 812 Portland, ME 04101 (207) 772-5678