Evaluation Report: Database Acceleration with HP 3PAR StoreServ 7450 All-flash Storage



Similar documents
HP SN1000E 16 Gb Fibre Channel HBA Evaluation

Evaluation Report: HP Blade Server and HP MSA 16GFC Storage Evaluation

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Evaluation Report: Supporting Multiple Workloads with the Lenovo S3200 Storage Array

Analysis of VDI Storage Performance During Bootstorm

HP Smart Array Controllers and basic RAID performance factors

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Violin Memory 7300 Flash Storage Platform Supports Multiple Primary Storage Workloads

Accelerating Server Storage Performance on Lenovo ThinkServer

Optimizing SQL Server Storage Performance with the PowerEdge R720

QLogic 16Gb Gen 5 Fibre Channel for Database and Business Analytics

Lab Validation Report

Intel RAID SSD Cache Controller RCS25ZB040

EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server

EMC XTREMIO EXECUTIVE OVERVIEW

Using Synology SSD Technology to Enhance System Performance Synology Inc.

The 8Gb Fibre Channel Adapter of Choice in Oracle Environments

Microsoft SQL Server 2014 Fast Track

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

HP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

Benchmarking Cassandra on Violin

Comparison of Hybrid Flash Storage System Performance

HP and SanDisk Partner for the HP 3PAR StoreServ 7450 All-flash Array

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

Sun 8Gb/s Fibre Channel HBA Performance Advantages for Oracle Database

Accelerating Microsoft Exchange Servers with I/O Caching

Leveraging EMC Fully Automated Storage Tiering (FAST) and FAST Cache for SQL Server Enterprise Deployments

Measuring Interface Latencies for SAS, Fibre Channel and iscsi

Deploying Affordable, High Performance Hybrid Flash Storage for Clustered SQL Server

3PAR Fast RAID: High Performance Without Compromise

High Performance Oracle RAC Clusters A study of SSD SAN storage A Datapipe White Paper

Virtuoso and Database Scalability

Flash-optimized Data Progression

Maximum performance, minimal risk for data warehousing

IBM FlashSystem storage

Performance Beyond PCI Express: Moving Storage to The Memory Bus A Technical Whitepaper

Data Center Storage Solutions

Flash Storage Gets Priority with Emulex ExpressLane

Cloud Optimized Performance: I/O-Intensive Workloads Using Flash-Based Storage

HP 3PAR StoreServ 8000 Storage - what s new

High Performance SQL Server with Storage Center 6.4 All Flash Array

FUSION iocontrol HYBRID STORAGE ARCHITECTURE 1

HP high availability solutions for Microsoft SQL Server Fast Track Data Warehouse using SQL Server 2012 failover clustering

Increase Database Performance by Implementing Cirrus Data Solutions DCS SAN Caching Appliance With the Seagate Nytro Flash Accelerator Card

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

Windows 8 SMB 2.2 File Sharing Performance

Software-defined Storage at the Speed of Flash

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

MS Exchange Server Acceleration

Achieving a High Performance OLTP Database using SQL Server and Dell PowerEdge R720 with Internal PCIe SSD Storage

HPe in Datacenter HPe3PAR Flash Technologies

Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55%

SPC BENCHMARK 1 EXECUTIVE SUMMARY 3PAR INC. 3PARINSERV T800 STORAGE SERVER SPC-1 V1.10.1

Condusiv s V-locity 4 Boosts Virtual Machine Performance Over 50% Without Additional Hardware

The MAX5 Advantage: Clients Benefit running Microsoft SQL Server Data Warehouse (Workloads) on IBM BladeCenter HX5 with IBM MAX5.

SAN Conceptual and Design Basics

Amadeus SAS Specialists Prove Fusion iomemory a Superior Analysis Accelerator

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

Performance Brief: MegaRAID SAS 9265/9285 Series

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

Why Use 16Gb Fibre Channel with Windows Server 2012 Deployments

Virtualization of the MS Exchange Server Environment

How To Connect Virtual Fibre Channel To A Virtual Box On A Hyperv Virtual Machine

Lab Validation Report

Cloud-Optimized Performance: Enhancing Desktop Virtualization Performance with Brocade 16 Gbps

HP 3PAR storage technologies for desktop virtualization

How To Evaluate Netapp Ethernet Storage System For A Test Drive

All-Flash Arrays Weren t Built for Dynamic Environments. Here s Why... This whitepaper is based on content originally posted at

The Data Placement Challenge

Oracle Database Deployments with EMC CLARiiON AX4 Storage Systems

Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

REFERENCE ARCHITECTURE. PernixData FVP Software and Splunk Enterprise

Benchmarking Hadoop & HBase on Violin

Dell Virtualization Solution for Microsoft SQL Server 2012 using PowerEdge R820

SMB Direct for SQL Server and Private Cloud

SLIDE 1 Previous Next Exit

8Gb Fibre Channel Adapter of Choice in Microsoft Hyper-V Environments

LSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment

Performance and scalability of a large OLTP workload

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Transcription:

Evaluation Report: Database Acceleration with HP 3PAR StoreServ 7450 All-flash Storage Evaluation report prepared under contract with HP Executive Summary Solid state storage is transforming the entire computing industry and becoming the de facto standard for high-performance storage systems. Increasing demands of database workloads, growing numbers of virtual machines and more powerful processors are driving demand for ever-faster storage systems, especially those that can deliver that performance with significantly lower latency than in the past. The HP 3PAR StoreServ 7450 All-flash storage system provides the high performance and low latency that meet today s datacenter needs. HP commissioned Demartek to run database workloads that show the performance of the HP 3PAR StoreServ 7450 All-flash Storage system. Not only did the 3PAR StoreServ 7450 show excellent low latency and fast performance for a typical OLTP database workload, Demartek also found that the 3PAR StoreServ 7450 was not very heavily taxed with a single OLTP database accessing the array. As a result, we proceeded to run a combination of database workloads including two online transaction processing (OLTP) workloads and a data warehouse workload to see how well this storage system would handle a fairly heavy, mixed workload. We deployed a mixture of 16Gb and 8Gb Fibre Channel infrastructure in this series of tests. Key Findings For a combination of OLTP and data warehousing workloads from three different host servers, we found that the HP 3PAR StoreServ 7450 All-flash storage system easily handled a peak of 300,000 IOPs, sustained nearly 100,000 IOPS, and achieved 4000 MBPS while maintaining low average latency while all three workloads were running. These combined workloads, as viewed by the storage system, were 93% read and 7% write. The average read block size was 36.4 KB and the average write block size was 8.8 KB. In our opinion, the HP 3PAR StoreServ 7450 All-flash storage system works well for database workloads because it provides low latency in addition to higher raw performance.

The Need for All-flash Storage Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 2 of 16 Today s database applications face a variety of challenges brought on by seemingly insatiable demands on server and storage infrastructure. At the same time, new technologies are being introduced that offer possible solutions to meet these growing challenges. A number of factors contribute to the need for all-flash storage, including higher performance and lower latency demands from database applications, increasing numbers of virtualized servers and the availability of high-speed networks such as 8Gb and 16Gb Fibre Channel. All-flash storage systems provide not only higher raw performance in terms of IOPS and bandwidth, but have the advantage of operating with lower latency. In many cases, especially OLTP environments, low latency is becoming more important than the IOPS and bandwidth performance.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 3 of 16 HP 3PAR StoreServ 7450 All-flash Storage Architecture The HP 3PAR StoreServ 7450 All-flash storage system provides high-speed performance with low latency for mission-critical applications such as database applications. It provides high-performance and the same enterprise-grade levels of resiliency (up to 4 nodes), rich Tier 1 data services, efficiency, data mobility and disaster tolerance that are available on other HP 3PAR StoreServ platforms. It does this with a flash-optimized architecture that reduces the performance bottlenecks that can choke general-purpose disk arrays that have been retrofitted by stuffing them with solid-state disks (SSDs), and includes enterprise-grade features that are not always available with some of the newer all-flash offerings from others. HP 3PAR StoreServ 7450 storage system has a number of design optimizations to leverage performance from flash media while keeping the costs down. Some of them are described below briefly: Adaptive Read and Write Adaptive Read and Write is a software innovation that enables a more granular approach than with spinning media by matching host I/O size reads and writes to flash media to avoid unnecessary data reads and writes. This significantly reduces latency and optimizes back end performance to enable more applications to be consolidated. Autonomic Cache Offload Autonomic Cache Offload is another new flash software-based optimization that eliminates cache bottlenecks by automatically changing the frequency at which data is offloaded from cache to flash media, based on utilization rate and without requiring any user intervention. This ensures consistently high performance levels as you scale workloads to hundreds of thousands of IOPS. Another important aspect of the cache offload algorithm is the decision around which cache data should be flushed to the back end, and which should not be. HP 3PAR StoreServ Storage keeps track of read cache hits and keeps hot data in cache itself, thereby lowering latencies of frequently accessed data. In addition, for handling flash, flusher threads have been added to 3PAR cache management so it can perform more operations in parallel.

Multi-tenant I/O Processing Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 4 of 16 New multi-tenant I/O processing innovations enable performance optimization for mixed workloads and VDI deployments by breaking large I/O into 32 KB sub-i/o blocks. This prevents small read I/O chunks from getting held up behind large I/O requests, therefore assuring reduced latency. Breaking large sequential read I/O into sub-i/o chunks distributes these reads and lowers the possibility of smaller transactional read I/O operations of get held up because of a previous large read I/O operation, therefore ensuring consistently low latency for transactional I/O, even in mixed workload scenarios. With the multiple workloads (OLTP and Decision Support) running on 7450 in this evaluation, enhancements like multi-tenant I/O processing help 7450 to deliver consistently good latency for the OLTP application while deliver good bandwidth for the Decision Support application. Specialized ASIC for Mixed Workload Support and Zero Detection In the HP 3PAR StoreServ Architecture, transactional and throughput-intensive workloads are able to run on the same storage resources without contention because of the HP 3PAR StoreServ Gen4 ASIC, which offloads some of the work of the controllers, allowing them to process smaller I/O without inordinate delay due to large block I/O. This is important, because precious CPU cycles are not wasted for data movement; instead the CPU cycles are available for delivering advanced Tier 1 data services. The HP 3PAR StoreServ ASIC features an efficient, silicon-based zero detection mechanism. This hardware feature removes allocated but unused space without impacting performance. Every block of flash storage that gets reclaimed immediately becomes available for other applications requiring space. This ensures that existing flash storage is utilized efficiently, possibly delaying the purchase of additional capacity. In addition, with zero detection built into the ASIC, a stream of zeroes that may be present in a write I/O can be eliminated before being written to flash. In the world of flash, where there is a penalty for every single write (in terms of media endurance), this write elimination helps elongate the life of flash-based media. System-wide Striping Data and I/O for each volume are striped widely across all system resources, including CPUs, ports, cache, and drives. This wide striping enables the system to deliver

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 5 of 16 accelerated performance levels (with all resources supporting each volume) while avoiding any single point of contention. Even a small volume can leverage the performance of hundreds of flash devices and all the system s controller nodes for optimal performance. Because the system autonomically manages this system-wide load balancing, no extra time or complexity is required to create or maintain an optimally configured system. With system-wide striping, data is distributed across all SSDs in a granular fashion, keeping the command queues low on any individual SSD. System-wide Sparing The HP 3PAR StoreServ Architecture reserves spare chunklets in all flash media. In contrast to traditional architectures that enforce the need to reserve dedicated spares that sit idle, the HP 3PAR StoreServ Architecture uses every single flash device, reserving spare chunklets in each of them. This ensures a balanced load and wearing across all flash media. Should there be a media failure, system-wide sparing also helps protect against performance degradation by enabling many-to-many rebuild, resulting in faster rebuilds. Moreover, with Adaptive Sparing, 3PAR StoreServ is able to release the spare chunklets present in each SSD media to the SSD itself, allowing SSDs to use that spare capacity as additional over-provisioned capacity. This increase in over-provisioned capacity enhances endurance of SSDs within a 3PAR StoreServ array Media Wear Gauge The HP 3PAR Operating System monitors the wear of each flash device and allows users to see the wear level of each device at any given time. This keeps users informed as to the amount of media wear taking place and helps them replace SSDs in a planned fashion.

Data Warehouse Workload Descriptions Database Workloads Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 6 of 16 In order to show the performance of the storage system under a mixed workload, Demartek ran a combination of online transaction processing and data warehouse workloads with the configuration shown below. For these tests, we ran the TPC Benchmark E (TPC-E) and TPC Benchmark H (TPC-H), which were used to generate real-world workloads for measuring the performance of the storage system, but not for the purpose of publishing official benchmark results. TPC-E is a variation of an On-Line Transaction Processing (OLTP) workload that models a financial brokerage firm with customers who generate transactions related to trades, account inquiries, and market research. The brokerage firm in turn interacts with financial markets to execute orders on behalf of the customers and updates relevant account information. This workload consists of a mixture of mostly reads and some writes to its database. TPC-H is an industry-standard, decision-support benchmark that simulates broad business intelligence database environments most relevant to information systems that provide organizations with answers to critical business analytics. This workload consists of almost all reads. Real vs. Synthetic Workloads The workloads employed in this test used a real database (Microsoft SQL Server) with database tables, indexes, etc., performing actual database transactions. When using real database workloads, I/O rate will vary as the workload progresses because the database performs operations that consume varying amounts of CPU and memory resources in addition to I/O resources. These results more closely resemble a real customer environment. These are unlike benchmarks that use synthetic workloads that perform the same I/O operations repeatedly, resulting in relatively steady I/O rates which, although potentially faster, do not resemble real customer environments.

Performance Results Host Perspective Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 7 of 16 When measuring performance of storage systems, three key metrics are I/Os per second (IOPS), bandwidth and latency. Online transaction applications tend to generate higher numbers of IOPS with lower bandwidth. Data warehousing workloads tend to generate higher bandwidth but lower IOPS. Latency is important for both types of workloads. Definitions IOPS I/O s per second a measure of the total I/O operations (reads and writes) issued by the application servers. Bandwidth a measure of the data transfer rate, or I/O throughput, measured in bytes per second or MegaBytes per second (MBPS). Latency a measure of the time taken to complete an I/O request, also known as response time. This is frequently measured in milliseconds (one thousandth of a second). Workload Combinations The data shown below is divided into two groups: Two OLTP workloads Two OLTP workloads plus one data warehousing workload For all the charts shown below, the performance data is for the database volumes only, not the log volumes.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 8 of 16 Two OLTP Workloads OLTP workloads are transaction-oriented workloads, and one of the key metrics for these workloads is latency. The overall average latency for the two OLTP workloads shown below is 0.61 milliseconds. This measurement was taken without any data warehousing workloads running and illustrated good, consistent low latency for OLTP only workloads..

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 9 of 16 Two OLTP Workloads and One Data Warehousing Workload Performance data was collected by the host application servers and by the storage system. In this section, we provide both perspectives of the data side-by-side. Although similar, the data shown is different, primarily because the storage system averages the data over five-minute intervals. This five-minute interval data collection has the effect of smoothing out spikes in the graph. The OLTP workloads began to finish and decrease their load on the storage system at approximately 3 hours (10800 seconds). Server-side Observations As expected, the OLTP workloads issued a higher number of IOPS than the data warehousing workload, and the data warehousing workload achieved higher bandwidth (throughput) than the OLTP workloads. Storage-side Observations On average, the reads were 93% of the combined IOPS, and were 98% of the combined bandwidth.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 10 of 16 Data warehousing workloads, because of their large queries and table scans, tend to have much higher latencies that tend to be bursty. OLTP workloads tend to have shorter latencies because of the transactional nature of their operations. Latency is supremely important parameter for OLTP applications and often requires an entire storage system to be dedicated to run OLTP applications. The data warehousing applications are much more bandwidth hungry. Server-side Observations As expected, data warehousing workload had the effect of increasing the overall latency for the combined workloads. Storage-side Observations The latency for writes was fairly consistent. The overall average latency for the combined OLTP and data warehousing workloads was 1.55 milliseconds. HP 3PAR StoreServ 7450 is able to handle a multi-tenant environment, where response time of the OLTP application remains good even with a bursty data warehousing application running on the same system.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 11 of 16 One final measure provided by the storage system is the average I/O block sized for reads and writes. As the OLTP workloads were winding down at about 3 hours into the run, the data warehousing workload increased its activity, noted by the larger block sizes.

Gen 5 (16Gb) Fibre Channel Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 12 of 16 Generation 5 (16 Gbps) Fibre Channel provides an increased ceiling in terms of the possible performance for each connection, doubling the throughput of the previous generation (8 Gbps). Database administrators often tune their applications to take advantage of every possible speed benefit, and Gen 5 Fibre Channel connections provide an important performance improvement opportunity. Other benefits include a reduced number of links needed to achieve the same bandwidth, reduced power consumption needed to achieve the same bandwidth and fewer cables to manage. In addition, the higher speed allows fabrics to be connected with fewer inter-switch links (ISLs), which is especially helpful in large fabrics. Gen 5 Fibre Channel includes retimers in the optical modules and transmitter training, features that improve link performance characteristics, electronic dispersion compensation and backplane links. Because of Fibre Channel s backward compatibility, host servers can deploy 16Gb FC HBAs and 16Gb FC switches independently, as each work with the two previous generations (8Gb and 4Gb). Table 1 Fibre Channel Speed Characteristics Speed Throughput (Mbps) Line Rate (Gbps) Encoding Retimers in the module Transmitter training 1 GFC 100 1.0625 8b/10b No No 2 GFC 200 2.125 8b/10b No No 4 GFC 400 4.25 8b/10b No No 8 GFC 800 8.5 8b/10b No No 16 GFC 1600 14.025 64b/66b Yes Yes Table 2 Fiber Optic Cable Link Distance Speed Multi-Mode Single-Mode OM1 OM2 OM3 OM4 OS1 1 GFC 300 500 860 * 10,000 2 GFC 150 300 500 * 10,000 4 GFC 50 150 380 400 10,000 8 GFC 21 50 150 190 10,000 16 GFC 15 35 100 125 10,000 * The link distance for OM4 fiber optic cable has not been defined for these speeds.

Summary and Conclusion Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 13 of 16 We believe that the HP 3PAR StoreServ 7450 All-flash storage system provides excellent performance for mission critical workloads. This storage system is well-suited to handle mixed workloads thereby providing a fast ROI to customers investing in an all-flash storage system. In our opinion, all-flash arrays do well with database workloads because they provide low latency in addition to higher raw performance.

Appendix Test Description and Environment Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 14 of 16 Database workloads were run on the three servers combined to provide an intensive mixture of random and sequential I/Os on the storage system, stressing the storage system. This set of tests was designed to be representative of a typical mission-critical environment with mixed database workloads. These tests were run with the Gen 5 (16Gb) infrastructure running at 16 Gb/sec including the Fibre Channel HBAs in the servers and the 16Gb Fibre Channel switch. The HP 3PAR StoreServ 7450 storage system used 4x 8Gb Fibre Channel host ports.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 15 of 16 Server #1 HP ProLiant DL380 G7 2x Intel Xeon X5690, 3.47GHz, 12 total cores, 24 logical processors 28GB RAM HP SN1000E 16Gb FC HBA (Emulex LPe16002) Microsoft Windows Server 2012 Microsoft SQL Server 2012, Microsoft TPC-E benchmark kit Server #2 HP ProLiant DL380 Gen8 2x Intel Xeon E5-2609, 2.4GHz, 8 total cores, 24 logical processors 64GB RAM HP SN1000E 16Gb FC HBA (Emulex LPe16002) Microsoft Windows Server 2012 Microsoft SQL Server 2012, Microsoft TPC-E benchmark kit Server #3 HP ProLiant DL980 G7 8x Intel Xeon E7-2870, 2.4GHz, 80 total cores, 160 logical processors 320GB RAM HP SN1000E 16Gb FC HBA (Emulex LPe16002) Microsoft Windows Server 2012 Microsoft SQL Server 2012, HammerDB 2.14 and TPC-H benchmark kit Fibre Channel Switch HP StoreFabric SN6000B Fibre Channel Switch Storage Array HP 3PAR StoreServ 7450 Storage 3PAR Operating System 312 MU2 48x 400GB 6Gb SAS SSD RAID5 configuration for database volumes RAID10 configuration for log volumes 4x 8GFC host ports Note: HP 3PAR StoreServ 7000 and StoreServ 10000 Storage arrays currently use Emulex quad-port 8GFC HBAs, configured in target mode.

Database Acceleration with HP 3PAR StoreServ 7450 Storage Page 16 of 16 The original version of this document is available at: http://www.demartek.com/demartek_hp_3par_storeserv_flashstorage_2014-05.html on the Demartek website. Intel and Xeon are registered trademarks of Intel Corporation. Microsoft, Windows, and Windows Server are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. TPC Benchmark and TPC-H are trademarks of the Transaction Performance Council. Demartek is a trademark of Demartek, LLC. All other trademarks are the property of their respective owners.