Flash Databases: High Performance and High Availability

Similar documents
Top 10 Reasons why MySQL Experts Switch to SchoonerSQL - Solving the common problems users face with MySQL

High Availability Solutions for the MariaDB and MySQL Database


Scaling Database Performance in Azure

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

<Insert Picture Here> Oracle In-Memory Database Cache Overview

Real-time Data Replication

ScaleArc idb Solution for SQL Server Deployments

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

Cloud DBMS: An Overview. Shan-Hung Wu, NetDB CS, NTHU Spring, 2015

TOP FIVE REASONS WHY CUSTOMERS USE EMC AND VMWARE TO VIRTUALIZE ORACLE ENVIRONMENTS

ScaleArc for SQL Server

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Cloud Based Application Architectures using Smart Computing

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

MySQL Enterprise Edition Most secure, scalable MySQL Database, Online Backup, Development/Monitoring Tools, backed by Oracle Premier Lifetime Support

How to choose High Availability solutions for MySQL MySQL UC 2010 Yves Trudeau Read by Peter Zaitsev. Percona Inc MySQLPerformanceBlog.

SERVER VIRTUALIZATION IN MANUFACTURING

Overview: X5 Generation Database Machines

VirtualclientTechnology 2011 July

Copyright 2012 EMC Corporation. All rights reserved.

be architected pool of servers reliability and

Relational Databases in the Cloud

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Instant-On Enterprise

EMC VPLEX FAMILY. Continuous Availability and Data Mobility Within and Across Data Centers

Tushar Joshi Turtle Networks Ltd

Microsoft SharePoint 2010 on VMware Availability and Recovery Options. Microsoft SharePoint 2010 on VMware Availability and Recovery Options

Rackspace Cloud Databases and Container-based Virtualization

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

Server Virtualization in Manufacturing

SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V

MakeMyTrip CUSTOMER SUCCESS STORY

Scalable Architecture on Amazon AWS Cloud

Eloquence Training What s new in Eloquence B.08.00

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Business Continuity with the. Concerto 7000 All Flash Array. Layers of Protection for Here, Near and Anywhere Data Availability

How To Scale Myroster With Flash Memory From Hgst On A Flash Flash Flash Memory On A Slave Server

SQL Server Virtualization

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

COMPARISON OF VMware VSHPERE HA/FT vs stratus

Veritas Storage Foundation High Availability for Windows by Symantec

High Availability Database Solutions. for PostgreSQL & Postgres Plus

Symantec Storage Foundation High Availability for Windows

Red Hat Satellite Management and automation of your Red Hat Enterprise Linux environment

Tier Architectures. Kathleen Durant CS 3200

High Availability for Database Systems in Cloud Computing Environments. Ashraf Aboulnaga University of Waterloo

OLTP Meets Bigdata, Challenges, Options, and Future Saibabu Devabhaktuni

High Availability Solutions for MySQL. Lenz Grimmer DrupalCon 2008, Szeged, Hungary

Building Private Cloud Architectures

HRG Assessment: Stratus everrun Enterprise

Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise

ORACLE DATABASE 10G ENTERPRISE EDITION

MySQL Cluster New Features. Johan Andersson MySQL Cluster Consulting johan.andersson@sun.com

Deployment Options for Microsoft Hyper-V Server

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

Software Performance, Scalability, and Availability Specifications V 3.0

Welcome to Virtual Developer Day MySQL!

Fax Server Cluster Configuration

Guide to Scaling OpenLDAP

TABLE OF CONTENTS THE SHAREPOINT MVP GUIDE TO ACHIEVING HIGH AVAILABILITY FOR SHAREPOINT DATA. Introduction. Examining Third-Party Replication Models

SQL Server Virtualization 101. David Klee, Group Principal and Practice Lead. SQL PASS Virtualization VC,

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

Building a Flash Fabric

Module 14: Scalability and High Availability

EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server

Virtualizing Exchange

Leveraging the Cloud. September 22, Digital Government Institute Cloud-Enabled Government Conference Washington, DC

Maximum Availability Architecture

Introduction 1 Performance on Hosted Server 1. Benchmarks 2. System Requirements 7 Load Balancing 7

Architectures Haute-Dispo Joffrey MICHAÏE Consultant MySQL

Tips and Tricks for Using Oracle TimesTen In-Memory Database in the Application Tier

The Benefits of Virtualizing

Veritas InfoScale Availability

Veritas Cluster Server from Symantec

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Software-Defined Networks Powered by VellOS

MySQL Reference Architectures for Massively Scalable Web Infrastructure

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

MySQL High-Availability and Scale-Out architectures

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

MySQL CLUSTER. Scaling write operations, as well as reads, across commodity hardware. Low latency for a real-time user experience

Database Decisions: Performance, manageability and availability considerations in choosing a database

SQL Server on Azure An e2e Overview. Nosheen Syed Principal Group Program Manager Microsoft

Solution Brief Availability and Recovery Options: Microsoft Exchange Solutions on VMware

VERITAS Business Solutions. for DB2

MySQL. Leveraging. Features for Availability & Scalability ABSTRACT: By Srinivasa Krishna Mamillapalli

Online Transaction Processing in SQL Server 2008

Maxta Storage Platform Enterprise Storage Re-defined

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework

BigMemory & Hybris : Working together to improve the e-commerce customer experience

Virtualization Support - Real Backups of Virtual Environments

Database Scalability and Oracle 12c

MySQL Replication. openark.org

Transcription:

Flash Databases: High Performance and High Availability Flash Memory Summit Software Tutorial August 11,2011 Dr John R Busch Founder and CTO Schooner Information Technology JohnBusch@SchoonerInfoTechcom

Business Most Valuable Asset: Its Data Data Most important and valuable component of modern applications and websites Driving revolutionary changes in computing and the internet New opportunities for generating revenue More efficient use of current business processes and infrastructure Data access downtime or poor performance has a major cost to a business bottom line

The Mission-Critical Imperative Let me tell you the difference between Facebook and everyone else, we don't crash EVER! If our service is down for even a day, our entire reputation is irreversibly destroyed! Facebook and Google invest hundreds of millions of dollars every year on custom software and hardware infrastructure to optimize availability, performance, administration, and cost

Mission Critical Imperative Maintaining data availability and response time is critical for key classes of businesses Web 20 ecommerce High-volume websites Telecommunications IT departments and application developers seek architectures and deployments providing high service availability resilient performance scalability Meet rising service demand while controlling capital and operating expenses

Mission-Critical Database Requirements

Mission-Critical Database Goals and Metrics High Availability High Data Integrity High Performance and Scalability Simple and powerful administration Cost effective Goals Standards and Compatibility Metrics Service unavailability (minutes/year) from failures, disaster recovery, or during planned administration Probability of data loss or corruption; data consistency levels Transaction throughput, response time; performance scalability; performance stability Ease of cluster administration; fail-over automation; monitoring and optimization tools Total cost of ownership (TCO); return on investment (ROI) Level of standards compliance and certification

Opportunity for Large Impact on All Mission Critical Dimensions Flash Multi-core Optimized Database Architecture Cloud

Tightly-Coupled Flash Optimized Database with Synchronous Replication Cluster Admin clients clients Concurrently Executing Transactions Parallel Execution Threads MySQL Parallel Replication Threads Concurrently Executing Transactions Parallel Execution Threads MySQL Parallel Replication Threads OS Standard X86 Server Parallel Synchronous Replication During Transaction Execution OS Standard X86 Server

Key Resource Management Algorithm Design Requirements Processor Multi-core scalability fine-grained locking, concurrent data structures Storage Log files on HDD with persistent DRAM controller Fast, saves flash for high access data Concurrent DRAM buffer-pool management algorithms Multi-threaded background write of dirty blocks so clean on misses Batched commits Highly-parallel multi-threaded flash-memory access Utilizes ~150k IOPS for balancing a 2 socket Westmere Server with 64GB DRAM Flash Cache give ~80% throughput if database working set fits in flash: must size Network Memory to memory multi-threaded parallel synchronous replication

Tightly Coupled Database Design Enables Effective Vertical Scaling with Commodity Flash Memory and Horizontal Scaling DBT2 open-source OLTP version of TPC- C 1000 warehouses, 32 connections 0 think-time Result metric: TPM (new order) Measurement Configuration 2 node Master-Slave configuration 2 socket Westmere 72GB DRAM Transaction Throughput with Hard Disc Drives

Lower Response Times Response Time (ms)

Higher Performance Stability 55 Async 55 Semi-sync SAC

Increased Service Availability and Data Integrity

Lower Total Cost of Ownership Lower Cost Reduced capital and operating costs through reduction in servers, power, space, admin Savings from increased service availability and associated revenue and customer retention TCO and ROI models are customer and workload specific Function (throughput/server; server, rack, and network costs, software license and support costs, admin costs; space and power costs; cost of downtime)

Simplified Administration Fail-over can be completely automatic and instant requiring no administrator intervention or service interruption Cluster Administrator GUI and CLI can provide a single point for clusterwide management single click slave creation and database migration; monitoring; trouble-shooting; tuning

Mission-Critical Database Best Practices High Availability High Data Integrity Excellent Performance and Scalability Simple and powerful administration Cost effective Goal Standards and Compatibility Best Practice Replication (synch local and asynch parallel WAN); automation of failure detection and recovery Synchronous replication to eliminate data loss and fully consistent data; combined with parallel asynchronous replication for WAN disaster recovery Effective vertical and horizontal scaling for exploiting flash and multicore Centralized management; automation; visibility (statistics); alerts Leverage commodity hardware and software; achieve high hardware utilization 100% standards compliance and certification

Cloud Requirements and Challenges for Scaled Enterprise Services Cloud providers must deliver: guaranteed service availability, performance, and elastic scale multi-tenant management and security and a net TCO savings vs dedicated data centers Barriers in deploying enterprise class services into the cloud at scale For many classes of applications and services: the realized performance and availability characteristics of cloud deployments are disappointing at scale the large quantity of cloud instances needed to support scaling a deployment drive the cost of cloud deployment to unacceptable levels Opportunity for flash, but innovation is required

Current Cloud Virtualization : Successes and Limitations Cloud server-virtualization Provisioning application instances in virtual machines on servers combine existing applications with multi-core systems to increase utilization elasticity of service capacity through dynamic provisioning of more or fewer application instances based on the current workload demand Successes applications that scale horizontally and can run under a VM hypervisor within a server s DRAM (eg web application tier) works well for low volume apps and services (start-ups, new games, ) Problems : scaled production databases virtualization kills performance if they do not fit in DRAM

Cloud Virtualization Impact on Production Databases Databases in production cloud environments: provide additional data partitioning (very small data bases) provide additional caching layers to minimize I/O (breaks ACID) provision many more database instances than in a nonvirtualized environment Net Impact drives up application and management complexity increases cost reduces service availability and data integrity Less than 10 percent of production data-tier server workloads are virtualized today

Fusing Cloud + Flash + Optimized Databases Short term virtualized machine instances for the web and application tiers non-virtualized, vertically scaling data-tier solutions Exploit balanced commodity, flash-based, multi-core system configurations custom management APIs and tools to link together in a hybrid cloud

Fusing Cloud + Flash + Optimized Databases Longer Term : Innovation Required Need improved virtualization technologies Flash optimized virtualization cutting flash access overhead unified virtual administration model applicable to all tiers in the data center including flash-optimized data tier dynamic provisioning, management, monitoring, and accounting Large potential Quality of Service and TCO Benefits increased performance, scalability, and service availability reduced capital and operating expenses

Thank You! Schooners, first built in the 1700s, applied an innovative design to the standard cargo sailing ship, enabling stupendous levels of speed and range They enabled a set of visionary companies to enter new markets on a global basis Where can a Schooner take your company?

The Schooner MySQL Schooner MySQL with Active Cluster Advantage with Active Cluster High Availability No service interruption for planned or unplanned database downtime Instant automatic fail-over On-line upgrade and migration 90% less downtime vs MySQL 55 Full WAN support with master autofailover Great Performance and Scalability 4-20x more throughput/server vs MySQL 55 High performance synchronous and asynchronous replication Compelling Economics Cut server capex (consolidation) Cut opex (power, pipe, DBA time) Increase revenue (eliminate service interruptions) TCO 70% cheaper than MySQL 55 Data Integrity No lost data Cluster-wide data consistency Visibility and Control Easy cluster administration No error-prone manual processes Monitoring and Optimization 100% MySQL Enterprise Compatible and Certified Broad Industry Deployments ecommerce, telco, financial, high volume websites Out-of-the-box Product Full MySQL + InnoDB: not a toolkit Free your staff to build your business, not a custom database wwwschoonerinfotechco m

How Best to Provide WAN Replication and Disaster WAN/geographically dispersed data centers Typically requires Asynchronous replication Can t add additional ~200ms with high potential variance to query response time for synchronous replication Goal: WAN asynch slave should automatically fail-over when synchronous master fail-over occurs Requires WAN asynch replication to be loosely integrated with synch replication group Goal : Limit remote slave lag and recovery to ~ WAN latency Maximize WAN data consistency Minimize disaster recovery time Requires high performance asynchronous replication Need multi-threaded asynch for parallelizing updates

Limitations of Synchronous Replication: How Best to Scale Database Queries Query Scaling in a Synchronous Replication Group Fully replicated Master/Slave cluster No cluster overhead for adding queries to a slave Can add synchronous query nodes linearly Update synchronization and cluster management eventually limit workload dependent With partitioned databases, scaling is sub-linear with severe cross node query degradation

Limitations of Synchronous Replication: How Best To Scale Queries (ctd) 1 Synch Replication Group + Multiple Integrated Asynchronous Replication Groups Can infinitely scale reads No data loss, auto-failover near zero slave lag requires asynchronous parallel update slave replication

Limitations of Synchronous Replication: How Best to Scale Updates Database Update Scalability Vertically scale with commodity : flash memory, more cores, higher frequency Compelling option exploiting low cost, high performance commodity technology

Limitations of Synchronous Replication: How Best to Scale Updates (ctd) Database Update Scalability After Optimal Vertical Scaling: Horizontally Scale Through Sharding Application Transparent Automated internal partitioning (MySQL NDB Cluster and Clustrix)» High query performance sensitivity (very slow cross partition queries) Administrator analysis and set-up tools (Schooner MySQL Active Cluster)» DB Shards allows layout and query data access optimization Application Aware

MySQL Specific Database Alternatives for Mission-Critical Deployments Fail-Over Downtime Automated Failover MySQL 55 Minuteshours MySQL NDB Cluster Clustrix Cost 2011 Schooner (TCO) Information Technology, p High Med High Low Schooner MySQL Active Cluster seconds seconds seconds No Yes Yes Yes Data Loss Yes No No No Data Consistency No Yes Yes Yes Performance Med Med/Low Med/Low High Scalability Low Med/Low Med/Low High Ease of Management WAN perf and failover Low High High High No No No Yes InnoDB Compatible High Med/Low Med/Low High Custom Hardware No No Yes No

MySQL-Independent Replication: Linux DRBD Passive Standby Master MySQL clients Master MySQL MySQL Passive SlaveReplicatio n : MySQL Linux DRBD Passive MySQL Standby Master Eliminates data loss Master failure-over to standby in minutes But not transitionally consistent: Stand-by master cannot service load No warm re-start => hours for full service Can propagate corruptions(no log checksums) Linux Standard X86 Server Disc block level sync replication Linux Standard X86 Server Slaves are still operating with asynchronou replication => same issues as MySQL 55/5 high administrative complexity reduced service availability inconsistent slave data poor performance high TCO Linux DRBD (Distributed Replicated Block Device)

MySQL-Independent Replication for Heterogeneous Database Interoperability Very loosely-coupled external replication services based on MySQL s asynchronous replication Bin log Oracle Golden Gate: Converts MySQL Asynchronous Bin log to a common log format Heterogeneous database replication interoperability: Oracle, IBM DB2, and Microsoft SQL Server Continuent Tungsten Replicator Converts the MySQL asynchronous Bin log to a transaction history log Uses JDBC through a client proxy to access MySQL indirectly Heterogeneous database replication interoperability: PostgreSQL If used in MySQL Master - Slave deployments: Performance is significantly worse than MySQL 55/56 Same issues as all loosely coupled asynch Bin log approaches reduced service availability poor data integrity high administrative complexity high TCO

MySQL Database Alternatives for Mission-Critical Deployments MySQL 55 MySQL NDB Cluster Clustri x Linux DRDB Continue nt Tungste n Golde n Gate Schooner MySQL Active Cluster Fail-Over Downtime Minuteshours seconds seconds minute s seconds Minutes -hours seconds Automated Failover No Yes Yes No Yes No Yes Data Loss Yes No No No Yes Yes No Data Consistency No Yes Yes No No No Yes Performance Med Med/Low Med/ Low Scalability Low Med/Low Med/ Low Ease of Management WAN perf and auto- fail-over InnoDB Compatible Med Low Low High Low Low Low High Low High High Low Med Med High No No No No No No Yes High Med/Low Med/ Low High High High High Custom No No Yes No No No No Hardware