Experience in running relational databases on clustered storage

Similar documents
Backup and restore of Oracle databases: introducing a disk layer

10th TF-Storage Meeting

Pricing - overview of available configurations

Introducing NetApp FAS2500 series. Marek Stopka Senior System Engineer ALEF Distribution CZ s.r.o.

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

Entry level solutions: - FAS 22x0 series - Ontap Edge. Christophe Danjou Technical Partner Manager

Introduction to NetApp Infinite Volume

Datasheet The New NetApp FAS3200 Series Enables Flash, Clustering to Improve IT Agility and Performance

Flash Pool Design and Implementation Guide

NetApp Storage. Krzysztof Celmer NetApp Poland. Virtualized Dynamic Infrastructure. Applications. Virtualized Storage. Servers

Clustered Data ONTAP 8.3 Administration and Data Protection Training

FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency

Datasheet NetApp FAS8000 Series

The Data Placement Challenge

RFP-MM Enterprise Storage Addendum 1

NetApp for Oracle Database

SolidFire and NetApp All-Flash FAS Architectural Comparison

June Blade.org 2009 ALL RIGHTS RESERVED

ENABLING RHEV WITH NETAPP STORAGE. Jon Benedict Systems Architect, NetApp Geert Jansen RHEV Product Marketing Manager, Red Hat May 6, 2011

Flash Accel, Flash Cache, Flash Pool, Flash Ray Was? Wann? Wie?

How To Write A Server On A Flash Memory On A Perforce Server

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

The future of Storage and Storage Management Using Virtualization to Increase Productivity. Storyflex VISION 2007 Hans Lamprecht NetApp SEE Vienna

Evaluation of Cloud ONTAP and AltaVault using AWS

CASE STUDY SAGA - FcoE

SnapManager for Oracle 2.2. Anand Ranganathan Product & Partner Engineer (PPE)

New Cluster-Ready FAS3200 Models

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

NetApp FAS Mailbox Exchange 2010 Mailbox Resiliency Storage Solution

Datasheet NetApp FAS6200 Series

Hyperscale Use Cases for Scaling Out with Flash. David Olszewski

Ultimate Guide to Oracle Storage

Investing Strategically in All Flash Arrays

Enhancements of ETERNUS DX / SF

CERN Achieves Database Scalability and Performance with Oracle and NetApp session S319046

Re-Invent Your Recovery

Uncompromised business agility with Oracle, NetApp and VMware

Experience and Lessons learnt from running High Availability Databases on Network Attached Storage

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software

NetApp FAS2000 Series

Get Success in Passing Your Certification Exam at first attempt!

FlexArray Virtualization

Microsoft Exchange Server 2010 and SnapManager for Exchange on Data ONTAP 8.1 Cluster-Mode Best Practices Guide

EqualLogic PS Series Load Balancers and Tiering, a Look Under the Covers. Keith Swindell Dell Storage Product Planning Manager

mplementing Oracle11g Database over NFSv4 from a Shared Backend Storage Bikash Roy Choudhury (NetApp) Steve Dickson (Red Hat)

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

EMC Backup and Recovery for Oracle Database 11g Without Hot Backup Mode using DNFS and Automatic Storage Management on Fibre Channel

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

Worry-free Storage. E-Series Simple SAN Storage

SAP Running on an EMC Virtualized Infrastructure and SAP Deployment of Fully Automated Storage Tiering

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

Comparison of Hybrid Flash Storage System Performance

WHITE PAPER. Software Defined Storage Hydrates the Cloud

Best Practices for Managing Storage in the Most Challenging Environments

FlexPod for VMware The Journey to Virtualization and the Cloud

Addendum No. 1 to Packet No Enterprise Data Storage Solution and Strategy for the Ingham County MIS Department

NOTICE ADDENDUM NO. TWO (2) JULY 8, 2011 CITY OF RIVIERA BEACH BID NO SERVER VIRTULIZATION/SAN PROJECT

NetApp Unified Storage

Understanding Enterprise NAS

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

How To Store Data On A Server Or Hard Drive (For A Cloud)

Nondisruptive Operations and Clustered Data ONTAP

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

SoftLayer Fundamentals. Storage and Backup. August, 2014

Successfully Deploying Alternative Storage Architectures for Hadoop Gus Horn Iyer Venkatesan NetApp

Top 10 Things You Always Wanted to Know About Automatic Storage Management But Were Afraid to Ask

NEXENTA S VDI SOLUTIONS BRAD STONE GENERAL MANAGER NEXENTA GREATERCHINA

<Insert Picture Here> Oracle Cloud Storage. Morana Kobal Butković Principal Sales Consultant Oracle Hrvatska

High Availability and MetroCluster Configuration Guide For 7-Mode

High Availability Databases based on Oracle 10g RAC on Linux

Solid State Storage in the Evolution of the Data Center

Oracle Maximum Availability Architecture with Exadata Database Machine. Morana Kobal Butković Principal Sales Consultant Oracle Hrvatska

Oracle Database 11g Release 2 Performance: Protocol Comparison Using Clustered Data ONTAP 8.1.1

ORACLE DATABASE HIGH AVAILABILITY STRATEGY, ARCHITECTURE AND SOLUTIONS

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Distributed File System Choices: Red Hat Storage, GFS2 & pnfs

Maxta Storage Platform Enterprise Storage Re-defined

Xangati Storage Solution Brief. Optimizing Virtual Infrastructure Storage Systems with Xangati

Product Overview. UNIFIED COMPUTING Managed Hosting - Storage Data Sheet

Sizing and Best Practices for Deploying Microsoft Exchange Server 2010 on VMware vsphere and Dell EqualLogic Storage

Building a Flash Fabric

New Features in PSP2 for SANsymphony -V10 Software-defined Storage Platform and DataCore Virtual SAN

PARALLELS CLOUD STORAGE

Clustered Data ONTAP 8.3

Big data Devices Apps

Software Defined Microsoft. PRESENTATION TITLE GOES HERE Siddhartha Roy Cloud + Enterprise Division Microsoft Corporation

NetApp FAS3200 Series

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Kaminario K2 All-Flash Array

Copy-Free Transition Guide

NetApp FAS/V-Series. NetApp BDM

89 Fifth Avenue, 7th Floor. New York, NY White Paper. HP 3PAR Adaptive Flash Cache: A Competitive Comparison

Blackboard Managed Hosting SM Disaster Recovery Planning Document

Transcription:

Experience in running relational databases on clustered storage Ruben.Gaspar.Aparicio_@_cern.ch CERN, IT Department CHEP 2015, Okinawa, Japan 13/04/2015

Agenda Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions 3

CERN s Databases ~100 Oracle databases, most of them RAC Mostly NAS storage plus some SAN with ASM ~600 TB of data files for production DBs in total Using a variety of Oracle technologies: Active Data Guard, Golden Gate, Clusterware, etc. Examples of critical production DBs: LHC logging database ~250 TB, expected growth up to ~90 TB / year 13 production experiments databases ~15-25 TB in each Read-only copies (Active Data Guard) Database on Demand (DBoD) single instances 172 MySQL Open community databases (5.6.17) 19 PostgreSQL databases (9.2.9) 9 Oracle11g databases (11.2.0.4) 4

A few 7-mode concepts client access Thin provisioning Private network Independent HA pairs raid_dp or raid4 File access NFS, CIFS Block access FC,FCoE, iscsi Remote Lan Manager Service Processor raid.scrub.schedule once weekly raid.media_scrub.rate constantly FlexVolume Rapid RAID Recovery reallocate Maintenance center (at least 2 spares) 5

A few C-mode concepts client access cluster node shell Cluster interconnect Private network Cluster mgmt network systemshell C-mode C-mode cluster ring show RDB: vifmgr + bcomd + vldb + mgmt Logging files from the controller no longer accessible by simple NFS export Cluster should never stop serving data Vserver (protected via Snapmirror) Global namespace 6

Agenda Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions 7

NAS evolution at CERN (last 8 years) FAS3000 scaling up FAS6200 & FAS8000 100% FC disks Flash pool/cache = 100% SATA disk + SSD 2gbps 6gbps DS14 mk4 FC DS4246 Data ONTAP 7-mode scaling out Data ONTAP Clustered-Mode 8

Network architecture Public Network Storage network 10GbE mtu 1500 10GbE Cluster mgmt network 1GbE 10GbE mtu 9000 2x10GbE 10 GbE Bare metal server trunking 10GbE Private Network 2x10GbE Cluster interconnect Just cabling of first element of each type is shown cabled Each switch is in fact a set of switches (4 in our latest setup) managed as one by HP Intelligent Resilient Framework (IRF) ALL our databases run with same network architecture NFSv3 is used for data access 9

Disk shelf cabling: SAS Owned by 1 st Controller Owned by 2 nd Controller SSD SAS loop at 6gpbs 12gbps per stack due to multi-pathing ~3GB/s per controller 10

Mount options Oracle and MySQL are well documented Mount Options for Oracle files when used with NFS on NAS devices (Doc ID 359515.1) Best Practices for Oracle Databases on NetApp Storage, TR- 3633 What are the mount options for databases on NetApp NFS? KB ID: 3010189 PostgreSQL not popular with NFS, though it works well if properly configured MTU 9000, reliable NFS stack e.g. NetApp NFS server implementation Do not underestimate impact 11

After setting new mount points options (peaks due to autovacuum): 12

Mount options: database layout Oracle RAC, cluster database: global namespace MySQL and PostgreSQL single instance 13

Agenda Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions 14

Flash Technologies Flash Cache Flash Pool Depending where SSD are located. Controllers Flash Cache Disk shelf Flash Pool Flash pool (hybrid aggregates) based on a Heat Map in order to decide which block stays and for how long Sequential data is not cached ( > 16KB). Data can not be pinned Works on random reads and writes workloads Writes (μs) warm up cache much faster (ms) Data is not sensible to cluster takeover/givebacks it reduces warm-up period 15

Agenda Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions 16

Backup management using snapshots Backup workflow: some time later mysql> FLUSH TABLES WITH READ LOCK; mysql> FLUSH LOGS; or Oracle>alter database begin backup; Or Postgresql> SELECT pg_start_backup('$snap'); snapshot resume new snapshot mysql> UNLOCK TABLES; Or Oracle>alter database end backup; or Postgresql> SELECT pg_stop_backup(), pg_create_restore_point('$snap'); 17

Snapshots for Backup and Recovery Storage-based technology Strategy independent of the RDBMS technology in use Speed-up of backups/restores: from hours/days to seconds SnapRestore requires a separate license API can be used by any application, not just RDBMS Consistency should be managed by the application Backup & Recovery API Oracle ADCR: 29TB size, ~ 10 TB archivelogs/day Alert log: 8 secs 18

Cloning of RDBMS Based on snapshot technology (FlexClone) on the storage. Requires license. FlexClone is a snapshot with a RW layer on top Space efficient: at first blocks are shared with parent file system We have developed our own API, RDBMS independent Archive logs are required to make the database consistent Solution being developed initially for MySQL and PostgreSQL on our DBoD service. Many use cases: Check application upgrade, database version upgrade, general testing Check state of your data on a snapshot (backup) Both clone and parent present similar performance 19

Cloning of RDBMS (II) 20

Agenda Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions 21

Vol move Powerful feature: rebalancing, interventions, whole volume granularity Transparent but watch-out on high IO (writes) volumes Based on SnapMirror technology Example vol move command: rac50::> vol move start -vserver vs1rac50 -volume movemetest -destination-aggregate aggr1_rac5071 -cutoverwindow 45 -cutover-attempts 3 -cutover-action defer_on_failure

Compression & deduplication Mainly used for Read Only data and our backup to disk solution (Oracle) It is transparent to applications NetApp compression provides similar gains as Oracle12c low compression level. It may vary depending on datasets compression ratio Total Space used: 641TB Savings due to compression and dedup: 682TB ~51.5% savings 23

Conclusions Positive experience so far running on C-mode Data safety features (raid_dp, scrubbing, checksum, ) has been proven to be very reliable but bugs may be encountered, relying on e.g. checksums at the application layer when available is advisable. Mid to high end NetApp NAS provide good performance using the flash pool SSD caching solution Design of stacks and network access require careful planning Cluster resilience has being proven in a number of planned interventions and unplanned incidents Online interventions are key for critical services Good contacts with vendor specialists has been proven to be very effective Flexibility with clustered ONTAP, helps to reduce the investment Same infrastructure used to provide iscsi object storage via CINDER New service functionality being built based on storage features 24

Questions 25

Flash Technologies Flash Cache Flash Pool Depending where SSD are located. Controllers Flash Cache Disk shelf Flash Pool Flash pool based on a Heat Map read read read hot warm neutral cold evict Eviction scanner write neutral overwrite cold evict Insert into SSD Write to disk Every 60 secs & SSD consumption > 75% Insert into SSD Eviction scanner 26

Flash pool + Oracle directnfs Oracle12c, enable dnfs by: $ORACLE_HOME/rdbms/lib/make -f ins_rdbms.mk dnfs_on