CERN Achieves Database Scalability and Performance with Oracle and NetApp session S319046



Similar documents
Experience in running relational databases on clustered storage

10th TF-Storage Meeting

The future of Storage and Storage Management Using Virtualization to Increase Productivity. Storyflex VISION 2007 Hans Lamprecht NetApp SEE Vienna

FlexArray Virtualization

Uncompromised business agility with Oracle, NetApp and VMware

Datasheet NetApp FAS6200 Series

Scalable NAS for Oracle: Gateway to the (NFS) future

NetApp Storage. Krzysztof Celmer NetApp Poland. Virtualized Dynamic Infrastructure. Applications. Virtualized Storage. Servers

Entry level solutions: - FAS 22x0 series - Ontap Edge. Christophe Danjou Technical Partner Manager

NetApp Unified Storage

Technical Case Study CERN the European Organization for Nuclear Research

High Availability Databases based on Oracle 10g RAC on Linux

Server and Storage Virtualization with IP Storage. David Dale, NetApp

Datasheet The New NetApp FAS3200 Series Enables Flash, Clustering to Improve IT Agility and Performance

Pricing - overview of available configurations

FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency

NetApp FAS2000 Series

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

Deploying NetApp Storage Solutions in a Database Environment

Ultimate Guide to Oracle Storage

Virtual server management: Top tips on managing storage in virtual server environments

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

EMC Unified Storage for Oracle Database 11g/10g Virtualized Solution. Enabled by EMC Celerra and Linux using NFS and DNFS. Reference Architecture

(Scale Out NAS System)

Datasheet NetApp FAS8000 Series

Oracle Maximum Availability Architecture with Exadata Database Machine. Morana Kobal Butković Principal Sales Consultant Oracle Hrvatska

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

NetApp Replication-based Backup

AIX NFS Client Performance Improvements for Databases on NAS

Introduction to NetApp Infinite Volume

Oracle Database Deployments with EMC CLARiiON AX4 Storage Systems

Experience and Lessons learnt from running High Availability Databases on Network Attached Storage

EMC Unified Storage for Microsoft SQL Server 2008

The Leader in Unified All-Flash Storage. All-Flash Arrays for Enterprise and Cloud August 2013

New Cluster-Ready FAS3200 Models

NetApp FAS3200 Series

HPC Advisory Council

NetApp for Oracle Database

Best Practices for Data Sharing in a Grid Distributed SAS Environment. Updated July 2010

Tier0 plans and security and backup policy proposals

June Blade.org 2009 ALL RIGHTS RESERVED

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

mplementing Oracle11g Database over NFSv4 from a Shared Backend Storage Bikash Roy Choudhury (NetApp) Steve Dickson (Red Hat)

Datasheet NetApp FAS3200 Series

The Data Placement Challenge

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER

Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers

REDUCING DATABASE TOTAL COST OF OWNERSHIP WITH FLASH

Virtualize Without Compromise. Protecting and Storing Virtualized Data

Extend the Benefits of VMware vsphere with NetApp Storage

Using NetApp Unified Connect to Create a Converged Data Center

Reducing the Cost and Complexity in the Data Centre

Performance Baseline of Hitachi Data Systems HUS VM All Flash Array for Oracle

ZFS Administration 1

ORACLE DATABASE HIGH AVAILABILITY STRATEGY, ARCHITECTURE AND SOLUTIONS

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

Solution Brief July All-Flash Server-Side Storage for Oracle Real Application Clusters (RAC) on Oracle Linux

RFP-MM Enterprise Storage Addendum 1

Symantec Enterprise Vault And NetApp Better Together

Choosing Storage Systems

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

EMC Backup and Recovery for Oracle Database 11g Without Hot Backup Mode using DNFS and Automatic Storage Management on Fibre Channel

OnCommand Report 1.2. OnCommand Report User Guide. NetApp, Inc. 495 East Java Drive Sunnyvale, CA U.S.

Introducing NetApp FAS2500 series. Marek Stopka Senior System Engineer ALEF Distribution CZ s.r.o.

Virtualizing SQL Server 2008 Using EMC VNX Series and Microsoft Windows Server 2008 R2 Hyper-V. Reference Architecture

SAS System for Windows: Integrating with a Network Appliance Filer Mark Hayakawa, Technical Marketing Engineer, Network Appliance, Inc.

SMB Direct for SQL Server and Private Cloud

iscsi SANs: Ideal Applications, Large and Small Jason Blosil, NetApp Gary Gumanow, Dell

High Availability and MetroCluster Configuration Guide For 7-Mode

Clustered Data ONTAP 8.3 Administration and Data Protection Training

DPAD Introduction. EMC Data Protection and Availability Division. Copyright 2011 EMC Corporation. All rights reserved.

3 Red Hat Enterprise Linux 6 Consolidation

Building a Scalable Storage with InfiniBand

Maurice Askinazi Ofer Rind Tony Wong. Cornell Nov. 2, 2010 Storage at BNL

NetApp FAS Mailbox Exchange 2010 Mailbox Resiliency Storage Solution

NetApp FAS/V-Series. NetApp BDM

Tároló vezérelt infrastruktúrák

Vault 8000 Series. Product Overview KEY BENEFITS

SLIDE 1 Previous Next Exit

IP Storage in the Enterprise Now? Why? Daniel G. Webster Unified Storage Specialist Commercial Accounts

Oracle Database 11g Release 2 Performance: Protocol Comparison Using Clustered Data ONTAP 8.1.1

Virtual Desktop Infrastructure (VDI) Overview

OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006

Storage Management for the Oracle Database on Red Hat Enterprise Linux 6: Using ASM With or Without ASMLib

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Business-centric Storage for small and medium-sized enterprises. How ETERNUS DX powered by Intel Xeon processors improves data management

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

modular Storage Solutions MSS Series

Backup and Recovery Redesign with Deduplication

Symantec NetBackup 5000 Appliance Series

Implementing Enterprise Disk Arrays Using Open Source Software. Marc Smith Mott Community College - Flint, MI Merit Member Conference 2012

Philip Lawrence. Senior Solution Architect Sun Microsystems UK & Ireland

ORACLE 11g AND 12c DATABASE CONSOLIDATION AND WORKLOAD SCALABILITY WITH EMC XTREMIO 3.0

Solutions Brief. Unified All Flash Storage. FlashPoint Partner Program. Featuring. Sustainable Storage. S-Class Flash Storage Systems

EMC Business Continuity for Microsoft SQL Server 2008

Business-centric Storage for small and medium-sized enterprises. How ETERNUS DX powered by Intel Xeon processors improves data management

EMC ISILON NL-SERIES. Specifications. EMC Isilon NL400. EMC Isilon NL410 ARCHITECTURE

How To Store Data On A Server Or Hard Drive (For A Cloud)

Transcription:

CERN Achieves Database Scalability and Performance with Oracle and NetApp session S319046 Eric Grancher eric.grancher@cern.ch CERN IT department Steve Daniel steve.daniel@netapp.com NetApp Image courtesy of Forschungszentrum Jülich / Seitenplan, with material from NASA, ESA and AURA/Caltech https://edms.cern.ch/document/1093461/1

Driving Storage as a Value Center Reduce Complexity Unified infrastructure Combines technology and process seamlessly Maximize Asset Utilization Storage efficiency Protect data while avoiding data duplication Provide multi-use datasets without copying Eliminate duplicate copies of data Reduce power, cooling & space consumption Control Hidden Costs Comprehensive data management Complete data protection Application-level end-to-end provisioning Policy-based automation 2010 NetApp. All rights reserved. 2

Flexible Storage A Single, Unified Platform Single, Unified Storage Platform Unified Management Low-to-High Scalability Multiple Networks Multiple Protocols Multiple Disks Same tools and processes: learn once, run everywhere SAN FC Integrated management NAS SATA Integrated data protection iscsi SSD Storage Virtualization Unified Flash Unified Scale Out Multi-vendor virtualization FlashCache SSD FlexCache 2010 NetApp. All rights reserved. 3

FlexClone Writable Copies Application development often requires substantial primary storage space for essential test operations such as platform and upgrade rollouts FlexClone improves storage efficiency for applications that need temporary, writable copies of data volumes Creates a virtual clone copy of the primary dataset and stores only the data changes between parent volume and clone Multiple clones are easily created Resulting space savings of 80% or more FlexVol 1 250 GB FlexClone Copy C FlexClone Copy B FlexClone Copy A FlexVol 2 500GB FlexVol 3 1TB RAID Group 1 RAID Group 0 RAID Group 2 Aggregate 2010 NetApp. All rights reserved. 4

NetApp Unified Storage Architecture High-end Data Center Mid-range Data Center FAS/V6080 Remote Office / Mid-size Enterprise FAS2020 68 TB 68 drives FAS2040 FAS2050 104 TB 104 drives 272 TB 136 drives FAS/V3170 FAS/V3160 FAS/V3140 840 TB 420 drives/ LUNs 1344 TB 672 drives/ LUNs 1680 TB 840 drives/luns FAS/V6040 1680 TB 840 drives/luns 2352 TB 1,176 drives/luns Unified storage architecture for SAN and NAS Data ONTAP provides a single application interface One set of management tools and software V-Series for heterogeneous storage virtualization 2010 NetApp. All rights reserved. 5

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 6

Personal introduction Joined CERN in 1996 to work on Oracle database parallelism features OakTable member since April 2005 Team leader for the Database Services section in the CERN IT department Specific interest in database application and storage performance 7

CERN CERN Annual budget: ~1000 MSFr (~600 M ) Staff members: 2650 Member states: 20 + 270 Fellows, + 440 Associates + 8000 CERN users Basic research Fundamental questions High E accelerator: Giant microscope (p=h/λ) Generate new particles (E=mc 2 ) Create Big Bang conditions J. Knobloch/CERN: European Jürgen Grid Initiative Knobloch- EGI cern-it Slide-8

Large Hadron Collider - LHC 1.9 K 16 micron squeeze 27 km circumference Cost ~ 3000 M (+ detectors) Proton-proton (or lead ion) collisions at 7+7 TeV Bunches of 10 11 protons cross every 25 nsec 600 million collisions/sec Physics questions Origin of mass (Higgs?) Dark matter? Symmetry matter-antimatter Forces supersymmetry Early universe quark-gluon plasma 10-9 to 10-12 J. Knobloch/CERN: European Grid Initiative EGI

LHC accelerator and experiments J. Knobloch/CERN: European Grid Initiative EGI

LHC Instantaneous Luminosity: August Record Slide from Ralph Assmann http://op-webtools.web.cern.ch/opwebtools/vistar/vistars.php?usr=lhc1 8:30 meeting

Events at LHC Luminosity : 10 34 cm -2 s -1 40 MHz every 25 ns 20 events overlaying J. Knobloch/CERN: European Grid Initiative EGI

Trigger & Data Acquisition J. Knobloch/CERN: European Grid Initiative EGI

Data Recording J. Knobloch/CERN: European Grid Initiative EGI

LHC Computing Challenge Signal/Noise 10-9 Data volume High rate * large number of channels * 4 experiments 15 PetaBytes of new data each year Compute power >140 sites ~150k CPU cores >50 PB disk Worldwide analysis & funding Computing funding locally in major regions & countries Efficient analysis everywhere GRID technology J. Knobloch/CERN: European Grid Initiative EGI

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 16

Oracle at CERN history 1982: Oracle at CERN Solaris SPARC 32 and 64 1996: Solaris SPARC with OPS 2000: Linux x86 on single node, DAS 2005: Linux x86_64 / RAC / EMC with ASM >=2006: Linux x86_64 / RAC / NFS / NetApp (now, 96 databases) 17

Accelerator databases (1/2) Use cases ACCCON Accelerator Settings and Controls Configuration necessary to drive all accelerator installations, unavailability may require to stop accelerator operation ACCLOG Accelerator long-term Logging database 3.5TB growth per month 18

ACCLOG daily growth (GB/day) 19

ACCLOG total space 20

Accelerator databases (2/2) Implementation Oracle RAC 10.2.0.5 with partitioning Intel x86_64 NetApp 3040 and 3140 with Data OnTap8-7 mode Example aggregate dbdska210 Data 12 August 2010 to ~mid July 2011 RAID-DP 30 SATA disks, each 2TB 2 raid groups 38 743GB usable 21

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 22

PVSS Oracle scalability Target = 150 000 changes per second (tested with 160k) 3 000 changes per client 5 nodes RAC 10.2.0.4 2 NAS 3040, each with one aggregate of 13 disks (10k rpm FC) 23

PVSS Oracle scalability Load on one of the instances, stable data loading 24

PVSS NetApp storage load NVRAM plays a critical role in order to have write operations happen quickly dbsrvc235>-rac>-pvsstest1:~/work/pvsstest/changestorage$ ssh -2 root@dbnasc210 sysstat -x 1 CPU NFS CIFS HTTP Total Net kb/s Disk kb/s Tape kb/s Cache Cache CP CP Disk in out read write read write age hit time ty util 64% 5506 0 0 5506 136147 1692 1568 207148 0 0 >60 100% 82% Df 79% 58% 5626 0 0 5626 139578 1697 1040 137420 0 0 >60 100% 62% D 58% 57% 5420 0 0 5420 127307 1618 1080 136384 0 0 >60 100% 79% D 62% 61% 5142 0 0 5142 130298 1562 1927 149545 0 0 >60 100% 57% Dn 57% 25

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 26

Scalability on IOPS DataOntap 8 enables stripping over large number of disks (depends on FAS model and disk size) Enables very good scalability 27

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 28

IOPS and flash cache Help to increase random IOPS on disks Warm-up effect will be an increasingly important issue (2 level of large caches is likely of help) For databases select volumes for which caching will benefit (not archive redo logs for example) set flexscale.lopri_blocks on 29

IOPS and flash cache 30

IOPS and flash cache 31

IOPS and flash cache 32

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 33

Scalability: bandwidth 10GbE 10GbE is becoming mainstream (cards, switches) TX: 289Mb (/s), RX: 6.24Gb (/s) TOTAL:6.52Gb (/s) CPU usage NAS: 3140 cluster Host: dual E5410 with Intel 82598EB 10-Gigabit card (19% CPU) 34

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 35

D(irect)-NFS One of the nicest features of Oracle11g Enables using multiple paths to storage Makes Oracle on NFS from simple to extremely simple Just a symlink in $ORACLE_HOME/lib List of paths to be declared Oracle instance running with ODM: Oracle Direct NFS ODM Library Version 2.0... Direct NFS: channel id [0] path [dbnasg301] to filer [dbnasg301] via local [] is UP Direct NFS: channel id [1] path [dbnasg301] to filer [dbnasg301] via local [] is UP Promising with NFS 4.1/pNFS Scalability, on demand Move of volumes, upgrades 36

Performance with DB usage Have reallocate enabled by default (backup!) and filesystem_options = setall (async+directio) NetApp NVRAM makes writing fast (see PVSS testcase) Key for OLTP commit time DataOntap 8 enables large aggregates (40TB on 3140, up to 100TB on 61xx) Gain in management Gain in performance NFS or TCP/IP overhead, CPU usage (large transfer): network roundtrip and disk access Scales much better than what many think 37

Oracle DB/NetApp tips Use NFS/DNFS (11.1 see Note 840059.1 /11.2) Resilient to errors TCP/IP and NFS extremely stable and mature Extremely simple, good productivity per DBA Use different volumes for log files, archive redo logs and data files Have several copies of control files and OCR on different aggregate / filer (at least different aggregates) Split storage network Cost for the switches is not very high Use MTU = 9000 on the storage network 38

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 39

Oracle VM tips NFS is extremely well suited for virtualisation Mount database volumes from the guest Separation OS/data Scalability (add mount points if necessary) Same as physical can easily migrate from physical to/from virtual Disk access might more expensive than local Limit swap (do you need any swap?) Check for file inexistence (ias SSL semaphores) 5.4. 10-6 second per stat system call on local filesystem 18.1. 10-6 second per stat system call on NFS mounted filesystem 40

Oracle VM live migration From Anton Topurov 41

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 42

Simplicity and availability Simplicity Shared log files for the database (tail alertsid*.log) No need for ASM, day to day simpler operations Operations under stress made easier (copy control file with RMAN) Rename a file in ASM 10.2? Install a 2 nodes RAC with NFS or ASM (multi-pathing, raw on 10.2, FC drivers, ASM...) Reliability Do a snapshot before upgrade Simplicity is key for reliability (even experienced DBA do basic errors linked with complex storage) More robust than ASM normal redundancy RAID-DP (double parity) 43

Disk and redundancy (1/2) Disks are larger and larger speed stay ~constant -> issue with speed bit error rate stay constant (10-14 to 10-16 ), increasing issue with availability With x as the size and α the bit error rate 44

Disks, redundancy comparison (2/2) 5 14 28 5 14 28 1 TB SATA desktop RAID 1 7.68E-02 Bit error rate 10^-14 RAID 5 (n+1) 3.29E-01 6.73E-01 8.93E-01 ~RAID 6 (n+2) 1.60E-14 1.46E-13 6.05E-13 ~triple mirror 8.00E-16 8.00E-16 8.00E-16 450GB FC RAID 1 4.00E-04 Bit error rate 10^-16 RAID 5 (n+1) 2.00E-03 5.58E-03 1.11E-02 ~RAID 6 (n+2) 7.20E-19 6.55E-18 2.72E-17 ~triple mirror 3.60E-20 3.60E-20 3.60E-20 1TB SATA enterprise RAID 1 7.96E-03 Bit error rate 10^-15 RAID 5 (n+1) 3.92E-02 1.06E-01 2.01E-01 ~RAID 6 (n+2) 1.60E-16 1.46E-15 6.05E-15 ~triple mirror 8.00E-18 8.00E-18 8.00E-18 10TB SATA enterprise RAID 1 7.68E-02 Bit error rate 10^-15 RAID 5 (n+1) 3.29E-01 6.73E-01 8.93E-01 ~RAID 6 (n+2) 1.60E-15 1.46E-14 6.05E-14 ~triple mirror 8E-17 8E-17 8E-17 45

Outline Few words about CERN and computing challenge Oracle and RAC at CERN and NetApp for accelerator databases example DataOntap 8 scalability PVSS to 150 000 changes/s IO operations per second Flash Cache 10GbE Oracle DB on NFS experience Oracle VM experience Reliability and simplicity Conclusions 46

NetApp platform benefits Well supported (recommendations at NetApp NOW and Oracle MOS) Well managed (AutoSupport, new DOT releases include firmware/...) Very good scalability in performance and size with Data Ontap 8 Impressive stability, cluster failover just works, non-disruptive upgrade (all upgrades since 2006) Checksum, scrubbing, multipathing... RAID-DP double parity (always more important) Snapshots and associated feature 47

Conclusion CERN has standardised part of its database infrastructure (all for accelerators, mass storage and administrative applications) on NetApp/NFS DataOntap 8 (7 mode) provides scalability, ease of maintenance and management Our experience is that Oracle/NFS on NetApp is a rock-solid combination, providing performance and scalability Scalability with 64bits aggregate, 10Gb/s Ethernet, Direct NFS, flash caching Oracle VM on NFS is simple, extensible and stable 48

Q&A session S319046 Steve Daniel, Steve.Daniel@netapp.com Eric Grancher, Eric.Grancher@cern.ch 49

References Required Diagnostic for Direct NFS Issues and Recommended Patches for 11.1.0.7 Version https://supporthtml.oracle.com/ep/faces/secure/km/documentdisplay.jspx?id=840059.1&h =Y Oracle : The Database Management System For LEP http://cdsweb.cern.ch/record/443114 Oracle 11g Release 1 Performance: Protocol Comparison on Red Hat Enterprise Linux 5 Update 1 http://media.netapp.com/documents/tr- 3700.pdf 50