SCALE-OUT SAS GRID WITH XTREMIO & ISILON REDEFINE WORKLOAD AGILITY WITH ARCHITECTURAL FLEXIBILITY - TED BASILE & JOHN MALLORY



Session objectives
  Challenges of Application & Data Silos
  Big Data Defined (Data Lake Foundation)
  Why EMC Scale-out Storage for SAS Grid?
  Storage Architecture Flexibility improves SAS Workflow
  Impacts of XtremIO & Isilon in SAS environments

Every Big Data Journey is Unique
Big Data 1.0: Decentralized Data Warehousing
  Siloed approach is inefficient & complex
  Lack of cross-LOB collaboration
  Focused on the rearview mirror of the business
  Reporting What Happened
Big Data 2.0: Analytics for Mixed Data Sets
  Complementary to EDW
  Integration of new data types (unstructured, dark data)
  Mainly LOB-oriented
  Understand Why It Happened
Big Data 3.0: Federated Big Data Lake
  Collect and store data
  Bring the analytical tools to the data
  Agile, service-oriented (aaS) model & architecture
  Determine What Will Happen

What Exactly Is Big Data?
Any data set that cannot be processed with traditional systems
Traditional and emerging data types: Structured Data; Unstructured Data; Public Records; Social Networks, UGC; Dark Data; Internet of Things; Location Data


Big Data is a big problem
Increasing silos, management and IT complexity
  Large volume
  Many sources
  Rapid growth
  IT complexity
[Diagram labels: Traditional, Emerging, Structured Data, Unstructured Data, Dark Data, Public Records, Social Networks/UGC, Internet of Things, Location Data, Sources, Hadoop]

Data Lake Foundations
Scale-out storage for traditional and emerging workloads
  No copying
  Simplification
  No data triplication
  Faster insights
[Diagram labels: Traditional, Emerging, Structured Data, Unstructured Data, Dark Data, Public Records, Social Networks/UGC, Internet of Things, Location Data, Data Lake, Sources, Shared Storage, Hadoop]

Why EMC Scale-out for SAS Grid
Grid Compute:
  Scale seamlessly as users, data and work grow
  Utilize resources more efficiently
  Consolidate silos of redundant SAS HW/SW into a shared services model
  Improve job scheduling & performance via parallelization
  Maintain, upgrade & reconfigure the SAS environment non-disruptively, without downtime
Scale-Out Storage:
  Independently scale compute, SAS WORK (XtremIO) and SAS DATA (Isilon) performance & capacity
  Highly efficient data protection (Isilon) and inline data services (XtremIO) for maximum efficiency and workflow
  Optimized for high concurrency of throughput (sequential & random)
  Non-disruptive operations, dynamic scale-out, enterprise data protection & disaster recovery
Your Data Lake Foundation

SAS Grid - Storage Requirements
SAS DATA:
  Repository for all incoming unstructured/structured data
  Requires a shared file system
  Read/write ratios depend on the SAS jobs
  Generally requires 50 MB/sec per stream
  All processed results and data cubes are archived here
SAS WORK:
  Temporary/intermediate files created by SAS DATA
  Highly throughput- and I/O-dependent (~3-6 GB/second)
  Write intensive; large block sequential (or random)
  Large reporting queue of jobs; fed back to SAS DATA
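The per-stream guideline above lends itself to a quick sizing check. The following is a minimal sketch, not SAS or EMC tooling: it multiplies a hypothetical number of concurrent SAS DATA streams by the 50 MB/sec figure from the slide. The function name, the example stream counts and the decimal conversion (1 GB = 1000 MB) are assumptions made for illustration.

```python
# Rough SAS DATA sizing from the slide's "50 MB/sec per stream" guideline.
# Assumptions: stream counts are hypothetical; 1 GB is taken as 1000 MB.

PER_STREAM_MB_S = 50  # guideline quoted on the slide


def required_data_throughput_mb_s(streams: int,
                                  per_stream_mb_s: int = PER_STREAM_MB_S) -> int:
    """Aggregate SAS DATA throughput needed for `streams` concurrent streams."""
    if streams < 0:
        raise ValueError("streams must be non-negative")
    return streams * per_stream_mb_s


if __name__ == "__main__":
    for streams in (20, 60, 120):  # hypothetical concurrency levels
        mb_s = required_data_throughput_mb_s(streams)
        print(f"{streams} streams -> {mb_s} MB/sec ({mb_s / 1000:.1f} GB/sec)")
```

Under these assumptions, 60 concurrent streams would call for about 3 GB/sec of aggregate SAS DATA throughput. Real requirements depend on the job mix, as the slide notes.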

XtremIO's Unique Architecture
Consistent Predictable Performance + Efficiency
SOFTWARE-DEFINED SCALE OUT: Linear Scale IOPS, Bandwidth & Capacity
DATA CENTER SERVICES: HA/BC, App Management, Converged Infrastructure
METADATA ENGINE: Consistent sub-1 ms latency
INLINE AND UNSTOPPABLE DATA SERVICES: Data Reduction Efficiencies, In-Memory Metadata
Services and integrations shown: Self Service Provisioning and Orchestration; Validated Reference Architectures; Intelligent vSphere VAAI Integration; Continuous Data Protection and Disaster Recovery; Storage Resource Management; Enterprise Multi Pathing; Converged Infrastructure VSI, DB, VDI; 2-3 Site Continuous Availability; Thin Provisioning; Flash Data Protection; Database Consistent Snapshot Management; Virtualization Management Integration (VMW & MS); Deduplication; Encryption; EMC Storage Analytics VMware vRealize Ops; Compression; Writeable Copies

XtremIO Copy Services for SAS
DEV/TEST Efficiency, Checkpoint Protection, On-demand Backups
100% IN-MEMORY: Any topology; Instant creation; Instant deletion
100% SPACE EFFICIENT: No space reservations; No metadata bloat
100% PERFORMANCE: Identical read IOPS; Identical write IOPS; Identical latency
100% OPTIMIZED: Identical data services; Always on, always inline
INCREDIBLE SCALE: Instant application clones to petabyte scale
UNMATCHED: Use XtremIO where all-flash arrays were never before viable
Copyright 2014 EMC Corporation. All rights reserved.

On-demand Operations for SAS
Delivered via XtremIO's Unique In-Memory Copy Services
LIFECYCLE TEST BED for DEVELOPMENT & QA:
  Develop with no impact to SAS jobs
  Move new code to Prod in real-time
  Remove risks with new feature updates
  [Diagram: SAS File Systems copied to TEST (Writeable), QA (Writeable), Training (Readable)]
ARRAY-BASED CHECKPOINTS, BACKUP & RECOVERY:
  [Timeline: SAS Job #1 with checkpoints at 9:00, 10:00, 11:00, 12:00, 18:00]
  Preserve hours/days of SAS processing
  No overhead/impact to SAS jobs
  Protect SAS Jobs 24x7 vs. nights/weekends
  Readable copies feed backups to Data Domain
Difficult using Traditional Storage
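To illustrate why frequent array-based checkpoints preserve hours of SAS processing, the sketch below picks the most recent checkpoint before a failure and reports how much work would have to be rerun. It is a hypothetical illustration only: it reads the times on the slide's timeline (9:00 to 18:00) as checkpoint times, and the function names and failure times are assumptions, not part of any XtremIO interface.

```python
# Hypothetical illustration: choose the latest checkpoint at or before a failure.
# Assumption: the slide's timeline marks (9:00 ... 18:00) are checkpoint times.
from bisect import bisect_right

CHECKPOINTS = ["09:00", "10:00", "11:00", "12:00", "18:00"]


def to_minutes(hhmm: str) -> int:
    """Convert an 'HH:MM' string to minutes since midnight."""
    hours, minutes = hhmm.split(":")
    return int(hours) * 60 + int(minutes)


def rollback_point(failure: str, checkpoints=CHECKPOINTS):
    """Return (checkpoint 'HH:MM', minutes of work to rerun), or None if
    the failure happened before the first checkpoint."""
    marks = sorted(to_minutes(c) for c in checkpoints)
    failed_at = to_minutes(failure)
    idx = bisect_right(marks, failed_at)  # number of checkpoints <= failure time
    if idx == 0:
        return None
    chosen = marks[idx - 1]
    return f"{chosen // 60:02d}:{chosen % 60:02d}", failed_at - chosen


if __name__ == "__main__":
    print(rollback_point("11:40"))  # ('11:00', 40)
    print(rollback_point("08:30"))  # None
```

With hourly checkpoints, a failure at 11:40 costs 40 minutes of reprocessing rather than a full rerun of the job.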

Isilon Scale-Out Data Lake
Single Storage Pool For File Data Consolidation
  Shared storage (replacing silos)
  Enterprise security (replacing inconsistent security)
  Multi-protocol access
  Faster time to insights
[Diagram labels: Files, Web, Archive, Report, Mobile, Analyze]

Redefining SAS Agility & Scalability
XtremIO & Isilon Scale-out = DATA LAKE FOUNDATION
1. On-demand ETL Workflow: On-demand from production & loading into staging area; snapshots offload Prod
2. Run Time & Wall Clock Savings: ~30% more jobs run/day; uncover hours of wall-clock savings; ZERO storage tuning; checkpoint protection
3. Lifecycle Test Bed: Dev/Test/QA in parallel to SAS jobs; move code to production in real-time
4. Architecture & OPEX Flexibility: XtremIO + Isilon = workflow partitioning; 20% Core/CPU recovery; on-demand backup & RPO; >8:1 Power/Cooling/Space savings

SAS Grid EMC Storage Architecture
Dataflow and Architecture Flexibility
[Diagram: Data Sources, Scale-out Storage, SAS Grid Compute and SAS Users, with FC (SAN), IP (NFS) and LAN/WAN connectivity and Consolidated Backup/Recovery. Six numbered steps trace the flow: (1) data is loaded into the landing zone; (2) SAS DATA is shared via the grid (via Isilon); (3) SAS users start jobs; (4) the grid reads/writes WORK files on XtremIO; (5) users receive finished jobs and final DATA sets; (6) future growth: add new users on demand by scaling out Grid/Isilon/XtremIO.]

Automotive Manufacturer (1 of 2)
XtremIO Redefines Performance, Efficiency & Protection
CHALLENGE:
  Critical warranty & recall analytics application; extremely storage bound at 2,000 reports/day (300 users running ad-hoc and daily reports)
  Traditional storage bottleneck due to de-staging of large block writes
  Data sets accidentally deleted by users, forcing job reruns
SOLUTION: EMC XtremIO (4x10TB cluster); EMC PowerPath
APPLICATION(S): SAS Analytics (Partitioned Data Sets)
XtremIO Workload Results:
  3:1 data reduction (dedupe & compression)
  ~1.08 ms latency (during peak processing hours)
  40-146 KB I/O size
  3.5 GB/second (32% utilized)
RESULTS:
  30% improvement in jobs run per day with ZERO storage tuning and management required
  Removed cyclical reporting barriers during month & quarter end
  Copy services provided real-world instantaneous recovery of data sets
  Identified additional I/O locking problems at the server, opening the way to further throughput gains (bottleneck shifted to O/S and application layer)

Automotive Manufacturer (2 of 2)
XtremIO Transactional Analysis During Peak Processing
[Chart: IOPS] Peak IOPS = 85K (compute bound)
[Chart: Response Time (ms)] Average Latency = 1.08 ms
[Chart: MB/sec] Bandwidth = 3.5 GB/sec (32% utilized)
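The bandwidth figure above can be turned into a rough headroom estimate. The sketch below is back-of-the-envelope arithmetic under one stated assumption: that "32% utilized" means the observed 3.5 GB/sec is 32% of the cluster's available bandwidth. The slide does not define the metric, so the result is illustrative rather than a vendor specification. The function name is hypothetical.

```python
# Back-of-the-envelope headroom estimate from the slide's peak figures.
# Assumption: "32% utilized" is observed bandwidth / available bandwidth.


def implied_bandwidth_ceiling_gb_s(observed_gb_s: float, utilization: float) -> float:
    """Bandwidth ceiling implied by an observed rate and a utilization fraction."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in the range (0, 1]")
    return observed_gb_s / utilization


if __name__ == "__main__":
    observed = 3.5      # GB/sec, from the slide
    utilization = 0.32  # 32% utilized, from the slide
    ceiling = implied_bandwidth_ceiling_gb_s(observed, utilization)
    print(f"Implied ceiling: {ceiling:.1f} GB/sec")
    print(f"Unused headroom: {ceiling - observed:.1f} GB/sec")
```

Under that assumption the implied ceiling is roughly 10.9 GB/sec, which is consistent with the slide's note that the workload was compute bound rather than storage bound at peak.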

SAS Performance Testing on XtremIO
Mixed Analytics SAS Workloads Testing Results*
  3x better run-time savings; wall-clock time reduced by 1 hour
  ZERO array tuning required (major time savings)
  Only all-flash array that kept all data services (dedupe, compression, etc.) active throughout testing
  Exceeded concurrent/mixed benchmarks for SAS DATA & SAS WORK on a config limited to 25% of XtremIO's capable bandwidth and IOPS
"The EMC XtremIO all-flash array can be extremely beneficial for many SAS Workloads. Testing has shown it can significantly eliminate application IO latency, providing improved performance." - SAS Performance Engineering
*Based on 2x 10TB X-Bricks (small cluster); pushing 90% of max bandwidth

In Conclusion
  SAS is key to the Federated Big Data Lake (FBDL)
  Scale-out storage platforms complement SAS Grid and enable a real-time shared resource model
  Isilon = Data Integration Platform (SAS & HDFS)
  XtremIO delivers more than just performance
  Unique copy services deliver new capabilities
  Formal RA with SAS on Isilon and XtremIO being discussed

Questions?