At-Scale Data Centers & Demand for New Architectures



Similar documents
At-Scale Data Centers & Demand for New Architectures

Enabling the Flash-Transformed Data Center

The Flash Transformed Data Center & the Unlimited Future of Flash John Scaramuzzo Sr. Vice President & General Manager, Enterprise Storage Solutions

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Forward Looking Statements

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

Transforming the Data Center

Advances in Flash Memory Technology & System Architecture to Achieve Savings in Data Center Power and TCO

ebay Storage, From Good to Great

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

Enabling High performance Big Data platform with RDMA

Ceph Optimization on All Flash Storage

Deploying Flash- Accelerated Hadoop with InfiniFlash from SanDisk

Building All-Flash Software Defined Storages for Datacenters. Ji Hyuck Yun Storage Tech. Lab SK Telecom

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

Product Spotlight. A Look at the Future of Storage. Featuring SUSE Enterprise Storage. Where IT perceptions are reality

Scalable Architecture on Amazon AWS Cloud

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

RED HAT STORAGE PORTFOLIO OVERVIEW

Mit Soft- & Hardware zum Erfolg. Giuseppe Paletta

Addressing Storage Management Challenges using Open Source SDS Controller

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Dell Reference Configuration for Hortonworks Data Platform

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

Zadara Storage Cloud A

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

ioscale: The Holy Grail for Hyperscale

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Arif Goelmhd Goelammohamed Solutions Hyperconverged Infrastructure: The How-To and Why Now?

SimpliVity OmniStack with the HyTrust Platform

IBM ELASTIC STORAGE SEAN LEE

Traditional v/s CONVRGD

Dell Compellent Storage Center SAN & VMware View 1,000 Desktop Reference Architecture. Dell Compellent Product Specialist Team

Deploying Ceph with High Performance Networks, Architectures and benchmarks for Block Storage Solutions

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director

Successfully Deploying Alternative Storage Architectures for Hadoop Gus Horn Iyer Venkatesan NetApp

Product Brochure. Hedvig Distributed Storage Platform Modern Storage for Modern Business. Elastic. Accelerate data to value. Simple.

Product Brochure. Hedvig Distributed Storage Platform. Elastic. Modern Storage for Modern Business. One platform for any application. Simple.

Solid State Storage in the Evolution of the Data Center

NoSQL Data Base Basics

Windows Server 2003 Migration Guide: Nutanix Webscale Converged Infrastructure Eases Migration

Why ClearCube Technology for VDI?

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

Accelerating Cassandra Workloads using SanDisk Solid State Drives

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

Software Define Storage (SDs) and its application to an Openstack Software Defined Infrastructure (SDi) implementation

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise

Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra

Building a Flash Fabric

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

SimpliVity OmniStack with Vormetric Transparent Encryption

Modernizing Hadoop Architecture for Superior Scalability, Efficiency & Productive Throughput. ddn.com

Enterprise Cloud Services HOSTED PRIVATE CLOUD

Hadoop Size does Hadoop Summit 2013

Running Highly Available, High Performance Databases in a SAN-Free Environment

Fabrics that Fit Matching the Network to Today s Data Center Traffic Conditions

HadoopTM Analytics DDN

Nutanix Tech Note. Configuration Best Practices for Nutanix Storage with VMware vsphere

FLOW-3D Performance Benchmark and Profiling. September 2012

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

VMware Software-Defined Storage Vision

SMART SCALE YOUR STORAGE - Object "Forever Live" Storage - Roberto Castelli EVP Sales & Marketing BCLOUD

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

Microsoft Hybrid Cloud IaaS Platforms

Getting Started with Database As a Service on OpenStack

Big data management with IBM General Parallel File System

HP recommended configuration for Microsoft Exchange Server 2010: HP LeftHand P4000 SAN

AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013

GigaSpaces Real-Time Analytics for Big Data

membase.org: The Simple, Fast, Elastic NoSQL Database NorthScale Matt Ingenthron OSCON 2010

Where IT perceptions are reality. Industry Brief. Ride the Next Big Wave of Storage Networking. Featured Technology

[Hadoop, Storm and Couchbase: Faster Big Data]

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

Big Data Trends and HDFS Evolution

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

SUSE Storage. FUT7537 Software Defined Storage Introduction and Roadmap: Getting your tentacles around data growth. Larry Morris

IronPOD Piston OpenStack Cloud System Commodity Cloud IaaS Platforms for Enterprises & Service

Data Sheet FUJITSU Storage ETERNUS CD10000

CDH AND BUSINESS CONTINUITY:

Designing a Cloud Storage System

Big Fast Data Hadoop acceleration with Flash. June 2013

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

Connecting Flash in Cloud Storage

RED HAT CLOUD SUITE FOR APPLICATIONS

Introduction to NetApp Infinite Volume

Extreme Networks: Building Cloud-Scale Networks Using Open Fabric Architectures A SOLUTION WHITE PAPER

THE FUTURE OF STORAGE IS SOFTWARE DEFINED. Jasper Geraerts Business Manager Storage Benelux/Red Hat

Cloud Courses Description

The Zadara Storage Cloud A Validation of its Use Cases and Economic Benefits

EMC Unified Storage for Microsoft SQL Server 2008

Solution Brief Availability and Recovery Options: Microsoft Exchange Solutions on VMware

HGST Object Storage for a New Generation of IT

Hedvig Distributed Storage Platform with Cisco UCS

(R)Evolution im Software Defined Datacenter Hyper-Converged Infrastructure

Keys to Optimizing Your Backup Environment: Tivoli Storage Manager

EMC BACKUP-AS-A-SERVICE

Transcription:

Allen Samuels At-Scale Data Centers & Demand for New Architectures Software Architect, Software and Systems Solutions August 12, 2015 1

Forward-Looking Statements During our meeting today we may make forward-looking statements. Any statement that refers to expectations, projections or other characterizations of future events or circumstances is a forward-looking statement, including those relating to market growth, industry trends, technology transitions and developments in flash pricing and capacity. Actual results may differ materially from those expressed in these forward-looking statements due to factors detailed under the caption Risk Factors and elsewhere in the documents we file from time to time with the SEC, including our annual and quarterly reports. We undertake no obligation to update these forward-looking statements, which speak only as of the date hereof. 2

A Global Leader in Storage Solutions Produce Close to Half of Industry Bit Output Joint Ventures between SanDisk and Toshiba Computing: Enterprise, Client and Retail SSDs Qualified at 5 largest server & 4 largest storage OEMs Approved Supplier to Leading PC Manufacturers SanDisk Supports Open Source Innovation SanDisk is committed to Open Source FOSS is a key part of all of our mega markets of compute, mobile and consumer More on the possibilities of storage at bigdataflash.sandisk.com/ opensource * Gartner NAND Flash Supply & Demand, WW 1Q 13 4Q 15, March 2014. 3

The Data Tsunami is Still Getting Bigger CONTENT REPOSITORIES BIG DATA ANALYTICS MEDIA STREAMING Large containers for long periods with on-demand rapid access Mixed media container, active-archiving, backup, locality of data Hadoop, NoSQL Time-to-Value and Time-to-Insight High read intensive access from billions of edge devices Hi-def video driving even greater demand for capacity and performance 4

We are in the Middle of a Massive Change Big Data is changing our Data Center Architectures Compute-Centric Dataflow-Centric Scalability is the new paradigm Price/Performance trumps Performance You can always get performance from scale 5

Scalable Hardware Rethinking the Rack Rack Scale Architecture Decompose, aggregate and recompose High performance dense fabrics are key enablers Independent scaling of compute, memory, storage and networking Deployment flexibility and configuration Lifecycle management, operation efficiency Rack Today: Shared nothing, redundant systems driven by OEMs Stg I/F Mgmt. Stg I/F Mgmt. Stg I/F Mgmt. Stg I/F Mgmt Stg I/F Mgmt. ToR Switch Top of Rack Switch N/W I/F Flash N/W I/F Flash N/W I/F Flash N/W I/F Flash N/W I/F Flash Disaggregated Rack: Stg I/F Mgmt Flash Shared IO, simplified, clustered systems IO Boxes Wire or Photonics Fabric Pooled Storage Flash N/W I/F ory Pooled Storage: Boot, Disk, Flash Wire or Phtonoics Fabric 6

Scaling the WorkFlow Data Center scale operating system APIs, resource allocation, access control Same problems, different scale OpenStack, VMWare, Mesos, Kubernetes, Microsoft CPS Managing, monitoring, configuring and provisioning at scale Storage Block, file, object, key/value Cinder, Ceph, Gluster, Redis, Cassandra, MongoDB, Couchbase 7

Scalable Storage Systems Many storage scaling paradigms are already proven and deployed Object stores (Swift, etc.) Scale-out block (Ceph, Gluster, SheepDog, etc.) NoSQL/KV store (Cassandra, MongoDB, redis, memcached, etc.) Hadoop file system 8

Trends in Storage Components Performance oriented HDD vanishing Capacity optimized HDD thrives (SMR, Cloud Drives, etc.) BW/bit shrinks as capacities grow Flash cost reductions continue (3D) Flash diversifies and drives new standards and form factors New NVM technologies (PCM, STT, etc.) still waiting in the wings 9

HDD Based Computation HDD BW roughly constant into the future CPU processing power increasing into the future HDD / CPU ratios increases 2-3x each generation just to stand still HDD-based systems becoming constant $ / transaction No longer cheaper, faster or smaller. Just more capacious. 10

Flash to the Rescue Flash BW/bit is an economic choice BW/$ continues to decrease into the future Flash $ / bit continues exponential decrease $ / Computation for Flash decreases exponentially How can I afford to obtain the benefits of Flash? 11

Best of Both Worlds -- Tiering Primary Tier of erasure coded Flash Secondary Tier of cloud-speed capacity-optimized HDD Easy to sustain BW during inter-tier migrations 12

Primary Tier Design Data Replication with Flash is not best practice Sufficient BW without RAID-0 Overkill for data protection, Flash AFR << HDD AFR Wasteful of $ Erasure Coding is best practice with Flash High coding rates provide comparable protection to replication Overhead rates can be easily be less than 20% (< 1.2x) Recovery times drastically reduced, MTTDL reduced accordingly EC BW increase easily handled @ modern LAN speeds Rack level failure model directly obtainable 13

Building the Secondary (HDD) Tier Ideal use-case for SMR drives SMR Optimized file systems are under development Only 2x replication needed for many situations Some data sets don t justify high MTTDL values Some data sets don t justify mission critical availability Remote placement of one replica satisfies many DR needs 14

Open Source Support for EC and Tiering HDFS doesn t natively implement EC yet HDFS-RAID (Facebook) is available Ceph Object (RGW) does EC today Swift Erasure Coding in beta today Ceph direct HDFS connector does EC with local tiering 15

What s the Status? Current scale-out OSS software are $ inefficient on flash SanDisk Ceph contributions have achieved 7x-10x improvement 1 SanDisk work on Cassandra, MongoDB, and Redis shows 2-3x boost 2 Performance gap translates into significant $ at datacenter scale Proprietary and in-house solutions are filling the gap Fragmentation impedes portable application development 1 Based on internal testing comparing Emperor to Hammer releases using vdbench and fio on a two node cluster (Dell R720 2.8GHz, 2xE5-2680, 64GB DRAM), krbd client systems on Dell R620 (2x E5-2680, 32GB DRAM), 40Gbe interconnect 2 http://www.sandisk.com/assets/docs/sandisk-zetascale-whitepaper.pdf 16

Summary Flash as primary storage is here today, NOT tomorrow We want to work with you! Let s talk! 17

Thank you 2015 SanDisk Corporation. All rights reserved. SanDisk is a trademark of SanDisk Corporation, registered in the United States and other countries. Other brand names mentioned herein are for identification purposes only and may be the trademarks of their respective holder(s). 18