ebay Storage, From Good to Great



Similar documents
Solid State Storage in the Evolution of the Data Center

Flash 101. Violin Memory Switzerland. Violin Memory Inc. Proprietary 1

Scaling from Datacenter to Client

The next step in Software-Defined Storage with Virtual SAN

Accelerating Real Time Big Data Applications. PRESENTATION TITLE GOES HERE Bob Hansen

The Flash Transformed Data Center & the Unlimited Future of Flash John Scaramuzzo Sr. Vice President & General Manager, Enterprise Storage Solutions

ZD-XL SQL Accelerator 1.5

The Future of Software-Defined Storage What Does It Look Like in 3 Years Time? Richard McDougall, VMware, Inc

Flash Use Cases Traditional Infrastructure vs Hyperscale

Data Center Solutions

Accelerating I/O- Intensive Applications in IT Infrastructure with Innodisk FlexiArray Flash Appliance. Alex Ho, Product Manager Innodisk Corporation

Deploying Flash- Accelerated Hadoop with InfiniFlash from SanDisk

Accelerating Datacenter Server Workloads with NVDIMMs

Data Center Solutions

Hadoop on the Gordon Data Intensive Cluster

How it can benefit your enterprise. Dejan Kocic Netapp

Data Center Storage Solutions

Solid State Technology What s New?

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

VMware Software-Defined Storage & Virtual SAN 5.5.1

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Performance Beyond PCI Express: Moving Storage to The Memory Bus A Technical Whitepaper

Getting the Most Out of Flash Storage

Building All-Flash Software Defined Storages for Datacenters. Ji Hyuck Yun Storage Tech. Lab SK Telecom

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software

SALSA Flash-Optimized Software-Defined Storage

A Close Look at PCI Express SSDs. Shirish Jamthe Director of System Engineering Virident Systems, Inc. August 2011

The Data Placement Challenge

Enabling High performance Big Data platform with RDMA

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA

Storage Architectures for Big Data in the Cloud

How To Store Data On A Server Or Hard Drive (For A Cloud)

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

NAND Flash Architecture and Specification Trends

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Building a Flash Fabric

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Big Data: A Storage Systems Perspective Muthukumar Murugan Ph.D. HP Storage Division

The Technologies & Architectures. President, Demartek

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

SMB Direct for SQL Server and Private Cloud

VMware Software-Defined Storage Vision

Building a Scalable Storage with InfiniBand

Realizing the next step in storage/converged architectures

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

Software Defined Microsoft. PRESENTATION TITLE GOES HERE Siddhartha Roy Cloud + Enterprise Division Microsoft Corporation

Enabling the Flash-Transformed Data Center

Open Source for Cloud Infrastructure

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

A Study of Application Performance with Non-Volatile Main Memory

Microsoft Hybrid Cloud IaaS Platforms

Hyperscale Use Cases for Scaling Out with Flash. David Olszewski

RFP-MM Enterprise Storage Addendum 1

Software-defined Storage Architecture for Analytics Computing

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

10th TF-Storage Meeting

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

Memory Channel Storage ( M C S ) Demystified. Jerome McFarland

Deploying Ceph with High Performance Networks, Architectures and benchmarks for Block Storage Solutions

Benchmarking Cassandra on Violin

OpenStack Benelux. Dan Chester. Seagate Cloud Systems & Solutions

Unstructured Data Accelerator (UDA) Author: Motti Beck, Mellanox Technologies Date: March 27, 2012

Zadara Storage Cloud A

Intel RAID SSD Cache Controller RCS25ZB040

LSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Flash Performance for Oracle RAC with PCIe Shared Storage A Revolutionary Oracle RAC Architecture

Benefits of Solid-State Storage

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

ioscale: The Holy Grail for Hyperscale

Installing Hadoop over Ceph, Using High Performance Networking

Software-Defined Storage & VMware Virtual SAN

SLIDE 1 Previous Next Exit

Storage: The Key to Global Transformation of IT Service Delivery. Jay Prassl, VP Marketing - SolidFire

Make A Right Choice -NAND Flash As Cache And Beyond

Maxta Storage Platform Enterprise Storage Re-defined

Software Define Storage (SDs) and its application to an Openstack Software Defined Infrastructure (SDi) implementation

System Architecture. In-Memory Database

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

NetApp Confidential Information, not for distribution

Trends driving software-defined storage

Performance Analysis of Flash Storage Devices and their Application in High Performance Computing

Transcription:

ebay Storage, From Good to Great Farid Yavari Sr. Storage Architect - Global Platform & Infrastructure September 11,2014

ebay Journey from Good to Great 2009 to 2011 TURNAROUND 2011 to 2013 POSITIONING FOR THE FUTURE 2013 to 2015 CAPITALIZE ON THE OPPORTUNITY 2

From Storage Perspective 20% Y/Y OLTP Growth 300% Y/Y Analytics Growth 3

Size of the Managed Infrastructure SAN ~ FC OLTP environment 80PB NAS ~ 6PB Object Store ~1EB by 2015 Analytics environment ~270PB ~130 enterprise SAN/NAS/FLASH storage arrays in 3 major DCs ~1.4mil peak hour IOPS in SAN environment Thousands of servers with external storage 4

Storage Ecosystems OLTP Cloud (C3) Analytics / Big Data NoSQL FC/NFS à ISCSI, Object Store Centralized Storage SQL transactional > 16TB/U à 150TB/RU ~80PB ISCSI, Block + Object Store Local attached à external Storage KVM Openstack Cinder (5PB) Swift >20TB/U à 120TB/U HDFS à Object Store Local Attached à Disaggregate compute (Map Reducers) 128TB/U à 256TB/U Uneven tiering on Flash Swift object store ~270PB à ~500PB ISCI Block MongoDB Local Attached à external storage Cinder (10PB) Cassandra Couch Base Supports mobile apps International Conference on Parallel Processing 5-2014 5

2017 Vision Flash Everywhere Memory Class Storage emerging Storage Density Scales to 1Pb/RU Storage networking moves to Ethernet Software Defined Everything Hyperscale OpenStack Exabytes of Analytics 2 Flash Tiers: Performance Flash and Big Data Flash 6

Foundation of an Elastic Infrastructure Definition: An infrastructure that can spawn, destroy, grow, shrink and move processes dynamically and efficiently within and across data centers. Automated Control Plane Resource Pool Compute Memory Storage Low latency, High bandwidth interconnect Traffic Management PCI Compliance Security QoS 7

Key Technologies to Enable an Elastic Infrastructure Control Plane Virtualization / Containers/ Hardware Orchestration of infrastructure resources Normalization of resources Resource Pool High Speed Networking (10Gbe, 40Gbe, 100Gbe, beyond) RDMA enabled (routable layer3) Lossless flow control Memory Class Storage New Media beyond Virtical Nand Traffic Virtual Lan Access Control Image Credit: Open Stack 8

Key Initiatives to Enable an Elastic Infrastructure Separation of Storage and Compute - Hadoop use case Software defined storage, software defined network Cloud, SLA, OLA based services Standardization Automation Show/Chargeback Self Service 9

Shifting Paradigm of Storage Tech Bus BW Latency Power 2014 2 yr 3 yr 4 yr HDD (LFF) SSD (SFF) SAS/ SATA SAS/ SATA 600MB/ s 1.2GB/s 600MB/s 1.2GB/s 3-12ms 6-15 W 5-6TB 7-8TB 10TB? 0.3-0.8ms 2-12 W 4-8TB 8-16TB 24TB 36-48TB Flash PCIE 2GB/s 3GB/s 2µs - 150µs 25 W 1-16TB 16TB+ Storage Class Memory PCIE 2+GB/s 1µs - 40µs 100 s ns 8-24TB 24TB+ Information based on Currently available products on the market and industry roadmap information 10

Big Data Flash Design Criteria Merely beat disk in IOPs per TB Heavy read workload, write seldom $.30/Gb Target 30 day power off retention and < 200ms power on response 6Gb/s throughput 8PB in a rack Hadoop Requirements Tiered replicas Abstract storage: Data Nodes separated from Map Reducers Line Rate Networking 11

Hadoop Disaggregated Storage Model Compute Compute... MR R1 R2 R3 Compute Flash Client HDD Client HDD Compute STORAGE Firewall WAN 12

Region Specific Analytics Model Source Of Truth Company wide global Analytics data Two replica local Analytics data R1 APAC R2 R3 Americas R1 R2 Two replica local Analytics data EMEA R1 R2 Two replica local Analytics data 13

Storage Class Memory ReRAM/Memrister High Storage Density Low Power Memory Class latency (~100ns) Higher Endurance Cost < Flash Standard CMOS Long Scalability Roadmap Excellent Retention Phase Change Memory (PCM) Moderate Density High Power Higher write latency Limited Endurance Cost < DRAM Non-CMOS Unknown scalability Write Disturbance Spintronics (STT-RAM) Low Density High Power Lower Latency Higher Endurance Cost > DRAM Non-CMOS Limited scalability Good Retention 14

The Challenges The Opportunities 1. Storage growth in areas not traditionally designed for solid state such as Big Data will cause scaling challenges. How do we realize the benefits of flash while controlling the costs. 2. Storage Systems not understanding Flash 3. Software not understanding Flash 4. Capex Gap between technologies can make financial acceptance difficult. 5. Aggregated resources cause greater utilization imbalance 6. When will Flash need to pass the baton to Storage Class Memory. 1. Flash designed with Big Data in mind will enable this technology to address the scaling and cost issues. 2. Scalable Software Defined Storage multi-rack ecosystems are well suited to data placement on flash vs. RAID. Tiering and metadata enhancements will introduce flash in more areas. 3. Gradually, applications, kernels and filesystems are beginning to treat flash as flash. 4. The Capex Gap can be met with Opex and comparative advantages. Quantify it! Additionally, Capex gap will close. 5. Networking advances will enable low cost storage disaggregation. (RDMA/IP, 40Gbps+, PFC) 6. Work is being done on optimizing for non-volatile memory but this will be an evolution. Doing this will allow for low cost memory scaling. 15

Q&A