Facebook Storage Tiers



Similar documents
Intel RAID SSD Cache Controller RCS25ZB040

Low-cost

How To Scale Myroster With Flash Memory From Hgst On A Flash Flash Flash Memory On A Slave Server

HP recommended configuration for Microsoft Exchange Server 2010: HP LeftHand P4000 SAN

UCS M-Series Modular Servers

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

IBM System x SAP HANA

Motherboard- based Servers versus ATCA- based Servers

GV-Hot Swap Backup Center System (Rev. B) 3U, 16 / 8-Bay

APACHE HADOOP PLATFORM HARDWARE INFRASTRUCTURE SOLUTIONS

MySpace Uses Fusion-Powered I/O to Drive Greener Datacenters

Annex 9: Private Cloud Specifications

Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

The Hardware Dilemma. Stephanie Best, SGI Director Big Data Marketing Ray Morcos, SGI Big Data Engineering

Boost Database Performance with the Cisco UCS Storage Accelerator

Big Fast Data Hadoop acceleration with Flash. June 2013

Very Large Enterprise Network Deployment, 25,000+ Users

Cisco SmartPlay Select. Cisco Global Data Center Promotional Program

HUAWEI TECHNOLOGIES CO., LTD. HUAWEI FusionServer X6800 Data Center Server

Very Large Enterprise Network, Deployment, Users

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

SUN ORACLE DATABASE MACHINE

VTrak SATA RAID Storage System

Terms of Reference Microsoft Exchange and Domain Controller/ AD implementation

QUESTIONS & ANSWERS. ItB tender 72-09: IT Equipment. Elections Project

Overview: X5 Generation Database Machines

Data Sheet FUJITSU Server PRIMERGY CX400 M1 Multi-Node Server Enclosure

Brainlab Node TM Technical Specifications

Optimized dual-use server and high-end workstation performance

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

HP ProLiant DL580 Gen8 and HP LE PCIe Workload WHITE PAPER Accelerator 90TB Microsoft SQL Server Data Warehouse Fast Track Reference Architecture

Hyper ISE. Performance Driven Storage. XIO Storage. January 2013

HP recommended configurations for Microsoft Exchange Server 2013 and HP ProLiant Gen8 with direct attached storage (DAS)

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

2009 Oracle Corporation 1

Evaluation Report: HP Blade Server and HP MSA 16GFC Storage Evaluation

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Data Sheet FUJITSU Server PRIMERGY CX400 S2 Multi-Node Server Enclosure

Cisco WAE Deployed with Cisco ACNS: Product Function Matrix. Two 10/100/1000BASE-T. Two 10/100/1000BASE- T

Inge Os Sales Consulting Manager Oracle Norway

HP Cloudline Overview

SUN HARDWARE FROM ORACLE: PRICING FOR EDUCATION

Dell Virtual Remote Desktop Reference Architecture. Technical White Paper Version 1.0

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

HP high availability solutions for Microsoft SQL Server Fast Track Data Warehouse using SQL Server 2012 failover clustering

Lenovo Database Configuration for Microsoft SQL Server TB

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

Accelerate SQL Server 2014 AlwaysOn Availability Groups with Seagate. Nytro Flash Accelerator Cards

Minimum Hardware Specifications Upgrades

NOTICE ADDENDUM NO. TWO (2) JULY 8, 2011 CITY OF RIVIERA BEACH BID NO SERVER VIRTULIZATION/SAN PROJECT

præsentation oktober 2011

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Microsoft Private Cloud Fast Track Reference Architecture

Data Sheet FUJITSU Server PRIMERGY CX420 S1 Out-of-the-box Dual Node Cluster Server

Exadata Database Machine

VxRACK : L HYPER-CONVERGENCE AVEC L EXPERIENCE VCE JEUDI 19 NOVEMBRE Jean-Baptiste ROBERJOT - VCE - Software Defined Specialist

Cisco Prime Home 5.0 Minimum System Requirements (Standalone and High Availability)

SUN ORACLE EXADATA STORAGE SERVER

Cisco MCS 7825-H3 Unified Communications Manager Appliance

Reference Architecture - Microsoft Exchange 2013 on Dell PowerEdge R730xd

PARALLELS CLOUD STORAGE

HP recommended configuration for Microsoft Exchange Server 2010: ProLiant DL370 G6 supporting GB mailboxes

Cisco MCS 7825-H2 Unified CallManager Appliance

Benchmarking Cassandra on Violin

Parallels Cloud Storage

A Smart Investment for Flexible, Modular and Scalable Blade Architecture Designed for High-Performance Computing.

FUJITSU Enterprise Product & Solution Facts

SUN ORACLE DATABASE MACHINE

Appendix 3. Specifications for e-portal

SMB Direct for SQL Server and Private Cloud

REPLIES TO PRE BID REPLIES FOR REQUEST FOR PROPOSAL FOR SUPPLY, INSTALLATION AND MAINTENANCE OF SERVERS & STORAGE

Brocade and EMC Solution for Microsoft Hyper-V and SharePoint Clusters

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

1. Specifiers may alternately wish to include this specification in the following sections:

Improving IT Operational Efficiency with a VMware vsphere Private Cloud on Lenovo Servers and Lenovo Storage SAN S3200

EMC VMAX3 FAMILY VMAX 100K, 200K, 400K

Microsoft Exchange 2010 on Dell Systems. Simple Distributed Configurations

HP 85 TB reference architectures for Microsoft SQL Server 2012 Fast Track Data Warehouse: HP ProLiant DL980 G7 and P2000 G3 MSA Storage

DD670, DD860, and DD890 Hardware Overview

Cisco Unified Communications on the Cisco Unified Computing System

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

Servers, Clients. Displaying max. 60 cameras at the same time Recording max. 80 cameras Server-side VCA Desktop or rackmount form factor

Virtualization of the MS Exchange Server Environment

Enterprise Network Deployment, 10,000 25,000 Users

How To Build A Cisco Ukcsob420 M3 Blade Server

INDIAN INSTITUTE OF TECHNOLOGY KANPUR Department of Mechanical Engineering

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

Enabling the Flash-Transformed Data Center

Data Sheet FUJITSU Storage ETERNUS CD10000

Cisco 7816-I5 Media Convergence Server

or later or later or later or later

Transcription:

Facebook Storage Tiers How the fleet is divided: Facebook runs three main storage tiers: Type III MySQL Database (user data) Type IV Hadoop (site activity analytics and logs, messages) Type V Haystack (Photo, video, email attachments) All storage tiers are currently based on standard 2U12HDD OEM solutions. (Place image of FS-12TY or DL165Gwhatever in here)

Type III Database The Foundation MySQL running InnoDB engine. Type III System Components Main system bottleneck is HDD IOPS. FusionIO PCI-E Flash card provides secondary cache to reduce IOPS. DB suffers from moving hotspot, where a small number of machines get queried intensely compared to overall site activity. CPU RAM Storage Flash NIC Chassis Dual Intel Xeon L5630 144GB 8x1TB SATA Enterprise HDD in RAID10 array 720GB Fusion IO 1Gb 2U12HDD Redundant PSU

Type IV Hadoop The Warehouse Data is stored on HDFS, a distributed file system, servers are used for intense data crunching computation as well as logging large data sets used for the analysis. Type IV System Components CPU RAM Storage Dual Intel Xeon X5650 48GB 12x2TB SATA Enterprise HDD Facebook deployment of Hadoop has 2.2x redundancy. No RAID. No explicit system bottlenecks. Balance is maintained between CPU and HDD usage. NIC Chassis 1Gb 2U12HDD Redundant PSU

Type V Haystack The Vault Custom built file system. Large files made from thousands of appended photos with indices stored in RAM, guaranteeing single disk IOP for any photo retrieval. Main concerns are HDD rebuild and full system recovery time. (Rebuilds approaching 1 day, full system recovery 1 week). Type IV System Components CPU RAM Storage NIC Chassis Intel X5630 16GB 12x2TB SATA Enterprise HDD in RAID6 array 1Gb 2U12HDD Redundant PSU

What is Knox? Project Knox goals: Design an optimized storage server for Facebook s storage heavy applications : Haystack, Video, Titan and Hadoop Leverage already existing Open Compute Server, Power Supply and Rack solutions. Create a design that will fit the varying requirements of the different applications. Ensure an assembly, integration and maintenance friendly design. Accommodate potential change in fleet or system architecture.

Introducing Knox A platform approach to storage Knox enables building storage servers with varying compute to storage ratios using the same physical building block. PCI-E interface is used to interconnect the compute unit to the storage unit. PCI-E scalability allows adding either more compute or storage nodes independently of each other.

Open Compute System Components Open Compute System Components 4 system fans + air duct 6HDD slots OC PSU 277 VAC w/ 48V DC backup Open Compute Intel motherboard 1.5U open chassis

Knox System Components Knox System Components 4X 120mm system fans Dual 25 hot plug HDD cabinets Dual 2xRAID+Expander controller cards Freedom PSU 277 VAC w/ 48V DC backup (Redundant power supply support) 12V/5V (20A each) power connector & cable 4.5U open chassis

Freedom Knox Type IV Rack Config Type IV Rack Configura:on Unit Height 7.5U (2*1.5U OC + 4.5U Knox) Units per column: 5 Compute nodes per column: 20 HDDs per column: 250

Freedom Knox Type V Rack Config Type V Rack Configura:on Unit Height 6U (1.5U OC + 4.5U Knox) Units per column: 7 Compute nodes per column: HDDs per column: 350 7

Knox Architectural Advantages Enables designing an optimized storage server configuration with minimal new hardware development. Leverages already existing Open Compute server and its supporting infrastructure. Compute to storage ratios aren t fixed - they can be adjusted during integration or after deployment for optimal hardware utilization. The storage building blocks are identical across various tiers, only the number of units per compute node, HDD capacity and the Knox storage controller board stuffing option vary. Enables redeployment of compute (OC server) or storage (Knox RBOD) nodes to other applications after production.

Knox Architectural Advantages Enables future design upgrades to be limited in scope to either compute or storage only. A lot of activity surrounding low power processors for the server market is taking place, and may have a transformative effect on the industry. Adoption for storage alone may save millions in server component cost and annual electricity usage. First step in the direction of a Universal Rack. Capacity planning team desires to develop an infrastructure that will allow adjustment of compute to storage ratios using standard server and storage hardware blocks. Technology to enable this over PCI-E is not mature, but Knox enables a first step in transitioning towards that solution.

Knox Architectural Advantages Multiple benefits to the consolidation of more HDDs behind fewer compute nodes. Hotspot effect in the database tier should be mitigated thanks to larger amount of max IOPs available per system. Database and Haystack tiers aren t CPU bound. Reduce CapEx by reducing the number of compute nodes per TB of storage.

Knox Controller Board Architecture Knox controller board can be configured with different component stuffing options enabling support for multiple configurations using a single PCB solution. Single (Haystack) or dual (Hadoop) controllers depending on tier requirements. Input to the board can be either SAS or PCI-E. Use AC caps to create stub free high speed T-routes.

Knox Controller Board Layout

Knox Chassis

Knox Chassis

Knox Chassis

Knox HDD buckets

Knox HDD buckets

Knox HDD buckets